Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
whoisjones
's Collections
General NER training datasets
MastermindEval
MastermindEval
updated
18 days ago
Evaluating reasoning capabilities of LLMs using the game of Mastermind (paper is coming)
Upvote
-
flair/mastermind_35_mcq_random
Viewer
•
Updated
13 days ago
•
37.1k
•
109
flair/mastermind_46_mcq_random
Viewer
•
Updated
13 days ago
•
36.1k
•
121
flair/mastermind_46_mcq_close
Viewer
•
Updated
13 days ago
•
36.1k
•
123
flair/mastermind_24_mcq_random
Viewer
•
Updated
13 days ago
•
30.4k
•
111
flair/mastermind_24_mcq_close
Viewer
•
Updated
13 days ago
•
30.4k
•
113
flair/mastermind_35_mcq_close
Viewer
•
Updated
13 days ago
•
37.1k
•
150
Upvote
-
Share collection
View history
Collection guide
Browse collections