Space: yentinglin/zhtw-reasoning-eval-leaderboard
Duplicated from open-r1/open-r1-eval-leaderboard

zhtw-reasoning-eval-leaderboard / eval_results
7.04 MB
  • 3 contributors
History: 20900 commits
Latest commit: a9dcad5 (verified) by yentinglin, 9 months ago
Upload eval_results/mistralai/Mistral-Small-24B-Instruct-2501/main/aime24/results_2025-02-14T04-20-38.981835.json with huggingface_hub
  • HuggingFaceTB
    Upload eval_results/HuggingFaceTB/SmolLM2-1.7B-Instruct/main/gsm8k/results_2025-02-12T14-56-55.504908.json with huggingface_hub 9 months ago
  • deepseek-ai
    Upload eval_results/deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B/main/math_500/results_2025-02-12T15-44-45.830026.json with huggingface_hub 9 months ago
  • mistralai
    Upload eval_results/mistralai/Mistral-Small-24B-Instruct-2501/main/aime24/results_2025-02-14T04-20-38.981835.json with huggingface_hub 9 months ago
  • open-r1
    Upload eval_results/open-r1/DeepSeek-R1-Distill-Qwen-1.5-GRPO-v04.11/main-step-000000227/aime24/results_2025-02-13T07-01-04.303384.json with huggingface_hub 9 months ago
  • open-thoughts
    Upload eval_results/open-thoughts/OpenThinker-7B/main/math_500/results_2025-02-10T14-45-37.816597.json with huggingface_hub 9 months ago
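
The upload paths in the commit messages above appear to follow the layout eval_results/<org>/<model>/<revision>/<task>/results_<timestamp>.json. That layout is inferred from the five paths listed here and is not documented by the Space itself. A minimal sketch of splitting such a path into its parts, assuming the layout holds; the names EvalResultPath and parse_result_path are hypothetical and used only for illustration:

```python
from dataclasses import dataclass
from datetime import datetime
from pathlib import PurePosixPath


@dataclass(frozen=True)
class EvalResultPath:
    """Components of one result file path (hypothetical helper type)."""
    org: str
    model: str
    revision: str
    task: str
    timestamp: datetime


def parse_result_path(path: str) -> EvalResultPath:
    """Split a path of the assumed form
    eval_results/<org>/<model>/<revision>/<task>/results_<timestamp>.json
    into its components. Raises ValueError if the path does not match."""
    parts = PurePosixPath(path).parts
    if len(parts) != 6 or parts[0] != "eval_results":
        raise ValueError(f"unexpected layout: {path!r}")
    _, org, model, revision, task, filename = parts

    # Drop the ".json" suffix, then the "results_" prefix.
    stem = PurePosixPath(filename).stem
    prefix = "results_"
    if not stem.startswith(prefix):
        raise ValueError(f"unexpected file name: {filename!r}")
    stamp = stem[len(prefix):]

    # Timestamps in the listing use "-" instead of ":" in the time part.
    timestamp = datetime.strptime(stamp, "%Y-%m-%dT%H-%M-%S.%f")
    return EvalResultPath(org, model, revision, task, timestamp)


if __name__ == "__main__":
    example = (
        "eval_results/mistralai/Mistral-Small-24B-Instruct-2501/main/"
        "aime24/results_2025-02-14T04-20-38.981835.json"
    )
    print(parse_result_path(example))
```

The revision component is not always "main": the open-r1 entry above uses "main-step-000000227", so the sketch treats it as an opaque string rather than parsing it further.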