Yujun Zhou

yujunzhou

AI & ML interests

None yet

Recent Activity

upvoted a paper 11 days ago
Can LLMs Guide Their Own Exploration? Gradient-Guided Reinforcement Learning for LLM Reasoning
submitted a paper 11 days ago
Can LLMs Guide Their Own Exploration? Gradient-Guided Reinforcement Learning for LLM Reasoning
updated a model 11 days ago
yujunzhou/SFT_Advanced_Risk_Self_Grading_Qwen3-4B

Organizations

None yet

yujunzhou's collections (1)

EVOL-RL
The models trained with EVOL-RL
  • yujunzhou/EVOL-RL-MATH-Train-Qwen3-4B-Base

    4B • Updated Sep 13 • 4
  • yujunzhou/EVOL-RL-MATH-500-Qwen3-4B-Base

    4B • Updated Sep 13 • 5
  • yujunzhou/EVOL-RL-AIME24-Qwen3-4B-Base

    4B • Updated Aug 17 • 4
  • yujunzhou/EVOL-RL-MATH-Train-Qwen3-8B-Base

    8B • Updated Sep 18 • 7