Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
2
14
1
Yujun Zhou
yujunzhou
Follow
John6666's profile picture
1 follower
·
0 following
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
18 days ago
Can LLMs Guide Their Own Exploration? Gradient-Guided Reinforcement Learning for LLM Reasoning
submitted
a paper
18 days ago
Can LLMs Guide Their Own Exploration? Gradient-Guided Reinforcement Learning for LLM Reasoning
updated
a model
18 days ago
yujunzhou/SFT_Advanced_Risk_Self_Grading_Qwen3-4B
View all activity
Organizations
None yet
yujunzhou
's models
251
Sort: Recently updated
yujunzhou/MATH-TTT-Qwen2.5-Math-7B-Cluster-lam0.8-win500-segCorrect
8B
•
Updated
Jul 21, 2025
•
5
yujunzhou/MATH-TTT-Qwen2.5-Math-7B-Cluster-lam0.8-win500
8B
•
Updated
Jul 21, 2025
•
4
yujunzhou/MATH-TTT-Qwen3-4B-Base-Cluster-lab0.8-win500
4B
•
Updated
Jul 21, 2025
•
2
yujunzhou/MATH-TTT-Qwen2.5-Math-7B-Semantic
8B
•
Updated
Jul 21, 2025
•
4
yujunzhou/MATH-TTT-Qwen2.5-Math-7B-Cluster-lab0.9-win500
8B
•
Updated
Jul 21, 2025
•
4
yujunzhou/MATH-TTT-Qwen2.5-Math-7B-Cluster-lab0.8-win10
8B
•
Updated
Jul 21, 2025
•
4
yujunzhou/MATH-TTT-Qwen3-4B-Base-Cluster-lab0.8-win10-segCorrect
4B
•
Updated
Jul 21, 2025
•
4
yujunzhou/MATH-TTT-Qwen2.5-Math-7B-Cluster-lab0.8-win10-segCorrect
8B
•
Updated
Jul 21, 2025
•
3
yujunzhou/MATH-TTT-Qwen3-4B-Base-Cluster-lab0.8-win10
4B
•
Updated
Jul 21, 2025
•
4
yujunzhou/MATH-TTT-Qwen3-4B-Base-Cluster-lab0.9-win500-segCorrect
4B
•
Updated
Jul 21, 2025
•
4
yujunzhou/qwen3-4B-gsm8k-checkpoint-30
4B
•
Updated
Jul 13, 2025
•
4
Previous
1
...
7
8
9
Next