Yujun Zhou's picture

2 14 1

Yujun Zhou

yujunzhou

·

AI & ML interests

None yet

Recent Activity

upvoted a paper 18 days ago

Can LLMs Guide Their Own Exploration? Gradient-Guided Reinforcement Learning for LLM Reasoning

submitted a paper 18 days ago

Can LLMs Guide Their Own Exploration? Gradient-Guided Reinforcement Learning for LLM Reasoning

updated a model 18 days ago

yujunzhou/SFT_Advanced_Risk_Self_Grading_Qwen3-4B

View all activity

Organizations

None yet

yujunzhou 's models 251

yujunzhou/MATH-TTT-Qwen2.5-Math-7B-Cluster-lam0.8-win500-segCorrect

8B • Updated Jul 21, 2025 • 5

yujunzhou/MATH-TTT-Qwen2.5-Math-7B-Cluster-lam0.8-win500

8B • Updated Jul 21, 2025 • 4

yujunzhou/MATH-TTT-Qwen3-4B-Base-Cluster-lab0.8-win500

4B • Updated Jul 21, 2025 • 2

yujunzhou/MATH-TTT-Qwen2.5-Math-7B-Semantic

8B • Updated Jul 21, 2025 • 4

yujunzhou/MATH-TTT-Qwen2.5-Math-7B-Cluster-lab0.9-win500

8B • Updated Jul 21, 2025 • 4

yujunzhou/MATH-TTT-Qwen2.5-Math-7B-Cluster-lab0.8-win10

8B • Updated Jul 21, 2025 • 4

yujunzhou/MATH-TTT-Qwen3-4B-Base-Cluster-lab0.8-win10-segCorrect

4B • Updated Jul 21, 2025 • 4

yujunzhou/MATH-TTT-Qwen2.5-Math-7B-Cluster-lab0.8-win10-segCorrect

8B • Updated Jul 21, 2025 • 3

yujunzhou/MATH-TTT-Qwen3-4B-Base-Cluster-lab0.8-win10

4B • Updated Jul 21, 2025 • 4

yujunzhou/MATH-TTT-Qwen3-4B-Base-Cluster-lab0.9-win500-segCorrect

4B • Updated Jul 21, 2025 • 4

yujunzhou/qwen3-4B-gsm8k-checkpoint-30

4B • Updated Jul 13, 2025 • 4