Hugging Face
Cheng Wang
LLucass
1 follower · 14 following
https://wangcheng0116.github.io/
WangCheng0116
AI & ML interests
None yet
Recent Activity
upvoted a paper 2 days ago: Model-Task Alignment Drives Distinct RL Outcomes
commented on a paper 2 days ago: Model-Task Alignment Drives Distinct RL Outcomes
updated a model about 2 months ago: LLucass/GRPO-7B
Organizations
LLucass's models (114)
LLucass/R1-Zero-Qwen-7B-Math • Updated Jul 3
LLucass/R1-Zero-Qwen-7B-Math-Gaussian • Updated Jul 3
LLucass/Qwen2.5-Math-7B-GRPO-Numina • Updated Jul 3
LLucass/Gaussian-Qwen2.5-Math-7B-Numina • Updated Jul 3
LLucass/Qwen2.5-Math-7B-GRPO-DeepScaler • Updated Jul 3
LLucass/Gaussian-Qwen2.5-Math-7B-DeepScaler • Updated Jul 3
LLucass/Gaussian-Qwen2.5-7B-DeepScaler • Updated Jul 3
LLucass/Qwen2.5-7B-GRPO-DeepScaler • Updated Jul 3
LLucass/Gaussian-Qwen2.5-7B • 8B • Updated Jul 3 • 7
LLucass/Qwen2.5-7B-GRPO • 8B • Updated Jul 3 • 6
LLucass/GRPO_0.2_0.28 • Text Generation • 2B • Updated Jul 2 • 8
LLucass/Gaussian_0.2_0.28 • Text Generation • 2B • Updated Jul 2 • 7
LLucass/Gaussian_0.2_0.2 • Text Generation • 2B • Updated Jul 2 • 7
LLucass/GRPO_0.2_0.2 • Text Generation • 2B • Updated Jul 2 • 7
LLucass/Ours_Dr_Gaussian • Text Generation • 2B • Updated Jul 2 • 7
LLucass/Gaussian • Text Generation • 2B • Updated Jul 2 • 7
LLucass/Ours-2sigma • 2B • Updated Jul 2 • 7
LLucass/Ours_Dr • Text Generation • 2B • Updated Jul 2 • 7
LLucass/DrGRPO • Text Generation • 2B • Updated Jul 2 • 8
LLucass/Ours • Text Generation • 2B • Updated Jul 1 • 7
LLucass/GRPO • Text Generation • 2B • Updated Jul 1 • 6
LLucass/GRPO-Qwen7B-s1 • Updated Jul 1
LLucass/GRPO-DAPO • Updated Jul 1
LLucass/Bounded_PRESS_GRPO_0.2_beta_0.01_n_generations_12 • 2B • Updated Jun 25 • 5
LLucass/Bounded_PRESS_GRPO_2.0_beta_0.01_n_generations_12 • 2B • Updated Jun 24 • 5
LLucass/Bounded_PRESS_GRPO_1.0_beta_0.01_n_generations_12 • 2B • Updated Jun 24 • 5
LLucass/Bounded_PRESS_GRPO_0.5_beta_0.01_n_generations_12 • 2B • Updated Jun 24 • 5
LLucass/Bounded_PRESS_GRPO_0.4_beta_0.01_n_generations_12 • 2B • Updated Jun 24 • 5
LLucass/Bounded_PRESS_GRPO_0.3_beta_0.01_n_generations_12 • 2B • Updated Jun 24 • 5
LLucass/Tanh_PRESS_GRPO_2.0_beta_0.01_n_generations_12 • 2B • Updated Jun 23 • 5