Hugging Face
Cheng Wang
LLucass
1 follower · 14 following
https://wangcheng0116.github.io/
WangCheng0116
AI & ML interests
None yet
Recent Activity
upvoted a paper 2 days ago: Model-Task Alignment Drives Distinct RL Outcomes
commented on a paper 2 days ago: Model-Task Alignment Drives Distinct RL Outcomes
updated a model about 2 months ago: LLucass/GRPO-7B
Organizations
LLucass's models (114)
LLucass/Tanh_PRESS_GRPO_1.0_beta_0.01_n_generations_12 • 2B • Updated Jun 22 • 5
LLucass/Tanh_PRESS_GRPO_4.0_beta_0.01_n_generations_12 • 2B • Updated Jun 22 • 5
LLucass/Tanh_PRESS_GRPO_0.5_beta_0.01_n_generations_12 • 2B • Updated Jun 22 • 5
LLucass/PRESS_GRPO_2.0_beta_0.01_n_generation_12 • 2B • Updated Jun 22 • 5
LLucass/GRPO_beta_0.01_n_generation_12 • 2B • Updated Jun 22 • 5
LLucass/Tanh_PRESS_GRPO_2.0_beta_0.04 • 2B • Updated Jun 22 • 5
LLucass/Tanh_PRESS_GRPO_1.0_beta_0.04 • 2B • Updated Jun 22 • 5
LLucass/Tanh_PRESS_GRPO_2.0_beta_0.01 • 2B • Updated Jun 22 • 4
LLucass/ACC_GRPO_beta_0.01 • 2B • Updated Jun 22 • 5
LLucass/ACC_PRESS_GRPO_2.0_beta_0.01 • 2B • Updated Jun 22 • 5
LLucass/PRESS_GRPO_4.0_beta_0.01 • 2B • Updated Jun 22 • 5
LLucass/PRESS_GRPO_2.0_beta_0.01 • 2B • Updated Jun 22 • 5
LLucass/GRPO_beta_0.01 • 2B • Updated Jun 22 • 5
LLucass/PRESS_GRPO_2.0_beta_0.001 • 2B • Updated Jun 21 • 5
LLucass/PRESS_GRPO_1.0_beta_0.001 • 2B • Updated Jun 21 • 5
LLucass/GRPO_beta • Updated Jun 21
LLucass/PRESS_GRPO_0.5_beta_0.001 • 2B • Updated Jun 21 • 5
LLucass/GRPO_beta_0.001 • 2B • Updated Jun 21 • 5
LLucass/PRESS_GRPO_0.2 • 2B • Updated Jun 21 • 5
LLucass/PRESS_GRPO_4.0 • Updated Jun 21
LLucass/PRESS_GRPO_2.0 • 2B • Updated Jun 21 • 5
LLucass/PRESS_GRPO_1.5 • Updated Jun 21
LLucass/PRESS_GRPO_1.0 • 2B • Updated Jun 21 • 4
LLucass/PRESS_GRPO_0.5 • 2B • Updated Jun 21 • 5
LLucass/DR_GRPO • Updated Jun 21
LLucass/qwen-math-7b-entropy-top1k • Updated Jun 17
LLucass/Entropy-Maximization-All-Step2 • 8B • Updated Jun 14 • 6
LLucass/Entropy-Minimization-All-Step2 • 8B • Updated Jun 14 • 5
LLucass/Entropy-Maximization-Bot20-Step2 • 8B • Updated Jun 14 • 6
LLucass/FF_L0.2_H0.2_grpo • Text Generation • 2B • Updated Jun 13 • 7