Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
1
6
Cheng Wang
LLucass
Follow
21world's profile picture
1 follower
·
14 following
https://wangcheng0116.github.io/
WangCheng0116
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
2 days ago
Model-Task Alignment Drives Distinct RL Outcomes
commented
on
a paper
2 days ago
Model-Task Alignment Drives Distinct RL Outcomes
updated
a model
about 2 months ago
LLucass/GRPO-7B
View all activity
Organizations
LLucass
's models
114
Sort: Recently updated
LLucass/ACC_TT_L0.2_H0.28_dr_grpo
Text Generation
•
2B
•
Updated
Jun 9
•
8
LLucass/ACC_TT_L0.2_H0.28_grpo
Text Generation
•
2B
•
Updated
Jun 9
•
10
LLucass/ACC_TT_L0.2_H0.2_dr_grpo
Text Generation
•
2B
•
Updated
Jun 9
•
8
LLucass/ACC_TT_L0.2_H0.2_grpo
Text Generation
•
2B
•
Updated
Jun 9
•
8
LLucass/FF_L0.2_H0.28_dr_grpo
Text Generation
•
2B
•
Updated
Jun 9
•
8
LLucass/TT_L0.2_H0.28_dr_grpo
Text Generation
•
2B
•
Updated
Jun 8
•
7
LLucass/FF_L0.2_H0.28_grpo
2B
•
Updated
Jun 8
•
6
LLucass/TT_L0.2_H0.2_dr_grpo
Text Generation
•
2B
•
Updated
Jun 8
•
8
LLucass/TT_L0.2_H0.28_grpo
Text Generation
•
2B
•
Updated
Jun 8
•
8
LLucass/FF_L0.2_H0.2_dr_grpo
Text Generation
•
2B
•
Updated
Jun 8
•
8
LLucass/TT_L0.2_H0.2_grpo
Text Generation
•
2B
•
Updated
Jun 8
•
11
LLucass/FT_L0.2_H0.2_grpo
2B
•
Updated
Jun 8
•
6
LLucass/jaccard-FT0.28-GRPO
Updated
Jun 8
LLucass/jaccard-TF-GRPO
Updated
Jun 8
LLucass/jaccard-FF-GRPO
2B
•
Updated
Jun 8
•
6
LLucass/jaccard-FT-GRPO
Updated
Jun 8
LLucass/FT-GRPO
2B
•
Updated
Jun 8
•
6
LLucass/jaccard-TT-GRPO
2B
•
Updated
Jun 7
•
6
LLucass/FT-DrGRPO
Updated
Jun 7
LLucass/TT-DrGRPO
2B
•
Updated
Jun 7
•
4
LLucass/FF-DrGRPO
2B
•
Updated
Jun 7
•
6
LLucass/TT-GRPO
2B
•
Updated
Jun 7
•
6
LLucass/FF-GRPO
2B
•
Updated
Jun 7
•
6
LLucass/DRA-GRPO
Text Generation
•
2B
•
Updated
Jun 7
•
8
Previous
1
2
3
4
Next