Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
1
6
Cheng Wang
LLucass
Follow
21world's profile picture
1 follower
·
14 following
https://wangcheng0116.github.io/
WangCheng0116
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
2 days ago
Model-Task Alignment Drives Distinct RL Outcomes
commented
on
a paper
2 days ago
Model-Task Alignment Drives Distinct RL Outcomes
updated
a model
about 2 months ago
LLucass/GRPO-7B
View all activity
Organizations
LLucass
's models
114
Sort: Recently updated
LLucass/GRPO-7B
8B
•
Updated
Jul 12
•
7
LLucass/Gaussian-7B
8B
•
Updated
Jul 12
•
7
LLucass/Gaussian-1.5B-Format-New-Repro
2B
•
Updated
Jul 11
•
7
LLucass/GRPO-1.5B-Format-New-Repro
2B
•
Updated
Jul 11
•
7
LLucass/Gaussian-1.5B-Format-New
2B
•
Updated
Jul 11
•
7
LLucass/GRPO-1.5B-Format-New
2B
•
Updated
Jul 11
•
6
LLucass/GRPO-1.5B-Format-Old
2B
•
Updated
Jul 11
•
6
LLucass/Gaussian-1.5B-Format-Old-Numel
2B
•
Updated
Jul 10
•
6
LLucass/GRPO-1.5B-Format-Old-Numel
2B
•
Updated
Jul 10
•
7
LLucass/Gaussian-1.5B-Format-New-Numel
2B
•
Updated
Jul 10
•
7
LLucass/GRPO-1.5B-Format-New-Numel
2B
•
Updated
Jul 10
•
7
LLucass/Gaussian-1.5B
2B
•
Updated
Jul 9
•
7
LLucass/GRPO-1.5B
2B
•
Updated
Jul 9
•
7
LLucass/Gaussian-1.5B-Format-Old
2B
•
Updated
Jul 9
•
5
LLucass/Gaussian-1.5B-Format
2B
•
Updated
Jul 9
•
6
LLucass/GRPO-1.5B-Format
2B
•
Updated
Jul 9
•
6
LLucass/Gaussian-1.5B-Cos
2B
•
Updated
Jul 8
•
7
LLucass/GRPO-1.5B-Cos
2B
•
Updated
Jul 8
•
6
LLucass/pass-k
Updated
Jul 7
LLucass/Cov-Vis
2B
•
Updated
Jul 6
•
7
LLucass/Gaussian-Qwen-1.5B
2B
•
Updated
Jul 5
•
6
LLucass/GRPO-Qwen-1.5B
2B
•
Updated
Jul 5
•
7
LLucass/GRPO-7B-beta-0.00
8B
•
Updated
Jul 5
•
7
LLucass/Gaussian-7B-beta-0.00
8B
•
Updated
Jul 5
•
7
LLucass/GRPO-7B-Base
8B
•
Updated
Jul 4
•
7
LLucass/Gaussian-7B-Base
8B
•
Updated
Jul 4
•
6
LLucass/Qwen-2.5-7B-Simple-RL-RS-Gaussian
8B
•
Updated
Jul 4
•
7
LLucass/Qwen-2.5-7B-Simple-RL-RS
8B
•
Updated
Jul 4
•
7
LLucass/Qwen-2.5-7B-Simple-RL-Gaussian
Updated
Jul 3
LLucass/Qwen-2.5-7B-Simple-RL
Updated
Jul 3
Previous
1
2
3
4
Next