Zidi Xiong
polaris-73
1 follower · 2 following
Recent Activity
Published an article about 3 hours ago: Budget Alignment: Making Models Reason in the User’s Language
Updated a model 18 days ago: polaris-73/ds1p5b_grpo_ifeval_skywork_continue-global_step_400
Published a model 18 days ago: polaris-73/ds1p5b_grpo_ifeval_skywork_continue-global_step_400
polaris-73's models (91)
polaris-73/ds7b_grpo_math_gsm8k_reinforce-global_step_800 • 8B • Updated Aug 1 • 3
polaris-73/ds7b_grpo_math_gsm8k_reinforce-global_step_600 • 8B • Updated Aug 1 • 1
polaris-73/ds7b_grpo_math_gsm8k_reinforce-global_step_400 • 8B • Updated Aug 1 • 3
polaris-73/ds7b_grpo_math_gsm8k_reinforce-global_step_200 • 8B • Updated Aug 1 • 6
polaris-73/ds7b_grpo_math_gsm8k_nokl-global_step_870 • 8B • Updated Aug 1 • 3
polaris-73/ds7b_grpo_math_gsm8k_nokl-global_step_800 • 8B • Updated Aug 1 • 6
polaris-73/ds7b_grpo_math_gsm8k_nokl-global_step_600 • 8B • Updated Aug 1 • 6
polaris-73/ds7b_grpo_math_gsm8k_nokl-global_step_400 • 8B • Updated Aug 1 • 6
polaris-73/ds7b_grpo_math_gsm8k_nokl-global_step_200 • 8B • Updated Aug 1 • 3
polaris-73/ds7b_grpo_math_gsm8k-global_step_200 • Updated Jul 17
polaris-73/ds1p5b_grpo_math_gsm8k_cliphigh-global_step_870 • 2B • Updated Jul 14 • 2
polaris-73/ds1p5b_grpo_math_gsm8k_cliphigh-global_step_800 • 2B • Updated Jul 14 • 2
polaris-73/ds1p5b_grpo_math_gsm8k_cliphigh-global_step_600 • 2B • Updated Jul 14 • 3
polaris-73/ds1p5b_grpo_math_gsm8k_cliphigh-global_step_400 • 2B • Updated Jul 14
polaris-73/ds1p5b_grpo_math_gsm8k_cliphigh-global_step_200 • 2B • Updated Jul 14 • 3
polaris-73/ds1p5b_grpo_math_gsm8k_ppo-global_step_870 • 2B • Updated Jul 14 • 3
polaris-73/ds1p5b_grpo_math_gsm8k_ppo-global_step_600 • 2B • Updated Jul 14 • 3
polaris-73/ds1p5b_grpo_math_gsm8k_ppo-global_step_400 • 2B • Updated Jul 14 • 3
polaris-73/ds1p5b_grpo_math_gsm8k_ppo-global_step_200 • 2B • Updated Jul 14 • 3
polaris-73/ds1p5b_grpo_math_gsm8k_nokl-global_step_870 • 2B • Updated Jul 14 • 3
polaris-73/ds1p5b_grpo_math_gsm8k_nokl-global_step_800 • 2B • Updated Jul 14 • 3
polaris-73/ds1p5b_grpo_math_gsm8k_nokl-global_step_600 • 2B • Updated Jul 14 • 3
polaris-73/ds1p5b_grpo_math_gsm8k_nokl-global_step_400 • 2B • Updated Jul 14 • 3
polaris-73/ds1p5b_grpo_math_gsm8k_nokl-global_step_200 • 2B • Updated Jul 14 • 3
polaris-73/ds1p5b_grpo_math_gsm8k-global_step_870 • 2B • Updated Jul 14 • 1
polaris-73/ds1p5b_grpo_math_gsm8k-global_step_800 • 2B • Updated Jul 14 • 3
polaris-73/ds1p5b_grpo_math_gsm8k-global_step_600 • 2B • Updated Jul 14 • 1
polaris-73/ds1p5b_grpo_math_gsm8k-global_step_400 • 2B • Updated Jul 14 • 3
polaris-73/ds1p5b_grpo_math_gsm8k-global_step_200 • 2B • Updated Jul 14 • 1
polaris-73/ds7b_grpo_math_faithful_step200 • 8B • Updated Jul 2 • 3 • 1