Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
10
1
Jiarui Yao
FlippyDora
Follow
research4pan's profile picture
1 follower
·
16 following
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
3 days ago
Beyond Correctness: Harmonizing Process and Outcome Rewards through RL Training
upvoted
a
paper
20 days ago
DINOv3
liked
a model
23 days ago
rednote-hilab/dots.ocr
View all activity
Organizations
FlippyDora
's models
60
Sort: Recently updated
FlippyDora/Qwen1.5B-Inst_numina_raft1_orig_eos
Text Generation
•
2B
•
Updated
Mar 6
•
8
FlippyDora/qwen_sft_1
Text Generation
•
8B
•
Updated
Mar 4
•
6
FlippyDora/qwen_sft_2
Text Generation
•
8B
•
Updated
Mar 4
•
6
FlippyDora/Qwen_numina_raft3_orig_eos
Text Generation
•
8B
•
Updated
Mar 1
•
8
FlippyDora/Qwen_numina_raft2_orig_eos
Text Generation
•
8B
•
Updated
Mar 1
•
5
FlippyDora/3B_rpr_mixtureBT_criteria_loadBalance0.5_epoch5_k10
3B
•
Updated
Feb 24
•
2
FlippyDora/3B_rpr_mixtureBT_attr_loadBalance0.5_epoch5_k5
3B
•
Updated
Feb 24
•
2
FlippyDora/3B_rpr_mixtureBT_attr_loadBalance0.5_epoch5_k10
3B
•
Updated
Feb 24
•
2
FlippyDora/3B_mixtureBT_rpr_criteria_k5_epoch5_loadBalance0.5
3B
•
Updated
Feb 22
•
2
FlippyDora/3B_mixtureBT_helpsteer2_pkusafe_attr_heads6_loadBalance0.5
3B
•
Updated
Feb 12
•
2
FlippyDora/3B_mixtureBT_rpr_criteria_epoch5_loadBalance0.5
3B
•
Updated
Feb 10
•
2
FlippyDora/3B_rpr_mixtureBT_attr_loadBalance0.5
3B
•
Updated
Feb 8
•
2
FlippyDora/3B_helpsteer2_mixtureBT_attr_loadBalance0.5
3B
•
Updated
Feb 8
•
2
FlippyDora/CoT_Translator
7B
•
Updated
Feb 6
•
6
FlippyDora/CoT_Prover
7B
•
Updated
Feb 4
•
4
FlippyDora/dpo_rm
3B
•
Updated
Jan 21
•
2
FlippyDora/dpo_remove
3B
•
Updated
Jan 19
•
6
FlippyDora/origin_preference700k
3B
•
Updated
Jan 18
•
2
FlippyDora/MixtureBT_preference700k_LoadBalance0.5
3B
•
Updated
Jan 18
•
2
FlippyDora/MathLLM-StatementTranslator-7B-v0.1
7B
•
Updated
Jan 17
•
3
FlippyDora/MixtureBT_Helpsteer2_LoadBalance0.5
3B
•
Updated
Jan 16
•
2
FlippyDora/step_dpo_mistral_lr1e-7_step200
7B
•
Updated
Dec 5, 2024
•
4
FlippyDora/step_dpo_mistral_lr1e-7_step100
7B
•
Updated
Dec 5, 2024
•
4
FlippyDora/mdpo
3B
•
Updated
Nov 21, 2024
•
4
FlippyDora/mdpo_guess_cities
3B
•
Updated
Nov 21, 2024
•
6
FlippyDora/dpo-rm-translate
Updated
Nov 17, 2024
FlippyDora/gemma-2b-it_lora_r128_lr5e-4_dpo
Updated
Oct 23, 2024
•
1
FlippyDora/gemma-2b-it_lora_r32_lr5e-4_dpo
Updated
Oct 22, 2024
•
1
FlippyDora/gemma-2b-it_lora_r16_lr5e-4_dpo
Updated
Oct 22, 2024
•
1
FlippyDora/gemma-2b-it_lr1e-5_ultrafeedback
3B
•
Updated
Oct 16, 2024
•
3
Previous
1
2
Next