Project of MoE reward model

Team members: Zhuokai Zhao, Shengyi Qian, Yuhang Zhou, Xiaoyu Liu, Jing Zhu, wave

models 6

MoeReward/rl_checkpoints

Updated Jun 27

MoeReward/lora_checkpoint

Updated Mar 30

MoeReward/reward_lora_qwen_1_5_base

Updated Mar 21 • 6

MoeReward/reward_qwen_1_5

14B • Updated Mar 17 • 1

MoeReward/reward_lora_qwen_1_5

Updated Mar 17 • 5

MoeReward/sft_full_param_qwen_1_5

14B • Updated Mar 16 • 4

datasets 54

MoeReward/combined_rlhf_dataset_grpo_imdb_main_2K

Viewer • Updated May 6 • 2k • 7

MoeReward/combined_rlhf_dataset_grpo_metamath_main_2K

Viewer • Updated May 6 • 2k • 2

MoeReward/combined_rlhf_dataset_grpo_arc_main_2K

Viewer • Updated May 6 • 2k • 4

MoeReward/combined_rlhf_dataset_grpo_nq_main_2K

Viewer • Updated May 6 • 2k • 7

MoeReward/combined_rlhf_dataset_grpo_equal_dist_2K

Viewer • Updated May 6 • 2k • 2

MoeReward/combined_rlhf_dataset_grpo_imdb_main

Viewer • Updated Apr 1 • 4k • 8

MoeReward/combined_rlhf_dataset_grpo_metamath_main

Viewer • Updated Apr 1 • 4k • 10

MoeReward/combined_rlhf_dataset_grpo_arc_main

Viewer • Updated Apr 1 • 4k • 6

MoeReward/combined_rlhf_dataset_grpo_nq_main

Viewer • Updated Apr 1 • 4k • 4

MoeReward/combined_rlhf_dataset_grpo_equal_dist

Viewer • Updated Apr 1 • 4k • 3