Zhuokai Zhao

zhuokai
https://zhuokai-zhao.com/
  • zhuokaiz
  • zhuokaizhao

AI & ML interests

Data-Efficient Learning, LLM Reasoning and Safety, Active Learning, Recommender System

Recent Activity

updated a model 11 days ago
zhuokai/dapo_baseline_without_dynamic_sampling_temperature_1.2_Qwen2.5-Math-1.5B_zzk
published a model 11 days ago
zhuokai/dapo_baseline_without_dynamic_sampling_temperature_1.2_Qwen2.5-Math-1.5B_zzk
updated a model 11 days ago
zhuokai/dapo_baseline_without_dynamic_sampling_temperature_1.0_Qwen2.5-Math-1.5B_zzk

Organizations

MJ-Bench-Team, Project of MoE reward model

zhuokai's models (8)

zhuokai/dapo_baseline_without_dynamic_sampling_temperature_1.2_Qwen2.5-Math-1.5B_zzk

Updated 11 days ago

zhuokai/dapo_baseline_without_dynamic_sampling_temperature_1.0_Qwen2.5-Math-1.5B_zzk

Updated 11 days ago

zhuokai/dapo_baseline_without_dynamic_sampling_temperature_0.6_Qwen2.5-Math-1.5B_zzk

Updated 11 days ago

zhuokai/as_negexp_explore_1.2_stable_0.1_decay_freq_25_warmup_period_10_negexp_Qwen2.5-Math-1.5B_zzk

Updated 11 days ago

zhuokai/gpg_baseline_temperature_1.0_Qwen2.5-Math-1.5B_zzk

Updated 11 days ago

zhuokai/initial_grpo_baseline_temperature_0.6_Qwen2.5-Math-1.5B_zzk

Updated 12 days ago

zhuokai/initial_grpo_baseline_temperature_1.0_Qwen2.5-Math-1.5B_zzk

Updated 12 days ago

zhuokai/initial_grpo_baseline_temperature_1.2_Qwen2.5-Math-1.5B_zzk

Updated 12 days ago