Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
james's picture
1

james

jamesjunyuguo
·
https://jamesjunyuguo.github.io/
  • jamesjunyuguo

AI & ML interests

None yet

Organizations

UC Berkeley's profile picture

Collections 1

reward modelling
  • Inference-Time Scaling for Generalist Reward Modeling

    Paper • 2504.02495 • Published Apr 3 • 56
reward modelling
  • Inference-Time Scaling for Generalist Reward Modeling

    Paper • 2504.02495 • Published Apr 3 • 56

models 11

jamesjunyuguo/llama-3-3b-math-orca-qlora-10k-ep1

Updated Jun 13

jamesjunyuguo/dpo-llama-3-1-8b-math

Text Generation • 8B • Updated Apr 23

jamesjunyuguo/llama-3-1-8b-math-orca-qlora-10k-ep1

Updated Apr 23

jamesjunyuguo/llama-3-1-8b-sft

Updated Apr 16

jamesjunyuguo/qwen-2.5-3b-r1-countdown

Text Generation • 3B • Updated Apr 10 • 3

jamesjunyuguo/qwen-2.5-3b-r1-distort-4.0

3B • Updated Mar 13

jamesjunyuguo/qwen-2.5-3b-r1-distort-1.0

Text Generation • 3B • Updated Mar 13

jamesjunyuguo/qwen-2.5-3b-r1-distort-3.0

Text Generation • 3B • Updated Mar 13

jamesjunyuguo/qwen-2.5-3b-r1-distort

3B • Updated Mar 13 • 2

jamesjunyuguo/llama-3-1-8b-math-orca-qlora-10k-ep1-merged

8B • Updated Feb 28
View 11 models

datasets 1

jamesjunyuguo/philschmid-llama-3-1-8b-math-orca-spectr-philschmid-DMath-candidates

Viewer • Updated Jul 24 • 1.96k • 6
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs