PRM and fine-tuned LLM used in our PURE github repo: https://github.com/CJReinforce/PURE
Jie Cheng
jinachris
AI & ML interests
Reinforcement learning, LLM
Recent Activity
liked
a model
about 1 month ago
stepfun-ai/step3-fp8
liked
a model
about 1 month ago
stepfun-ai/step3
upvoted
a
collection
about 1 month ago
Step3
Organizations
None yet