wen
zhengwenzhen
AI & ML interests
None yet
Recent Activity
upvoted a paper about 9 hours ago
Predictable Scale: Part I -- Optimal Hyperparameter Scaling Law in Large Language Model Pretraining
liked a Space 4 days ago
nanotron/ultrascale-playbook
upvoted an article 6 months ago
Recreating o1 at Home with Role-Play LLMs
Organizations
Models
None public yet
Datasets
None public yet