2 6

wen

zhengwenzhen

AI & ML interests

None yet

Recent Activity

upvoted a paper about 9 hours ago

Predictable Scale: Part I -- Optimal Hyperparameter Scaling Law in Large Language Model Pretraining

liked a Space 4 days ago

nanotron/ultrascale-playbook

upvoted an article 6 months ago

Recreating o1 at Home with Role-Play LLMs

Organizations

Models

None public yet

Datasets

None public yet