Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
173
7
4
Zekun Wang
kugwzk
Follow
0 followers
·
8 following
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
4 days ago
s1: Simple test-time scaling
upvoted
a
paper
14 days ago
Demons in the Detail: On Implementing Load Balancing Loss for Training Specialized Mixture-of-Expert Models
authored
a paper
16 days ago
Demons in the Detail: On Implementing Load Balancing Loss for Training Specialized Mixture-of-Expert Models
View all activity
Organizations
Papers
9
arxiv:
2501.11873
arxiv:
2412.09605
arxiv:
2412.04454
arxiv:
2409.13199
Expand 9 papers
models
1
kugwzk/my_imagenet_ckpt
Updated
Apr 22, 2024
datasets
1
kugwzk/diffusion-data
Updated
Jan 9, 2024
•
3