arxiv:2501.17116
Guoshuai Zhao
crayonshine
AI & ML interests
None yet
Recent Activity
authored a paper, 1 day ago
Optimizing Large Language Model Training Using FP4 Quantization
authored a paper, 7 days ago
Sigma: Differential Rescaling of Query, Key and Value for Efficient Language Models
upvoted a paper, 9 months ago
FP8-LM: Training FP8 Large Language Models
Organizations
None yet