Liu's picture

1 7

Liu

Shiweiliuiiiiiii

https://shiweiliuiiiiiii.github.io/

Shiwei_Liu66

AI & ML interests

LLM, reasoning, ML efficiency

Recent Activity

upvoted a paper 3 days ago

Mask-Enhanced Autoregressive Prediction: Pay Less Attention to Learn More

commented on a paper 3 days ago

Mask-Enhanced Autoregressive Prediction: Pay Less Attention to Learn More

upvoted a paper 4 days ago

The Curse of Depth in Large Language Models

View all activity

Organizations

None yet

Shiweiliuiiiiiii's activity

upvoted a paper 3 days ago

Mask-Enhanced Autoregressive Prediction: Pay Less Attention to Learn More

Paper • 2502.07490 • Published 4 days ago • 8

upvoted a paper 4 days ago

The Curse of Depth in Large Language Models

Paper • 2502.05795 • Published 6 days ago • 24

upvoted a paper 23 days ago

O1-Pruner: Length-Harmonizing Fine-Tuning for O1-Like Reasoning Pruning

Paper • 2501.12570 • Published 25 days ago • 24

upvoted a paper about 1 month ago

SPAM: Spike-Aware Adam with Momentum Reset for Stable LLM Training

Paper • 2501.06842 • Published Jan 12 • 15

upvoted a paper about 2 months ago

Mix-LN: Unleashing the Power of Deeper Layers by Combining Pre-LN and Post-LN

Paper • 2412.13795 • Published Dec 18, 2024 • 19

upvoted a paper 2 months ago

APOLLO: SGD-like Memory, AdamW-level Performance

Paper • 2412.05270 • Published Dec 6, 2024 • 37

upvoted a paper 7 months ago

Q-GaLore: Quantized GaLore with INT4 Projection and Layer-Adaptive Low-Rank Gradients

Paper • 2407.08296 • Published Jul 11, 2024 • 32