Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Weigao Sun's picture
5 29 13

Weigao Sun

weigao266
21world's profile picture SII-xrliu's profile picture SteveSHEN's profile picture
·
https://weigao266.github.io/
  • sunweigao
  • weigao266

AI & ML interests

Algo & MLSys

Organizations

linear-moe-hub's profile picture

commented a paper 3 months ago

Native Hybrid Attention for Efficient Sequence Modeling

Paper • 2510.07019 • Published Oct 8, 2025 • 16 •
2
commented a paper 5 months ago

Speed Always Wins: A Survey on Efficient Architectures for Large Language Models

Paper • 2508.09834 • Published Aug 13, 2025 • 53 •
2
commented a paper 10 months ago

Linear-MoE: Linear Sequence Modeling Meets Mixture-of-Experts

Paper • 2503.05447 • Published Mar 7, 2025 • 8 •
2
commented 3 papers 11 months ago

Liger: Linearizing Large Language Models to Gated Recurrent Structures

Paper • 2503.01496 • Published Mar 3, 2025 • 18 •
2

MoM: Linear Sequence Modeling with Mixture-of-Memories

Paper • 2502.13685 • Published Feb 19, 2025 • 36 •
2

LASP-2: Rethinking Sequence Parallelism for Linear Attention and Its Hybrid

Paper • 2502.07563 • Published Feb 11, 2025 • 23 •
2
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs