
zy

lu-vae

AI & ML interests

NLP text generation

Recent Activity

upvoted a paper 23 days ago
Mixture-of-Depths Attention
upvoted a paper 23 days ago
Attention Residuals
liked a dataset 23 days ago
stepfun-ai/Step-3.5-Flash-SFT

Organizations

Chinese-Vicuna, StepFun, mask-mask-mask-mask

upvoted 2 papers 23 days ago

Mixture-of-Depths Attention

Paper • 2603.15619 • Published 23 days ago • 79

Attention Residuals

Paper • 2603.15031 • Published 24 days ago • 176
upvoted a paper about 2 months ago

Step 3.5 Flash: Open Frontier-Level Intelligence with 11B Active Parameters

Paper • 2602.10604 • Published Feb 11 • 194
upvoted 2 papers about 1 year ago

Extrapolating and Decoupling Image-to-Video Generation Models: Motion Modeling is Easier Than You Think

Paper • 2503.00948 • Published Mar 2, 2025 • 3

Make LoRA Great Again: Boosting LoRA with Adaptive Singular Values and Mixture-of-Experts Optimization Alignment

Paper • 2502.16894 • Published Feb 24, 2025 • 32
upvoted 2 papers almost 2 years ago

On Giant's Shoulders: Effortless Weak to Strong by Dynamic Logits Fusion

Paper • 2406.15480 • Published Jun 17, 2024 • 2

Twin-Merging: Dynamic Integration of Modular Expertise in Model Merging

Paper • 2406.15479 • Published Jun 17, 2024 • 2