Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Yixuan Wei's picture
3 6 16

Yixuan Wei

EasonWei
0xSojalSec's profile picture manik-hossain's profile picture masterwayne1's profile picture
·
  • weiyx16

AI & ML interests

None yet

Recent Activity

authored a paper about 23 hours ago
mHC: Manifold-Constrained Hyper-Connections
upvoted a paper 5 months ago
FP4 All the Way: Fully Quantized Training of LLMs
upvoted a paper 5 months ago
Group Sequence Policy Optimization
View all activity

Organizations

OneModel's profile picture Xwin-LM's profile picture DeepSeek's profile picture

upvoted 3 papers 5 months ago

FP4 All the Way: Fully Quantized Training of LLMs

Paper • 2505.19115 • Published May 25, 2025 • 3

Group Sequence Policy Optimization

Paper • 2507.18071 • Published Jul 24, 2025 • 316

The Common Pile v0.1: An 8TB Dataset of Public Domain and Openly Licensed Text

Paper • 2506.05209 • Published Jun 5, 2025 • 59
upvoted a paper 7 months ago

The Entropy Mechanism of Reinforcement Learning for Reasoning Language Models

Paper • 2505.22617 • Published May 28, 2025 • 131
upvoted a paper about 1 year ago

On Memorization of Large Language Models in Logical Reasoning

Paper • 2410.23123 • Published Oct 30, 2024 • 18
upvoted a collection over 1 year ago

Qwen2.5

Collection
Qwen2.5 language models, including pretrained and instruction-tuned models of 7 sizes, including 0.5B, 1.5B, 3B, 7B, 14B, 32B, and 72B. • 46 items • Updated 3 days ago • 672
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs