Zeb K

baobaoh

zebwithb

AI & ML interests

None yet

Recent Activity

upvoted a collection 11 days ago

Qwen2.5-VL

upvoted a paper 11 days ago

Qwen2.5-VL Technical Report

upvoted a paper 11 days ago

Reward Steering with Evolutionary Heuristics for Decoding-time Alignment

Organizations

None yet

baobaoh's activity

upvoted a collection 11 days ago

Qwen2.5-VL

Collection

Vision-language model series based on Qwen2.5 • 8 items • Updated 17 days ago • 396

upvoted 4 papers 11 days ago

TheoremExplainAgent: Towards Multimodal Explanations for LLM Theorem Understanding

Paper • 2502.19400 • Published 15 days ago • 43

New activity in m-a-p/MERT-v1-95M 12 days ago

MERT-v1-95M not compatible with Transformers >=4.44.0

#4 opened 12 days ago by baobaoh

upvoted a paper 12 days ago

NeoBERT: A Next-Generation BERT

Paper • 2502.19587 • Published 15 days ago • 38

upvoted an article 12 days ago

The Large Language Model Course

Article • Jan 16 • 130

upvoted 8 papers 14 days ago

CritiQ: Mining Data Quality Criteria from Human Preferences

Paper • 2502.19279 • Published 15 days ago • 9

Scaling up Test-Time Compute with Latent Reasoning: A Recurrent Depth Approach

Paper • 2502.05171 • Published Feb 7 • 123

Distillation Scaling Laws

Paper • 2502.08606 • Published 29 days ago • 46

The Stochastic Parrot on LLM's Shoulder: A Summative Assessment of Physical Concept Understanding

Paper • 2502.08946 • Published 28 days ago • 184

InfiniteHiP: Extending Language Model Context Up to 3 Million Tokens on a Single GPU

Paper • 2502.08910 • Published 29 days ago • 143

Large Language Diffusion Models

Paper • 2502.09992 • Published 27 days ago • 103

S*: Test Time Scaling for Code Generation

Paper • 2502.14382 • Published 21 days ago • 60

MLGym: A New Framework and Benchmark for Advancing AI Research Agents

Paper • 2502.14499 • Published 21 days ago • 179

liked a model 14 days ago

Qwen/QwQ-32B-Preview

Text Generation • Updated Jan 12 • 249k • 1.72k

liked a model 18 days ago

openbmb/MiniCPM-o-2_6

Any-to-Any • Updated 10 days ago • 333k • 1.04k

liked a Space 22 days ago

Music2emo

Towards Unified Music Emotion Recognition across Dimensional

liked a model 23 days ago

amaai-lab/music2emo

Updated 29 days ago • 2