Xi's picture

Xi

xi0v

·

AI & ML interests

Reinforcement learning, Diffusion Model Merging, LLM Merging, Model Editing and Vision/Multimodal Model Fine-tuning.

Recent Activity

liked a model about 9 hours ago

djuna/Q2.5-Veltha-14B

liked a model about 9 hours ago

Ttimofeyka/Tissint-14B-v1.2-128k-RP

liked a model about 9 hours ago

ozone-research/asteroid-14b-v0.1

View all activity

Organizations

xi0v's activity

upvoted a paper about 15 hours ago

Multimodal Representation Alignment for Image Generation: Text-Image Interleaved Control Is Easier Than You Think

Paper • 2502.20172 • Published 14 days ago • 27

upvoted an article 1 day ago

Article

Open R1: Update #3

By

and 9 others •

2 days ago

• 197

upvoted 4 papers 2 days ago

EasyControl: Adding Efficient and Flexible Control for Diffusion Transformer

Paper • 2503.07027 • Published 4 days ago • 23

MM-Eureka: Exploring Visual Aha Moment with Rule-based Large-scale Reinforcement Learning

Paper • 2503.07365 • Published 3 days ago • 53

SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model

Paper • 2502.02737 • Published Feb 4 • 203

DistiLLM-2: A Contrastive Approach Boosts the Distillation of LLMs

Paper • 2503.07067 • Published 4 days ago • 27

upvoted 2 papers 5 days ago

Token-Efficient Long Video Understanding for Multimodal LLMs

Paper • 2503.04130 • Published 8 days ago • 77

LLMVoX: Autoregressive Streaming Text-to-Speech Model for Any LLM

Paper • 2503.04724 • Published 7 days ago • 60

upvoted a collection 8 days ago

cool datasets

156 items • Updated about 9 hours ago • 15

upvoted an article 9 days ago

Article

G2P Shrinks Speech Models

By

•

Feb 5

• 36

upvoted a collection 9 days ago

granite-abliterated

3 items • Updated 9 days ago • 3

upvoted an article 11 days ago

Article

Custom architectures with HuggingFace 🤗

By

•

Apr 22, 2024

• 26

upvoted a paper 12 days ago

Rank1: Test-Time Compute for Reranking in Information Retrieval

Paper • 2502.18418 • Published 16 days ago • 25

upvoted a paper 13 days ago

Can Large Language Models Detect Errors in Long Chain-of-Thought Reasoning?

Paper • 2502.19361 • Published 15 days ago • 26

upvoted an article 14 days ago

Article

Common AI Model Formats

By

•

14 days ago

• 29

upvoted a paper 14 days ago

Thus Spake Long-Context Large Language Model

Paper • 2502.17129 • Published 18 days ago • 68

upvoted 2 papers 15 days ago

SWE-RL: Advancing LLM Reasoning via Reinforcement Learning on Open Software Evolution

Paper • 2502.18449 • Published 16 days ago • 68

Slamming: Training a Speech Language Model on One GPU in a Day

Paper • 2502.15814 • Published 22 days ago • 66

upvoted a paper 16 days ago

Make LoRA Great Again: Boosting LoRA with Adaptive Singular Values and Mixture-of-Experts Optimization Alignment

Paper • 2502.16894 • Published 18 days ago • 27

upvoted a paper 17 days ago

LLM-Microscope: Uncovering the Hidden Role of Punctuation in Context Memory of Transformers

Paper • 2502.15007 • Published 21 days ago • 162