Generative Refocusing: Flexible Defocus Control from a Single Image Paper • 2512.16923 • Published 16 days ago • 37
Native Parallel Reasoner: Reasoning in Parallelism via Self-Distilled Reinforcement Learning Paper • 2512.07461 • Published 26 days ago • 74
Robix: A Unified Model for Robot Interaction, Reasoning and Planning Paper • 2509.01106 • Published Sep 1, 2025 • 51
PromptBridge: Cross-Model Prompt Transfer for Large Language Models Paper • 2512.01420 • Published Dec 1, 2025 • 9
DuPO: Enabling Reliable LLM Self-Verification via Dual Preference Optimization Paper • 2508.14460 • Published Aug 20, 2025 • 85
UltraImage: Rethinking Resolution Extrapolation in Image Diffusion Transformers Paper • 2512.04504 • Published about 1 month ago • 17
CriticLean: Critic-Guided Reinforcement Learning for Mathematical Formalization Paper • 2507.06181 • Published Jul 8, 2025 • 44
OneThinker: All-in-one Reasoning Model for Image and Video Paper • 2512.03043 • Published Dec 2, 2025 • 32
ProtoReasoning: Prototypes as the Foundation for Generalizable Reasoning in LLMs Paper • 2506.15211 • Published Jun 18, 2025 • 38
Architecture Decoupling Is Not All You Need For Unified Multimodal Model Paper • 2511.22663 • Published Nov 27, 2025 • 29
Enigmata: Scaling Logical Reasoning in Large Language Models with Synthetic Verifiable Puzzles Paper • 2505.19914 • Published May 26, 2025 • 45
ORPO-Distill: Mixed-Policy Preference Optimization for Cross-Architecture LLM Distillation Paper • 2509.25100 • Published Sep 29, 2025 • 1
Model Merging in Pre-training of Large Language Models Paper • 2505.12082 • Published May 17, 2025 • 40
MegaScale-MoE: Large-Scale Communication-Efficient Training of Mixture-of-Experts Models in Production Paper • 2505.11432 • Published May 16, 2025 • 2
AttentionInfluence: Adopting Attention Head Influence for Weak-to-Strong Pretraining Data Selection Paper • 2505.07293 • Published May 12, 2025 • 28