Submitted by akhaliq 53 Meteor: Mamba-based Traversal of Rationale for Large Language and Vision Models · 4 authors 6
Submitted by akhaliq 44 ConvLLaVA: Hierarchical Backbones as Visual Encoder for Large Multimodal Models · 9 authors 7
Submitted by akhaliq 38 Grokked Transformers are Implicit Reasoners: A Mechanistic Journey to the Edge of Generalization · 4 authors 1
Submitted by akhaliq 26 Stacking Your Transformers: A Closer Look at Model Growth for Efficient LLM Pre-Training · 8 authors 1
Submitted by akhaliq 25 AutoCoder: Enhancing Code Large Language Model with AIEV-Instruct · 3 authors 9
Submitted by akhaliq 17 CraftsMan: High-fidelity Mesh Generation with 3D Native Generation and Interactive Geometry Refiner · 7 authors 2
Submitted by akhaliq 15 Automatic Data Curation for Self-Supervised Learning: A Clustering-Based Approach · 13 authors
Submitted by akhaliq 14 Denoising LM: Pushing the Limits of Error Correction Models for Speech Recognition · 6 authors
Submitted by akhaliq 13 Data Mixing Made Efficient: A Bivariate Scaling Law for Language Model Pretraining · 5 authors
Submitted by akhaliq 6 HDR-GS: Efficient High Dynamic Range Novel View Synthesis at 1000x Speed via Gaussian Splatting · 7 authors