Submitted by zstanjj 68 HtmlRAG: HTML is Better Than Plain Text for Modeling Retrieved Knowledge in RAG Systems · 6 authors 22
Submitted by prlz77 18 Controlling Language and Diffusion Models by Transporting Activations · 7 authors 2
Submitted by Yang130 13 DeeR-VLA: Dynamic Inference of Multimodal Large Language Models for Efficient Robot Execution · 8 authors 2
Submitted by LiquidAmmonia 11 DreamPolish: Domain Score Distillation With Progressive Geometry Generation · 8 authors 2
Submitted by xiaojin66 9 GarVerseLOD: High-Fidelity 3D Garment Reconstruction from a Single In-the-Wild Image using a Dataset with Levels of Details · 9 authors 1
Submitted by Ksgk-fy 7 Inference Optimal VLMs Need Only One Visual Token but Larger Models · 4 authors 1
Submitted by ksoman 6 Zebra-Llama: A Context-Aware Large Language Model for Democratizing Rare Disease Knowledge · 8 authors 1
Submitted by mbar0075 4 Correlation of Object Detection Performance with Visual Saliency and Depth Estimation · 2 authors 1