LLM as a Broken Telephone: Iterative Generation Distorts Information Paper β’ 2502.20258 β’ Published 14 days ago β’ 21
EasyControl: Adding Efficient and Flexible Control for Diffusion Transformer Paper β’ 2503.07027 β’ Published 4 days ago β’ 21
On the Acquisition of Shared Grammatical Representations in Bilingual Language Models Paper β’ 2503.03962 β’ Published 8 days ago β’ 3
The Best of Both Worlds: Integrating Language Models and Diffusion Models for Video Generation Paper β’ 2503.04606 β’ Published 7 days ago β’ 7
How to Steer LLM Latents for Hallucination Detection? Paper β’ 2503.01917 β’ Published 12 days ago β’ 10
PokΓ©Champ: an Expert-level Minimax Language Agent Paper β’ 2503.04094 β’ Published 8 days ago β’ 9
Audio Flamingo 2: An Audio-Language Model with Long-Audio Understanding and Expert Reasoning Abilities Paper β’ 2503.03983 β’ Published 8 days ago β’ 22
iFormer: Integrating ConvNet and Transformer for Mobile Application Paper β’ 2501.15369 β’ Published Jan 26 β’ 12
Molmo Collection Artifacts for open multimodal language models. β’ 5 items β’ Updated about 6 hours ago β’ 299
Tulu 3 Models Collection All models released with Tulu 3 -- state of the art open post-training recipes. β’ 11 items β’ Updated about 6 hours ago β’ 93
π§ Abliteration Collection Uncensored models using abliteration. See this article for more information: huggingface.co/blog/mlabonne/abliteration β’ 7 items β’ Updated Nov 18, 2024 β’ 30
Transcription Collection Transcribe interviews for free with Whisper in Spaces. β’ 10 items β’ Updated Oct 1, 2024 β’ 8
Mantis Collection Mantis model family optimized for multi-image reasoning with interleaved text/image format β’ 11 items β’ Updated Jul 2, 2024 β’ 9
PHAnToM: Personality Has An Effect on Theory-of-Mind Reasoning in Large Language Models Paper β’ 2403.02246 β’ Published Mar 4, 2024 β’ 1
Refining Text-to-Image Generation: Towards Accurate Training-Free Glyph-Enhanced Image Generation Paper β’ 2403.16422 β’ Published Mar 25, 2024 β’ 1
Octree-GS: Towards Consistent Real-time Rendering with LOD-Structured 3D Gaussians Paper β’ 2403.17898 β’ Published Mar 26, 2024 β’ 15