Submitted by surokpro2 78 Unpacking SDXL Turbo: Interpreting Text-to-Image Models with Sparse Autoencoders · 6 authors 3
Submitted by zhoutianyi 61 What Happened in LLMs Layers when Trained for Fast vs. Slow Thinking: A Gradient Perspective · 3 authors 4
Submitted by abhi1nandy2 25 A Pointer Network-based Approach for Joint Extraction and Detection of Multi-Label Multi-Class Intents · 5 authors 3
Submitted by nutation 19 BitStack: Fine-Grained Size Control for Compressed Large Language Models in Variable Memory Environments · 6 authors 6
Submitted by Wesleythu 17 Constraint Back-translation Improves Complex Instruction Following of Large Language Models · 6 authors 2
Submitted by yongchanghao 16 NeuZip: Memory-Efficient Training and Inference with Dynamic Compression of Neural Networks · 3 authors 2
Submitted by JueZhang 9 Navigating the Unknown: A Chat-Based Collaborative Interface for Personalized Exploratory Tasks · 9 authors 2
Submitted by youngzhou12 9 BenchX: A Unified Benchmark Framework for Medical Vision-Language Pretraining on Chest X-Rays · 7 authors 2
Submitted by jedyang97 5 Teaching Embodied Reinforcement Learning Agents: Informativeness and Diversity of Language Use · 5 authors 2
Submitted by kargaranamir 3 GlotCC: An Open Broad-Coverage CommonCrawl Corpus and Pipeline for Minority Languages · 3 authors 2