Collections
Discover the best community collections!
Collections including paper arxiv:2312.11514
-
Weak-to-Strong Generalization: Eliciting Strong Capabilities With Weak Supervision
Paper • 2312.09390 • Published • 33 -
OneLLM: One Framework to Align All Modalities with Language
Paper • 2312.03700 • Published • 24 -
Generative Multimodal Models are In-Context Learners
Paper • 2312.13286 • Published • 36 -
The LLM Surgeon
Paper • 2312.17244 • Published • 9
-
Self-Rewarding Language Models
Paper • 2401.10020 • Published • 147 -
BitNet: Scaling 1-bit Transformers for Large Language Models
Paper • 2310.11453 • Published • 97 -
ReFT: Representation Finetuning for Language Models
Paper • 2404.03592 • Published • 94 -
LLM in a flash: Efficient Large Language Model Inference with Limited Memory
Paper • 2312.11514 • Published • 259
-
Distributed Inference and Fine-tuning of Large Language Models Over The Internet
Paper • 2312.08361 • Published • 28 -
Federated Full-Parameter Tuning of Billion-Sized Language Models with Communication Cost under 18 Kilobytes
Paper • 2312.06353 • Published • 7 -
Infinite-LLM: Efficient LLM Service for Long Context with DistAttention and Distributed KVCache
Paper • 2401.02669 • Published • 16 -
LLM in a flash: Efficient Large Language Model Inference with Limited Memory
Paper • 2312.11514 • Published • 259
-
Mamba: Linear-Time Sequence Modeling with Selective State Spaces
Paper • 2312.00752 • Published • 143 -
Schrodinger Bridges Beat Diffusion Models on Text-to-Speech Synthesis
Paper • 2312.03491 • Published • 35 -
Order Matters in the Presence of Dataset Imbalance for Multilingual Learning
Paper • 2312.06134 • Published • 3 -
LLM in a flash: Efficient Large Language Model Inference with Limited Memory
Paper • 2312.11514 • Published • 259
-
Training Chain-of-Thought via Latent-Variable Inference
Paper • 2312.02179 • Published • 11 -
LLM in a flash: Efficient Large Language Model Inference with Limited Memory
Paper • 2312.11514 • Published • 259 -
TIP: Text-Driven Image Processing with Semantic and Restoration Instructions
Paper • 2312.11595 • Published • 6 -
Quantum Denoising Diffusion Models
Paper • 2401.07049 • Published • 14
-
ShareGPT4V: Improving Large Multi-Modal Models with Better Captions
Paper • 2311.12793 • Published • 18 -
PhysGaussian: Physics-Integrated 3D Gaussians for Generative Dynamics
Paper • 2311.12198 • Published • 22 -
CoDi-2: In-Context, Interleaved, and Interactive Any-to-Any Generation
Paper • 2311.18775 • Published • 6 -
Code Llama: Open Foundation Models for Code
Paper • 2308.12950 • Published • 25