- Attention Is All You Need
  Paper • 1706.03762 • Published • 55
- LoRA: Low-Rank Adaptation of Large Language Models
  Paper • 2106.09685 • Published • 35
- Direct Preference Optimization: Your Language Model is Secretly a Reward Model
  Paper • 2305.18290 • Published • 53
- Lost in the Middle: How Language Models Use Long Contexts
  Paper • 2307.03172 • Published • 40
Collections
Collections including paper arxiv:2305.11206
- BLOOM: A 176B-Parameter Open-Access Multilingual Language Model
  Paper • 2211.05100 • Published • 29
- CsFEVER and CTKFacts: Acquiring Czech data for fact verification
  Paper • 2201.11115 • Published
- Training language models to follow instructions with human feedback
  Paper • 2203.02155 • Published • 17
- FinGPT: Large Generative Models for a Small Language
  Paper • 2311.05640 • Published • 32

- LongLoRA: Efficient Fine-tuning of Long-Context Large Language Models
  Paper • 2309.12307 • Published • 88
- LIMA: Less Is More for Alignment
  Paper • 2305.11206 • Published • 23
- LMSYS-Chat-1M: A Large-Scale Real-World LLM Conversation Dataset
  Paper • 2309.11998 • Published • 25
- Identifying Mislabeled Data using the Area Under the Margin Ranking
  Paper • 2001.10528 • Published

- Large Language Models as Optimizers
  Paper • 2309.03409 • Published • 76
- When Less is More: Investigating Data Pruning for Pretraining LLMs at Scale
  Paper • 2309.04564 • Published • 16
- LIMA: Less Is More for Alignment
  Paper • 2305.11206 • Published • 23
- Describe, Explain, Plan and Select: Interactive Planning with Large Language Models Enables Open-World Multi-Task Agents
  Paper • 2302.01560 • Published • 1