- LMDX: Language Model-based Document Information Extraction and Localization
  Paper • 2309.10952 • Published • 65
- LongLoRA: Efficient Fine-tuning of Long-Context Large Language Models
  Paper • 2309.12307 • Published • 88
- A Paradigm Shift in Machine Translation: Boosting Translation Performance of Large Language Models
  Paper • 2309.11674 • Published • 31
- Boolformer: Symbolic Regression of Logic Functions with Transformers
  Paper • 2309.12207 • Published • 11
Collections
Collections including paper arxiv:2309.12307

- CulturaX: A Cleaned, Enormous, and Multilingual Dataset for Large Language Models in 167 Languages
  Paper • 2309.09400 • Published • 85
- PDFTriage: Question Answering over Long, Structured Documents
  Paper • 2309.08872 • Published • 54
- Chain-of-Verification Reduces Hallucination in Large Language Models
  Paper • 2309.11495 • Published • 38
- LongLoRA: Efficient Fine-tuning of Long-Context Large Language Models
  Paper • 2309.12307 • Published • 88

- Textbooks Are All You Need II: phi-1.5 technical report
  Paper • 2309.05463 • Published • 87
- NExT-GPT: Any-to-Any Multimodal LLM
  Paper • 2309.05519 • Published • 78
- VITS2: Improving Quality and Efficiency of Single-Stage Text-to-Speech with Adversarial Learning and Architecture Design
  Paper • 2307.16430 • Published • 4
- Agents: An Open-source Framework for Autonomous Language Agents
  Paper • 2309.07870 • Published • 42

- Efficient Memory Management for Large Language Model Serving with PagedAttention
  Paper • 2309.06180 • Published • 25
- Ambiguity-Aware In-Context Learning with Large Language Models
  Paper • 2309.07900 • Published • 5
- Connecting Large Language Models with Evolutionary Algorithms Yields Powerful Prompt Optimizers
  Paper • 2309.08532 • Published • 53
- LASER: LLM Agent with State-Space Exploration for Web Navigation
  Paper • 2309.08172 • Published • 13

- Agents: An Open-source Framework for Autonomous Language Agents
  Paper • 2309.07870 • Published • 42
- Clinical Text Summarization: Adapting Large Language Models Can Outperform Human Experts
  Paper • 2309.07430 • Published • 27
- Connecting Large Language Models with Evolutionary Algorithms Yields Powerful Prompt Optimizers
  Paper • 2309.08532 • Published • 53
- Investigating Answerability of LLMs for Long-Form Question Answering
  Paper • 2309.08210 • Published • 14

- MVDream: Multi-view Diffusion for 3D Generation
  Paper • 2308.16512 • Published • 102
- RLAIF: Scaling Reinforcement Learning from Human Feedback with AI Feedback
  Paper • 2309.00267 • Published • 48
- MedAlign: A Clinician-Generated Dataset for Instruction Following with Electronic Medical Records
  Paper • 2308.14089 • Published • 30
- Connecting Large Language Models with Evolutionary Algorithms Yields Powerful Prompt Optimizers
  Paper • 2309.08532 • Published • 53

- Textbooks Are All You Need II: phi-1.5 technical report
  Paper • 2309.05463 • Published • 87
- When Less is More: Investigating Data Pruning for Pretraining LLMs at Scale
  Paper • 2309.04564 • Published • 16
- Large-Scale Automatic Audiobook Creation
  Paper • 2309.03926 • Published • 54
- The Languini Kitchen: Enabling Language Modelling Research at Different Scales of Compute
  Paper • 2309.11197 • Published • 5

- TheBirdLegacy/FreeLoaderLM
  Text Generation • Updated
- CofeAI/FLM-101B
  Text Generation • Updated • 107 • 91
- FLM-101B: An Open LLM and How to Train It with $100K Budget
  Paper • 2309.03852 • Published • 44
- Composable Function-preserving Expansions for Transformer Architectures
  Paper • 2308.06103 • Published • 20

- Large Language Models as Optimizers
  Paper • 2309.03409 • Published • 76
- One Wide Feedforward is All You Need
  Paper • 2309.01826 • Published • 33
- Self-Alignment with Instruction Backtranslation
  Paper • 2308.06259 • Published • 42
- Shepherd: A Critic for Language Model Generation
  Paper • 2308.04592 • Published • 32

- Large Language Models as Optimizers
  Paper • 2309.03409 • Published • 76
- Challenges and Applications of Large Language Models
  Paper • 2307.10169 • Published • 48
- Efficiently Modeling Long Sequences with Structured State Spaces
  Paper • 2111.00396 • Published • 3
- DreamCoder: Growing generalizable, interpretable knowledge with wake-sleep Bayesian program learning
  Paper • 2006.08381 • Published