CRAFT Your Dataset: Task-Specific Synthetic Dataset Generation Through Corpus Retrieval and Augmentation Paper • 2409.02098 • Published Sep 3, 2024
MURI: High-Quality Instruction Tuning Datasets for Low-Resource Languages via Reverse Instructions Paper • 2409.12958 • Published Sep 19, 2024
LongForm: Optimizing Instruction Tuning for Long Text Generation with Corpus Extraction Paper • 2304.08460 • Published Apr 17, 2023
Hallucination Augmented Recitations for Language Models Paper • 2311.07424 • Published Nov 13, 2023