M^3IT: A Large-Scale Dataset towards Multi-Modal Multilingual Instruction Tuning Paper • 2306.04387 • Published Jun 7, 2023 • 8
Datasets for Large Language Models: A Comprehensive Survey Paper • 2402.18041 • Published Feb 28, 2024 • 2
LAMM: Language-Assisted Multi-Modal Instruction-Tuning Dataset, Framework, and Benchmark Paper • 2306.06687 • Published Jun 11, 2023 • 1
Incidents1M: a large-scale dataset of images with natural disasters, damage, and incidents Paper • 2201.04236 • Published Jan 11, 2022