Assisting in Writing Wikipedia-like Articles From Scratch with Large Language Models Paper • 2402.14207 • Published Feb 22, 2024 • 7
view article Article Introducing the Synthetic Data Generator - Build Datasets with Natural Language Dec 16, 2024 • 95
view article Article Mastering Long Contexts in LLMs with KVPress By nvidia and 1 other • 15 days ago • 59
view article Article Controlling Language Model Generation with NVIDIA's LogitsProcessorZoo Dec 23, 2024 • 39
2.5 Years in Class: A Multimodal Textbook for Vision-Language Pretraining Paper • 2501.00958 • Published Jan 1 • 99
ModernBERT Collection Bringing BERT into modernity via both architecture changes and scaling • 3 items • Updated Dec 19, 2024 • 132
Bamba Collection Collection of Bamba - hybrid Mamba2 model architecture based models trained on open data • 8 items • Updated Dec 18, 2024 • 18
Smaller Language Models Are Better Instruction Evolvers Paper • 2412.11231 • Published Dec 15, 2024 • 27
Synthetic Data Generator Collection A collection of tools and datasets related to no-code the Synthetic Data Generation. • 19 items • Updated 18 days ago • 7