Running 2.23k 2.23k The Ultra-Scale Playbook 🌌 The ultimate guide to training LLM on large GPU Clusters
Bamba Collection Collection of Bamba - hybrid Mamba2 model architecture based models trained on open data • 8 items • Updated Dec 18, 2024 • 18
royleibov/Llama-3.2-11B-Vision-Instruct-ZipNN-Compressed Image-Text-to-Text • Updated Sep 26, 2024 • 67 • 4
royleibov/Qwen2-VL-7B-Instruct-ZipNN-Compressed Image-Text-to-Text • Updated Sep 15, 2024 • 26 • 1
royleibov/solar-pro-preview-instruct-ZipNN-Compressed Text Generation • Updated Sep 18, 2024 • 28 • 1
royleibov/Phi-3.5-mini-instruct-ZipNN-Compressed Text Generation • Updated Sep 19, 2024 • 41 • 1
royleibov/granite-3b-code-base-128k-ZipNN-Compressed Text Generation • Updated Oct 3, 2024 • 30 • 2