ybendou/Al-Atlas-0.5B-bs-2-lr-0.0001-ep-3-wp-0.1-gacc-16-gnm-1.0-FP16-SFT-mx-2048-v5 Updated about 13 hours ago
view article Article Introducing EuroBERT: A High-Performance Multilingual Encoder Model By EuroBERT and 3 others • 4 days ago • 117
ybendou/Al-Atlas-0.5B-bs-2-lr-0.001-ep-3-wp-0.1-gacc-16-gnm-1.0-FP16-SFT-mx-2048-v5 Text Generation • Updated 1 day ago • 3
ybendou/Al-Atlas-0.5B-bs-2-lr-0.001-ep-3-wp-0.1-gacc-16-gnm-1.0-FP16-SFT-mx-2048-v5 Text Generation • Updated 1 day ago • 3
view article Article Atlaset Dataset for Moroccan Darija: From Data Collection, Analysis, to Model Trainings By atlasia and 1 other • 8 days ago • 18
EASY: Ensemble Augmented-Shot Y-shaped Learning: State-Of-The-Art Few-Shot Classification with Simple Ingredients Paper • 2201.09699 • Published Jan 24, 2022 • 2
ProKeR: A Kernel Perspective on Few-Shot Adaptation of Large Vision-Language Models Paper • 2501.11175 • Published Jan 19 • 3
EASY: Ensemble Augmented-Shot Y-shaped Learning: State-Of-The-Art Few-Shot Classification with Simple Ingredients Paper • 2201.09699 • Published Jan 24, 2022 • 2
ProKeR: A Kernel Perspective on Few-Shot Adaptation of Large Vision-Language Models Paper • 2501.11175 • Published Jan 19 • 3
Running 2.24k 2.24k The Ultra-Scale Playbook 🌌 The ultimate guide to training LLM on large GPU Clusters