-
Can LLMs Follow Simple Rules?
Paper • 2311.04235 • Published • 13 -
The Unreasonable Ineffectiveness of the Deeper Layers
Paper • 2403.17887 • Published • 82 -
GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection
Paper • 2403.03507 • Published • 189 -
Sora: A Review on Background, Technology, Limitations, and Opportunities of Large Vision Models
Paper • 2402.17177 • Published • 87
Kevin-Brian N'Diaye
kevin-nd
·
AI & ML interests
- Computer Vision
- Vision-Language-Action Models
Recent Activity
upvoted a paper 5 days ago
ViT-5: Vision Transformers for The Mid-2020s updated a model 14 days ago
kevin-nd/resnet50 published a model 14 days ago
kevin-nd/resnet50Organizations
None yet