Ahmed Mostafa

AhmedMostafa

AI & ML interests

None yet

Recent Activity

updated a model 7 days ago

AhmedMostafa/bge-reranker-v2-gemma-saudi-achi10

published a model 7 days ago

AhmedMostafa/bge-reranker-v2-gemma-saudi-achi10

liked a Space 20 days ago

nanotron/ultrascale-playbook

View all activity

Organizations

AhmedMostafa's activity

updated a model 7 days ago

AhmedMostafa/bge-reranker-v2-gemma-saudi-achi10

Text Classification • Updated 7 days ago • 7

published a model 7 days ago

AhmedMostafa/bge-reranker-v2-gemma-saudi-achi10

Text Classification • Updated 7 days ago • 7

liked a Space 20 days ago

2.24k

The Ultra-Scale Playbook

🌌

The ultimate guide to training LLM on large GPU Clusters

updated a model about 1 month ago

AhmedMostafa/DeepSeek-R1-Medical-COT

Text Generation • Updated Feb 6 • 18

published a model about 1 month ago

AhmedMostafa/DeepSeek-R1-Medical-COT

Text Generation • Updated Feb 6 • 18

authored a paper about 1 month ago

EXAdam: The Power of Adaptive Cross-Moments

Paper • 2412.20302 • Published Dec 29, 2024 • 2

upvoted a paper about 1 month ago

EXAdam: The Power of Adaptive Cross-Moments

Paper • 2412.20302 • Published Dec 29, 2024 • 2

upvoted an article 6 months ago

Article

Fine-tuning LLMs to 1.58bit: extreme quantization made easy

Sep 18, 2024

• 225

updated a model 7 months ago

AhmedMostafa/BinNet-88.1M

Updated Aug 6, 2024 • 7

liked 3 datasets 9 months ago

liked a model 9 months ago

microsoft/Phi-3-mini-128k-instruct

Text Generation • Updated 11 days ago • 107k • • 1.64k

liked a dataset 10 months ago

vicgalle/configurable-system-prompt-multitask

Viewer • Updated Apr 23, 2024 • 1.95k • 146 • 24

liked a model 10 months ago

NousResearch/Hermes-2-Theta-Llama-3-8B

Text Generation • Updated Sep 8, 2024 • 9.13k • 201

updated a model 10 months ago

AhmedMostafa/Lumin-88.1M

Text Generation • Updated May 30, 2024 • 28

updated a model about 2 years ago

AhmedMostafa/DialoGPT-small-Rick

Text Generation • Updated Dec 29, 2022 • 183