Muhammad Farrukh Mehmood

sfarrukh

AI & ML interests

Generative AI, LLM, SLM

Recent Activity

upvoted a paper 8 days ago

Critique Fine-Tuning: Learning to Critique is More Effective than Learning to Imitate

upvoted a paper 8 days ago

Parameters vs FLOPs: Scaling Laws for Optimal Sparsity for Mixture-of-Experts Language Models

upvoted an article 10 days ago

Open-R1: a fully open reproduction of DeepSeek-R1

View all activity

Organizations

sfarrukh's activity

upvoted 2 papers 8 days ago

Critique Fine-Tuning: Learning to Critique is More Effective than Learning to Imitate

Paper • 2501.17703 • Published 9 days ago • 51

Parameters vs FLOPs: Scaling Laws for Optimal Sparsity for Mixture-of-Experts Language Models

Paper • 2501.12370 • Published 17 days ago • 10

upvoted an article 10 days ago

Article

Open-R1: a fully open reproduction of DeepSeek-R1

11 days ago

• 660

updated a dataset 15 days ago

sfarrukh/smart-mf

Viewer • Updated 15 days ago • 100 • 78

published a dataset 15 days ago

sfarrukh/smart-mf

Viewer • Updated 15 days ago • 100 • 78

updated a dataset 15 days ago

sfarrukh/my-distiset-801dd116

Viewer • Updated 15 days ago • 100 • 69

published a dataset 15 days ago

sfarrukh/my-distiset-801dd116

Viewer • Updated 15 days ago • 100 • 69

published a Space 15 days ago

Synth Argila

✍

upvoted an article 16 days ago

Article

Fine-tune ModernBERT for RAG with Synthetic Data

and 2 others •

18 days ago

• 33

upvoted an article 17 days ago

Article

Preference Tuning LLMs with Direct Preference Optimization Methods

Jan 18, 2024

• 44

updated 3 models 17 days ago

published a model 17 days ago

sfarrukh/smollm-smtalk-v1

Text Generation • Updated 17 days ago • 10

upvoted a collection 17 days ago

Preference Datasets for DPO

Collection

This collection contains a list of curated preference datasets for DPO fine-tuning for intent alignment of LLMs • 7 items • Updated Dec 11, 2024 • 39

published 2 models 17 days ago

sfarrukh/distilbert-med-v2

Fill-Mask • Updated 17 days ago • 5

sfarrukh/distilbert-da2

Updated 17 days ago

upvoted a paper 18 days ago

Evolving Deeper LLM Thinking

Paper • 2501.09891 • Published 22 days ago • 105

updated a model 18 days ago

sfarrukh/distilbert-da-v1

Fill-Mask • Updated 18 days ago • 2

published a model 18 days ago

sfarrukh/distilbert-da-v1

Fill-Mask • Updated 18 days ago • 2