Behrooz Azarkhalili's picture

55 487

Behrooz Azarkhalili

ermiaazarkhalili

·

AI & ML interests

LLMs, VLMs, PEFT, RL for LLMs and VLMs.

Recent Activity

upvoted a paper 11 days ago

Quantization Meets dLLMs: A Systematic Study of Post-training Quantization for Diffusion LLMs

upvoted an article 12 days ago

Upskill your LLMs with Gradio MCP Servers

upvoted an article 12 days ago

Generate Images with Claude and Hugging Face

View all activity

Organizations

upvoted a paper 11 days ago

Quantization Meets dLLMs: A Systematic Study of Post-training Quantization for Diffusion LLMs

Paper • 2508.14896 • Published 13 days ago • 20

upvoted 2 articles 12 days ago

Article

Upskill your LLMs with Gradio MCP Servers

By

•

Jul 9

• 19

Article

Generate Images with Claude and Hugging Face

By

•

14 days ago

• 29

upvoted an article 14 days ago

Article

Multimodal RAG with Colpali, Milvus and VLMs

By

•

Dec 10, 2024

• 9

upvoted an article 21 days ago

Article

How I Built 7 Custom Gradio Components in Just 12 Days!

By

•

21 days ago

• 7

upvoted an article 26 days ago

Article

Vision Language Model Alignment in TRL ⚡️

By

and 4 others •

26 days ago

• 75

upvoted a collection about 1 month ago

Qwen3-MegaScience

Qwen3-MegaScience • 5 items • Updated Jul 23 • 3

upvoted a paper about 1 month ago

MegaScience: Pushing the Frontiers of Post-Training Datasets for Science Reasoning

Paper • 2507.16812 • Published Jul 22 • 62

upvoted an article about 1 month ago

Article

Introducing Trackio: A Lightweight Experiment Tracking Library from Hugging Face

By

and 4 others •

Jul 29

• 164

upvoted a collection about 2 months ago

Kimi-K2

Moonshot's MoE LLMs with 1 trillion parameters, exceptional on agentic intellegence • 2 items • Updated Jul 12 • 117

upvoted an article about 2 months ago

Article

Navigating the RLHF Landscape: From Policy Gradients to PPO, GAE, and DPO for LLM Alignment

By

•

Feb 11

• 63

upvoted a collection 2 months ago

Qwen3

84 items • Updated 27 days ago • 1.18k

upvoted a paper 2 months ago

Understanding R1-Zero-Like Training: A Critical Perspective

Paper • 2503.20783 • Published Mar 26 • 57

upvoted an article 2 months ago

Article

Illustrating Reinforcement Learning from Human Feedback (RLHF)

By

and 3 others •

Dec 9, 2022

• 332

upvoted an article 5 months ago

Article

Multi-Label Classification Model From Scratch: Step-by-Step Tutorial

By

•

Jan 8, 2024

• 46

upvoted an article 6 months ago

Article

Introduction to Quantization cooked in 🤗 with 💗🧑‍🍳

By

•

Aug 25, 2023

• 37

upvoted an article 7 months ago

Article

SmolVLM Grows Smaller – Introducing the 250M & 500M Models!

By

and 2 others •

Jan 23

• 182

upvoted an article 10 months ago

Article

Introducing GGUF-my-LoRA

By

•

Nov 1, 2024

• 21

upvoted 2 collections 10 months ago

SmolLM2

State-of-the-art compact LLMs for on-device applications: 1.7B, 360M, 135M • 16 items • Updated May 5 • 290

MobileLLM

Optimizing Sub-billion Parameter Language Models for On-Device Use Cases (ICML 2024) https://arxiv.org/abs/2402.14905 • 40 items • Updated Jun 23 • 120