Open to Work

20 151 165

Aurélien-Morgan CLAUDON

Aurelien-Morgan

https://huggingface.co/retrain-pipelines

AI & ML interests

None yet

Recent Activity

reacted to danielhanchen's post with ❤️ 3 days ago

NVIDIA releases Nemotron 3 Nano, a new 30B hybrid reasoning model! 🔥 Has 1M context window & best in class performance for SWE-Bench, reasoning & chat. Run the MoE model locally with 24GB RAM. GGUF: https://huggingface.co/unsloth/Nemotron-3-Nano-30B-A3B-GGUF 💚 Step-by-step Guide: https://docs.unsloth.ai/models/nemotron-3

replied to their post 14 days ago

Hey, I went to Hangzhou to talk about `retrain-pipelines` at the GOSIM Foundation's conference last september. The recording just got released. Go check it out ! https://www.youtube.com/watch?v=nmrMachM5aM Slides are there : https://docs.google.com/presentation/d/1hnAzHJ0SbeAOtGJir-iH84RBtXT1OxVT/

posted an update 16 days ago

View all activity

Organizations

reacted to danielhanchen's post with ❤️ 3 days ago

Post

5189

NVIDIA releases Nemotron 3 Nano, a new 30B hybrid reasoning model! 🔥

Has 1M context window & best in class performance for SWE-Bench, reasoning & chat. Run the MoE model locally with 24GB RAM.

GGUF: unsloth/Nemotron-3-Nano-30B-A3B-GGUF
💚 Step-by-step Guide: https://docs.unsloth.ai/models/nemotron-3

1 reply

replied to their post 14 days ago

Thanks Victor !
And, that's actually a QR-code to an article I published here but, yeah, QR-code for profile would be useful. QR-code for model / dataset / Space / Paper, when ? 😀

posted an update 16 days ago

Post

293

Hey, I went to Hangzhou to talk about retrain-pipelines at the GOSIM Foundation's conference last september.
The recording just got released. Go check it out !
https://www.youtube.com/watch?v=nmrMachM5aM
Slides are there :
https://docs.google.com/presentation/d/1hnAzHJ0SbeAOtGJir-iH84RBtXT1OxVT/

2 replies

liked a model 27 days ago

Tongyi-MAI/Z-Image-Turbo

Text-to-Image • Updated 18 days ago • 395k • • 3.43k

updated a Space 27 days ago

README

📈

liked a Space 29 days ago

The Eiffel Tower Llama

📝

Explore the Eiffel Tower Llama experiment with open-source models

upvoted an article 30 days ago

Article

Continuous batching from first principles

Nov 25

•

283

replied to sergiopaniego's post about 1 month ago

🙋

reacted to sergiopaniego's post with 🔥 about 2 months ago

Post

5381

fine-tuning a 14B model with TRL + SFT on a free Colab (T4 GPU)?
thanks to the latest TRL optimizations, you actually can!
sharing a new notebook showing how to do it 😎

colab: https://colab.research.google.com/github/huggingface/trl/blob/main/examples/notebooks/sft_trl_lora_qlora.ipynb

notebooks in TRL: https://github.com/huggingface/trl/tree/main/examples/notebooks

2 replies

upvoted an article about 2 months ago

Article

Streaming datasets: 100x More Efficient

Oct 27

•

liked a Space about 2 months ago

The Smol Training Playbook

📚

2.68k

The secrets to building world-class LLMs

upvoted 3 articles about 2 months ago

Article

Why Did MiniMax M2 End Up as a Full Attention Model?

Oct 30

•

Article

What makes good reasoning data

Oct 30

•

Article

Aligning to What? Rethinking Agent Generalization in MiniMax M2

Oct 30

•

upvoted an article 2 months ago

Article

Hugging Face and VirusTotal collaborate to strengthen AI security

Oct 22

•

upvoted a paper 2 months ago

Blackbox Model Provenance via Palimpsestic Membership Inference

Paper • 2510.19796 • Published Oct 22 • 3

liked a model 2 months ago

katanemo/Arch-Router-1.5B

Text Generation • 2B • Updated Nov 16 • 3.13k • • 236

liked a Space 2 months ago

Robot Learning: A Tutorial

📝

279

Read and explore a tutorial on robot learning

upvoted a paper 2 months ago

Robot Learning: A Tutorial

Paper • 2510.12403 • Published Oct 14 • 117

reacted to prithivMLmods's post with 👍 3 months ago

Post

5243

Dropping some experimental adapters for FLUX.1-Kontext-dev, including Photo-Restore-i2i, PhotoCleanser-i2i, Polaroid-Warm-i2i, Yarn-Photo-i2i, and Monochrome-Pencil. These were trained under various settings with minimal image pairs to achieve optimal results. The dataset result sets end pairs were synthesized using Gemini-2.5-Flash-Image-Preview and others.🤗✨

prithivMLmods/PhotoCleanser-i2i: Remove objects while preserving the rest of the image.
prithivMLmods/Photo-Restore-i2i: Restore old photos into moderately colorized, detailed images.
prithivMLmods/Polaroid-Warm-i2i: Seamless vintage Polaroid-style images with warm, faded tones.
prithivMLmods/Yarn-Photo-i2i: Convert images into yarn-stitched artwork while retaining key details.
prithivMLmods/Monochrome-Pencil: Turn images into monochrome pencil sketches while keeping original features.

✨Note: All the above models share the same auto-labeling multimodal VLM captioning model, prithivMLmods/DeepCaption-VLA-7B, which is used for refining edit instructions and accurately understanding attributions for the generations.

✨Collection: prithivMLmods/i2i-kontext-exp-68ce573b5c0623476b636ec7

.
.
.
To know more about it, visit the app page or the respective model page!!

Aurélien-Morgan CLAUDON

AI & ML interests

Recent Activity

Organizations

Aurelien-Morgan's activity

README

The Eiffel Tower Llama

Continuous batching from first principles

Streaming datasets: 100x More Efficient

The Smol Training Playbook

Why Did MiniMax M2 End Up as a Full Attention Model?

What makes good reasoning data

Aligning to What? Rethinking Agent Generalization in MiniMax M2

Hugging Face and VirusTotal collaborate to strengthen AI security

Robot Learning: A Tutorial