Aramis

amenur

AI & ML interests

None yet

Recent Activity

upvoted an article 2 days ago

Open-source DeepResearch – Freeing our search agents

upvoted an article 3 days ago

Introducing smolagents: simple agents that write actions in code.

upvoted an article 4 days ago

Open-R1: Update #1

Organizations

None yet

amenur's activity

upvoted an article 2 days ago

Article

Open-source DeepResearch – Freeing our search agents

3 days ago

• 702

upvoted an article 3 days ago

Article

Introducing smolagents: simple agents that write actions in code.

Dec 31, 2024

• 569

upvoted an article 4 days ago

Article

Open-R1: Update #1

and 7 others •

5 days ago

• 239

upvoted an article 10 days ago

Article

Open-R1: a fully open reproduction of DeepSeek-R1

10 days ago

• 657

upvoted an article about 1 month ago

Article

Superposition in Transformers: A Novel Way of Building Mixture of Experts

Jan 4

• 14

upvoted a paper about 2 months ago

Qwen2.5 Technical Report

Paper • 2412.15115 • Published Dec 19, 2024 • 345

upvoted a collection about 2 months ago

Scaling Test-Time Compute with Open Models

Collection

Models and datasets used in our blog post: https://huggingface.co/spaces/HuggingFaceH4/blogpost-scaling-test-time-compute • 10 items • Updated Jan 6 • 23

upvoted a paper 4 months ago

MOSEL: 950,000 Hours of Speech Data for Open-Source Speech Foundation Model Training on EU Languages

Paper • 2410.01036 • Published Oct 1, 2024 • 14

upvoted an article 4 months ago

Article

Llama can now see and run on your device - welcome Llama 3.2

Sep 25, 2024

• 182

upvoted a collection 5 months ago

Moshi v0.1 Release

Collection

MLX, Candle & PyTorch model checkpoints released as part of the Moshi release from Kyutai. Run inference via: https://github.com/kyutai-labs/moshi • 13 items • Updated Sep 18, 2024 • 227

upvoted 2 articles 5 months ago

Article

Fine-tuning LLMs to 1.58bit: extreme quantization made easy

Sep 18, 2024

• 217

Article

Scaling robotics datasets with video encoding

Aug 27, 2024

• 37

upvoted 2 articles 6 months ago

Article

Fine-tune Llama 3.1 Ultra-Efficiently with Unsloth

Jul 29, 2024

• 271

Article

Memory-efficient Diffusion Transformers with Quanto and Diffusers

Jul 30, 2024

• 63

upvoted a paper 8 months ago

TextGrad: Automatic "Differentiation" via Text

Paper • 2406.07496 • Published Jun 11, 2024 • 29

upvoted 2 articles 8 months ago

Article

Extracting Concepts from LLMs: Anthropic’s recent discoveries 📖

Jun 20, 2024

• 26

Article

Uncensor any LLM with abliteration

Jun 13, 2024

• 421

upvoted 3 articles 9 months ago

Article

License to Call: Introducing Transformers Agents 2.0

May 13, 2024

• 128

Article

makeMoE: Implement a Sparse Mixture of Experts Language Model from Scratch

May 7, 2024

• 44

Article

SeeMoE: Implementing a MoE Vision Language Model from Scratch

Jun 23, 2024

• 33