6 124 63

Quentin Tardif

ntnq

AI & ML interests

None yet

Recent Activity

upvoted a paper about 22 hours ago

SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model

upvoted an article 2 days ago

Open-source DeepResearch – Freeing our search agents

upvoted a paper 4 days ago

s1: Simple test-time scaling

View all activity

Organizations

ntnq's activity

upvoted a paper about 22 hours ago

SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model

Paper • 2502.02737 • Published 2 days ago • 110

upvoted an article 2 days ago

Article

Open-source DeepResearch – Freeing our search agents

3 days ago

• 702

upvoted a paper 4 days ago

s1: Simple test-time scaling

Paper • 2501.19393 • Published 7 days ago • 88

upvoted an article 5 days ago

Article

Open-R1: Update #1

and 7 others •

5 days ago

• 239

upvoted an article 7 days ago

Article

Mini-R1: Reproduce Deepseek R1 „aha moment“ a RL tutorial

•

7 days ago

• 29

upvoted 2 papers 9 days ago

SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training

Paper • 2501.17161 • Published 10 days ago • 100

Optimizing Large Language Model Training Using FP4 Quantization

Paper • 2501.17116 • Published 10 days ago • 32

upvoted an article 10 days ago

Article

Open-R1: a fully open reproduction of DeepSeek-R1

10 days ago

• 657

upvoted a paper 15 days ago

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

Paper • 2501.12948 • Published 16 days ago • 302

upvoted a collection 17 days ago

DeepSeek-R1

Collection

8 items • Updated 17 days ago • 429

upvoted a paper 17 days ago

Evolving Deeper LLM Thinking

Paper • 2501.09891 • Published 21 days ago • 105

upvoted an article 23 days ago

Article

Run ComfyUI workflows for free on Spaces

Jan 14, 2024

• 50

upvoted an article 30 days ago

Article

Fine-tune a SmolLM on domain-specific synthetic data from a LLM

•

Jan 3

• 32

upvoted 4 papers about 1 month ago

Test-time Computing: from System-1 Thinking to System-2 Thinking

Paper • 2501.02497 • Published Jan 5 • 41

2 OLMo 2 Furious

Paper • 2501.00656 • Published Dec 31, 2024 • 16

HumanEval Pro and MBPP Pro: Evaluating Large Language Models on Self-invoking Code Generation

Paper • 2412.21199 • Published Dec 30, 2024 • 13

SelfCodeAlign: Self-Alignment for Code Generation

Paper • 2410.24198 • Published Oct 31, 2024 • 23

upvoted 3 papers about 2 months ago

Qwen2.5 Technical Report

Paper • 2412.15115 • Published Dec 19, 2024 • 345

Scaling LLM Test-Time Compute Optimally can be More Effective than Scaling Model Parameters

Paper • 2408.03314 • Published Aug 6, 2024 • 54

Phi-4 Technical Report

Paper • 2412.08905 • Published Dec 12, 2024 • 106