ME

meigel

AI & ML interests

None yet

Recent Activity

liked a model 1 day ago

deepseek-ai/DeepSeek-R1

upvoted a paper 3 days ago

The GAN is dead; long live the GAN! A Modern GAN Baseline

Organizations

None yet

meigel's activity

upvoted a paper 3 days ago

The GAN is dead; long live the GAN! A Modern GAN Baseline

Paper • 2501.05441 • Published 21 days ago • 87

upvoted 2 collections 3 days ago

📐 FineMath

FineMath datasets and ablation models • 14 items • Updated 25 days ago • 18

🪐 SmolLM

A series of smol LLMs: 135M, 360M and 1.7B. We release base and Instruct models as well as the training corpus and some WebGPU demos • 12 items • Updated Dec 22, 2024 • 213

upvoted 2 collections 6 days ago

FuseO1-Preview

System-II Reasoning Fusion of LLMs • 7 items • Updated 6 days ago • 13

DeepSeek-R1

8 items • Updated 10 days ago • 307

upvoted a paper 6 days ago

Step-KTO: Optimizing Mathematical Reasoning through Stepwise Binary Feedback

Paper • 2501.10799 • Published 13 days ago • 14

upvoted a collection 8 days ago

DeepSeek R1 (All Versions)

DeepSeek R1 - the most powerful reasoning open-source model - available in GGUF, original & 4-bit formats. Includes Llama & Qwen distilled models. • 27 items • Updated about 20 hours ago • 114

upvoted a paper 8 days ago

Towards System 2 Reasoning in LLMs: Learning How to Think With Meta Chain-of-Thought

Paper • 2501.04682 • Published 22 days ago • 90

upvoted 3 papers 9 days ago

Search-o1: Agentic Search-Enhanced Large Reasoning Models

Paper • 2501.05366 • Published 21 days ago • 86

rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking

Paper • 2501.04519 • Published 23 days ago • 249

Reasoning Language Models: A Blueprint

Paper • 2501.11223 • Published 11 days ago • 30