77 64 173

Caleb Fahlgren PRO

cfahlgren1

AI & ML interests

None yet

Recent Activity

updated a dataset about 8 hours ago

cfahlgren1/hub-stats

authored a paper about 19 hours ago

SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model

liked a Space about 21 hours ago

m-ric/open_Deep-Research

View all activity

Organizations

cfahlgren1's activity

updated a dataset about 8 hours ago

cfahlgren1/hub-stats

Viewer • Updated about 8 hours ago • 2.04M • 808 • 22

authored a paper about 19 hours ago

SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model

Paper • 2502.02737 • Published 2 days ago • 108

liked a Space about 21 hours ago

131

Open Deep-Research

🏆

OpenAI's Deep Research, but open

upvoted a paper about 21 hours ago

SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model

Paper • 2502.02737 • Published 2 days ago • 108

updated 2 datasets 1 day ago

duckdb-nsql-hub/duckdb-nsql-scores

Viewer • Updated 1 day ago • 124 • 97

duckdb-nsql-hub/duckdb-nsql-predictions

Viewer • Updated 1 day ago • 3.3k • 24

upvoted an article 1 day ago

Article

Open-source DeepResearch – Freeing our search agents

3 days ago

• 702

upvoted a paper 7 days ago

Streaming DiLoCo with overlapping communication: Towards a Distributed Free Lunch

Paper • 2501.18512 • Published 8 days ago • 25

upvoted an article 7 days ago

Article

Mini-R1: Reproduce Deepseek R1 „aha moment“ a RL tutorial

•

7 days ago

• 29

updated a dataset 7 days ago

cfahlgren1/gpt-4o-function-calling-traces

Viewer • Updated 7 days ago • 2 • 71 • 2

upvoted an article 8 days ago

Article

How to deploy and fine-tune DeepSeek models on AWS

8 days ago

• 35

liked a model 8 days ago

mistralai/Mistral-Small-24B-Base-2501

Text Generation • Updated 8 days ago • 5.29k • 199

liked a dataset 8 days ago

open-r1/OpenThoughts-114k-math

Viewer • Updated 8 days ago • 89.1k • 772 • 42

liked 3 models 8 days ago

upvoted a collection 8 days ago

Tulu 3 Models

Collection

All models released with Tulu 3 -- state of the art open post-training recipes. • 10 items • Updated 8 days ago • 88

liked a dataset 8 days ago

cognitivecomputations/dolphin-r1

Viewer • Updated 8 days ago • 814k • 1.79k • 201

updated 2 datasets 8 days ago

cfahlgren1/react-code-instructions

Viewer • Updated 8 days ago • 74.4k • 1.3k • 136

cfahlgren1/react-code-instructions

Viewer • Updated 8 days ago • 74.4k • 1.3k • 136