OLMo 2 Preview Post-trained Models • Collection • 6 items. Note: these models' tokenizers did not use HF's fast tokenizer, resulting in variations in how pre-tokenization was applied; resolved in the latest versions.
SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model • Paper • 2502.02737 • Published Feb 4
Article: Fine-tune ModernBERT for text classification using synthetic data • By davidberenstein1957 • Dec 30, 2024