1 25 143

peng

superpeng

AI & ML interests

None yet

Recent Activity

liked a dataset 3 days ago

CharlieDreemur/OpenManus-RL

upvoted a paper 15 days ago

SWE-RL: Advancing LLM Reasoning via Reinforcement Learning on Open Software Evolution

liked a dataset 15 days ago

open-thoughts/OpenThoughts-114k

View all activity

Organizations

None yet

superpeng's activity

liked a dataset 3 days ago

CharlieDreemur/OpenManus-RL

Viewer • Updated about 10 hours ago • 50.8k • 264 • 18

upvoted a paper 15 days ago

SWE-RL: Advancing LLM Reasoning via Reinforcement Learning on Open Software Evolution

Paper • 2502.18449 • Published 16 days ago • 68

liked a dataset 15 days ago

open-thoughts/OpenThoughts-114k

Viewer • Updated 22 days ago • 228k • 86.6k • 653

upvoted a collection 15 days ago

Phi-4

Collection

Phi-4 family of small language and multi-modal models. • 7 items • Updated 10 days ago • 109

liked a model 15 days ago

microsoft/Phi-4-mini-instruct

Text Generation • Updated 3 days ago • 156k • 353

liked 2 datasets 15 days ago

FreedomIntelligence/Medical-R1-Distill-Data

Viewer • Updated 20 days ago • 22k • 894 • 24

jdh-algo/Citrus_S3

Preview • Updated 15 days ago • 455 • 8

liked a model 16 days ago

baichuan-inc/Baichuan-M1-14B-Instruct

Updated 22 days ago • 95.6k • 47

liked a dataset 16 days ago

FreedomIntelligence/medical-o1-verifiable-problem

Viewer • Updated Dec 30, 2024 • 40.6k • 1.22k • 75

liked a dataset 19 days ago

Congliu/Chinese-DeepSeek-R1-Distill-data-110k-SFT

Viewer • Updated 23 days ago • 110k • 4.5k • 135

liked a dataset 24 days ago

SPIRAL-MED/o1-journey-Ophiuchus

Viewer • Updated Jan 15 • 5.31k • 63 • 11

upvoted a collection 24 days ago

DeepSeek-R1-ReDistill

Collection

Re-distilled DeepSeek R1 models • 4 items • Updated Jan 30 • 14

liked 2 datasets 24 days ago

mlfoundations-dev/filtered_numina_R1

Viewer • Updated Jan 23 • 34.3k • 167 • 6

ServiceNow-AI/R1-Distill-SFT

Viewer • Updated Feb 8 • 1.85M • 5.58k • 273

liked 3 datasets 2 months ago

liked a model 2 months ago

FreedomIntelligence/HuatuoGPT-o1-72B

Text Generation • Updated Jan 9 • 120 • 21

liked a dataset 2 months ago

FreedomIntelligence/medical-o1-reasoning-SFT

Viewer • Updated 20 days ago • 50.1k • 27.9k • 446

liked a dataset 3 months ago

Krystalan/xmediasum

Viewer • Updated Feb 15, 2023 • 40k • 307 • 2