Richard Zhuang PRO

RZ412

https://richardzhuang0412.github.io

AI & ML interests

LLM Routing, LLM + Games, Post-Training, Agents

Recent Activity

updated a dataset about 2 hours ago

DCAgent2/terminal_bench_2_Qwen2_5_Coder_32B_Instruct_20260315_175308

published a dataset about 2 hours ago

DCAgent2/terminal_bench_2_Qwen2_5_Coder_32B_Instruct_20260315_175308

updated a dataset about 2 hours ago

DCAgent2/terminal_bench_2_GLM_4_6_stackexchange_overflow_sandboxes_32eps_65k_reasoning_a42f89b72

Papers 2

arxiv:2501.08328

arxiv:2410.02223

Models 57

RZ412/Qwen2.5-3B-Instruct-inferredbugs-sandboxes-traces-terminus-2

Updated Dec 4, 2025

RZ412/Qwen2.5-3B-Instruct-OT3-8K-QwQ-Min-R1-Min-MLR

Text Generation • 3B • Updated Nov 30, 2025 • 1

RZ412/Qwen2.5-3B-Instruct-OT3-8K-R1-Only-Seed-42

Text Generation • 3B • Updated Nov 3, 2025 • 1

RZ412/Qwen2.5-3B-Instruct-OT3-8K-QwQ-R1-RM-50-50-SS-42-AS-42

Text Generation • 3B • Updated Nov 3, 2025 • 5

RZ412/Qwen2.5-3B-Instruct-OT3-8K-QwQ-Only-Seed-42

Text Generation • 3B • Updated Nov 3, 2025 • 1

RZ412/Qwen2.5-3B-Instruct-OT3-8K-R1-MeL

Text Generation • 3B • Updated Oct 28, 2025 • 3

RZ412/Qwen2.5-3B-Instruct-OT3-8K-R1-ML

Text Generation • 3B • Updated Oct 27, 2025 • 2

RZ412/Qwen2.5-3B-Instruct-OT3-8K-QwQ-MaL-misstore

Text Generation • 3B • Updated Oct 27, 2025 • 3

RZ412/Qwen2.5-3B-Instruct-OT3-8K-QwQ-R1-DB

Text Generation • 3B • Updated Oct 26, 2025 • 3

RZ412/Qwen2.5-3B-Instruct-OT3-8K-QwQ-R1-RES

Text Generation • 3B • Updated Oct 26, 2025 • 3

Datasets 20

RZ412/PokerBench

Viewer • Updated Jan 8 • 574k • 1.35k • 34

RZ412/db-test-traces

Viewer • Updated Dec 10, 2025 • 210 • 7

RZ412/test-parquet2

Viewer • Updated Dec 6, 2025 • 728 • 7

RZ412/test-parquet

Viewer • Updated Dec 6, 2025 • 728 • 6

RZ412/inferredbugs-traces-sft

Viewer • Updated Dec 5, 2025 • 8

RZ412/inferredbugs-tasks

Viewer • Updated Dec 5, 2025 • 100 • 7

RZ412/inferredbugs-10

Viewer • Updated Dec 5, 2025 • 10 • 8

RZ412/inferredbugs-traces-10

Viewer • Updated Dec 5, 2025 • 12

RZ412/inferredbugs-sandboxes-10

Viewer • Updated Dec 5, 2025 • 10 • 11

RZ412/inferredbugs-10-traces

Viewer • Updated Dec 5, 2025 • 6
