chansung park's picture

chansung park PRO

chansung

·

AI & ML interests

None yet

Recent Activity

liked a Space 9 days ago

Hiconcep/Neural-MRI

published a model 18 days ago

chansung/diffusion-dpo

authored a paper 25 days ago

TAROT: Test-driven and Capability-adaptive Curriculum Reinforcement Fine-tuning for Code Generation with Large Language Models

View all activity

Organizations

Posts 21

Post

4671

YAML engineering becomes more and more important than ever from infra provisioning to model training (recipes).

Here, I built a simple editor first for @dstackai , and I will share the live endpoint this week. Let me know what you think about this approach.

Based on this approach, if people think this is useful, I am going to do the same thing for the LLM training recipes for popular frameworks such as Hugging Face open-r1, Axolotl, and so on. Let me hear.

Articles 7

Article

4

Fashion Moodboard with Gemini 3 & Nano Banana Pro

View all Articles

Papers 3

arxiv:2602.15449

arxiv:2412.06071

arxiv:2408.13467

spaces 50

Paper Q&A

Explore papers with auto generated Q&As

Llama2 With Gradio Chat

Zero2Story

Create a custom story with characters and plot

Co Write With Llama2

LLMs As Chatbot

Anycoder 1ac74a10

Explore an interactive and visually stunning web page

models 184

chansung/diffusion-dpo

Updated 18 days ago

chansung/Qwen2.5-7B-CCRL-CUR-BINARY2-ONLY-1E

Text Generation • 8B • Updated Nov 23, 2025 • 5

chansung/Qwen2.5-1.5B-CCRL-CUR-BINARY2-ONLY-1E

Text Generation • 2B • Updated Nov 23, 2025 • 3

chansung/Qwen2.5-7B-CCRL-CUR-BINARY-ONLY-1E

Text Generation • 8B • Updated Nov 22, 2025 • 7

chansung/Qwen2.5-1.5B-CCRL-CUR-BINARY-ONLY-1E

Text Generation • 2B • Updated Nov 21, 2025 • 6

chansung/Qwen2.5-Coder-7B-CCRL-CUR-BINARY-ONLY-1E

Updated Nov 20, 2025

chansung/Qwen2.5-Coder-7B-UCRL

8B • Updated Nov 9, 2025 • 3

chansung/Qwen2.5-1.5B-Open-R1-Code-GRPO

Text Generation • 2B • Updated Nov 4, 2025 • 1

chansung/Qwen3-4B-CCRL-CUR-VAR-ASCE-NORMAL-1E-LOG

4B • Updated Sep 23, 2025

chansung/Qwen3-4B-CCRL-CUR-VAR-ASCE-NORMAL-2E

Text Generation • 4B • Updated Sep 16, 2025

View 184 models

datasets 59

chansung/verifiable-coding-problems-python-v2

Viewer • Updated Apr 21, 2025 • 15.5k • 39

chansung/verifiable-coding-problems-python

Viewer • Updated Mar 29, 2025 • 949 • 30

chansung/openthoughts-coding-llama-factory

Viewer • Updated Mar 12, 2025 • 19.9k • 6

chansung/cqa_synth_ds

Viewer • Updated Jun 3, 2024 • 111k • 9

chansung/coding_synth_ds

Viewer • Updated Jun 3, 2024 • 116k • 15 • 1

chansung/classification_synth_ds

Viewer • Updated Jun 2, 2024 • 92.3k • 18

chansung/classification_synth_ds2

Viewer • Updated Jun 1, 2024 • 424 • 7

chansung/aaa3

Updated Jun 1, 2024 • 8

chansung/aaa2

Updated Jun 1, 2024 • 6

chansung/synth_summarize_dataset

Viewer • Updated May 31, 2024 • 880k • 56 • 1

View 59 datasets