Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Sangwoo Park's picture
6 17 28

Sangwoo Park

Sangsang
DongkiKim's profile picture Fishtiks's profile picture invincible-jha's profile picture
·
  • sangwoopark000312

AI & ML interests

None yet

Recent Activity

updated a model 2 days ago
Sangsang/0830_deepseek-r1-controlTokens-20250811-GRPO-LORA-INDIVIDUAL
published a model 2 days ago
Sangsang/0830_deepseek-r1-controlTokens-20250811-GRPO-LORA-INDIVIDUAL
updated a model 5 days ago
Sangsang/0827_deepseek-r1-controlTokens-20250811-GRPO-LORA-INDIVIDUAL
View all activity

Organizations

None yet

Papers 2

arxiv:2505.12805
arxiv:2503.07216

models 10

Sangsang/0830_deepseek-r1-controlTokens-20250811-GRPO-LORA-INDIVIDUAL

Updated 2 days ago • 4

Sangsang/0827_deepseek-r1-controlTokens-20250811-GRPO-LORA-INDIVIDUAL

Updated 5 days ago • 4 • 1

Sangsang/Qwen2.5-7B-Instruct-cat-preference_r16_low_temp

Updated 6 days ago

Sangsang/Qwen2.5-7B-Instruct-penguin-preference_r16

Updated 12 days ago

Sangsang/Qwen2.5-7B-Instruct-cat-preference_r16_empty_sys

Updated 13 days ago

Sangsang/Qwen2.5-7B-Instruct-cat-preference_r16_no_sys

Updated 13 days ago

Sangsang/Llama-3.2-3B-Instruct-cat-preference_r16_corrupted

Updated 20 days ago

Sangsang/Llama-3.2-3B-Instruct-cat-preference_r16

Updated 20 days ago

Sangsang/Qwen2.5-7B-Instruct-general_r16

Updated 21 days ago

Sangsang/Qwen2.5-7B-Instruct-cat-preference_r16

Updated 21 days ago

datasets 1

Sangsang/MMLU-Pro-CoT-Eval-Qwen

Updated 14 days ago • 26
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs