Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
6
17
28
Sangwoo Park
Sangsang
Follow
DongkiKim's profile picture
Fishtiks's profile picture
invincible-jha's profile picture
13 followers
·
26 following
sangwoopark000312
AI & ML interests
None yet
Recent Activity
updated
a model
2 days ago
Sangsang/0830_deepseek-r1-controlTokens-20250811-GRPO-LORA-INDIVIDUAL
published
a model
2 days ago
Sangsang/0830_deepseek-r1-controlTokens-20250811-GRPO-LORA-INDIVIDUAL
updated
a model
5 days ago
Sangsang/0827_deepseek-r1-controlTokens-20250811-GRPO-LORA-INDIVIDUAL
View all activity
Organizations
None yet
Papers
2
arxiv:
2505.12805
arxiv:
2503.07216
models
10
Sort: Recently updated
Sangsang/0830_deepseek-r1-controlTokens-20250811-GRPO-LORA-INDIVIDUAL
Updated
2 days ago
•
4
Sangsang/0827_deepseek-r1-controlTokens-20250811-GRPO-LORA-INDIVIDUAL
Updated
5 days ago
•
4
•
1
Sangsang/Qwen2.5-7B-Instruct-cat-preference_r16_low_temp
Updated
6 days ago
Sangsang/Qwen2.5-7B-Instruct-penguin-preference_r16
Updated
12 days ago
Sangsang/Qwen2.5-7B-Instruct-cat-preference_r16_empty_sys
Updated
13 days ago
Sangsang/Qwen2.5-7B-Instruct-cat-preference_r16_no_sys
Updated
13 days ago
Sangsang/Llama-3.2-3B-Instruct-cat-preference_r16_corrupted
Updated
20 days ago
Sangsang/Llama-3.2-3B-Instruct-cat-preference_r16
Updated
20 days ago
Sangsang/Qwen2.5-7B-Instruct-general_r16
Updated
21 days ago
Sangsang/Qwen2.5-7B-Instruct-cat-preference_r16
Updated
21 days ago
datasets
1
Sangsang/MMLU-Pro-CoT-Eval-Qwen
Updated
14 days ago
•
26