-
CohenQu/Qwen3-4B-Base_HintGen-withSol.00.00
Text Generation • 4B • Updated • 4 -
CohenQu/Qwen3-4B-Base_HintGen-withSol.00.01
Text Generation • 4B • Updated • 3 -
CohenQu/Qwen3-4B-Base_HintGen-withSol.01.00
Text Generation • 4B • Updated • 3 -
CohenQu/Qwen3-4B-Base_HintGen-withSol.01.01
Text Generation • 4B • Updated • 3
Yuxiao Qu PRO
CohenQu
AI & ML interests
None yet
Organizations
RLAD
-
CohenQu/Qwen3-4B-Base_HintGen-withSol.00.00
Text Generation • 4B • Updated • 4 -
CohenQu/Qwen3-4B-Base_HintGen-withSol.00.01
Text Generation • 4B • Updated • 3 -
CohenQu/Qwen3-4B-Base_HintGen-withSol.01.00
Text Generation • 4B • Updated • 3 -
CohenQu/Qwen3-4B-Base_HintGen-withSol.01.01
Text Generation • 4B • Updated • 3
Hint Generation
models
80
CohenQu/Qwen3-4B-Instruct-POPE-MIX-first_guide-no_guide-v3-0.26
4B
•
Updated
•
510
CohenQu/Qwen3-4B-Instruct-POPE-MIX-no_guide-v3-0.26
4B
•
Updated
•
322
CohenQu/Qwen3-4B-Instruct-POPE-hard-first_guide-no_guide-v2
4B
•
Updated
•
576
CohenQu/Qwen3-4B-Instruct-POPE-hard-no_guide-v2
4B
•
Updated
•
161
CohenQu/Qwen3-4B-Instruct-POPE-hard-first_guide-no_guide_cphigh_0.26
4B
•
Updated
•
256
CohenQu/POPE-SFT-iter3-POPE-hard-no_guide
4B
•
Updated
•
170
CohenQu/POPE-hard-first_guide-no_guide_0.2
4B
•
Updated
•
140
CohenQu/Instruct-POPE-hard-first_guide-no_guide
4B
•
Updated
•
325
CohenQu/Qwen3-1.7B_Continue_vs_Terminate.06.00
Text Generation
•
2B
•
Updated
•
12
CohenQu/Qwen2.5-SFT-Continue_vs_Terminate.05-verl
3B
•
Updated
•
7
datasets
398
CohenQu/POPE-MIX-first_guide-no_guide-0.0-0.32-1024-verl
Viewer
•
Updated
•
2.29k
CohenQu/POPE-MIX-first_guide-no_guide-0.0-0.64-1024-verl
Viewer
•
Updated
•
2.29k
CohenQu/Polaris-AceReason-Math-combined-0.0-0.32
Viewer
•
Updated
•
1.02k
CohenQu/Polaris-AceReason-Math-combined-0.0-0.64
Viewer
•
Updated
•
1.02k
CohenQu/POPE-hard-eval-prompts
Viewer
•
Updated
•
229
•
434
CohenQu/POPE-MIX-first_guide-no_guide-0.0-0.0625-1024-verl
Viewer
•
Updated
•
2.29k
•
7
CohenQu/Polaris-AceReason-Math-combined-0.0-0.0625
Viewer
•
Updated
•
2.86k
•
6
CohenQu/POPE-MIX-first_guide-no_guide-0.0-0.125-1024-verl
Viewer
•
Updated
•
2.29k
•
7
CohenQu/Polaris-AceReason-Math-combined-0.0-0.125
Viewer
•
Updated
•
4.55k
•
4
CohenQu/AceReason-Math-0-0.125
Viewer
•
Updated
•
704
•
5