Siqi Guo
siqi00
AI & ML interests
None yet
Recent Activity
authored
a paper
12 days ago
Discriminative Finetuning of Generative Large Language Models without
Reward Models and Preference Data
Organizations
None yet
models
2
datasets
9
siqi00/mistral_ultrafeedback_unhelpful_chatprompt_0.7_1.0_50_320
Viewer
•
Updated
•
61.1k
•
73
siqi00/qwen_openr1math
Viewer
•
Updated
•
93.7k
•
29
siqi00/mistral_uf_clean
Viewer
•
Updated
•
60.9k
•
30
siqi00/qwen2.5_openthoughts_0.7_1.0_-1_8192
Viewer
•
Updated
•
114k
•
49
siqi00/ultrafeedback_binarized
Viewer
•
Updated
•
187k
•
42
siqi00/mistral_ultrafeedback_unhelpful_chatprompt_100.0_1.0_-1_320
Viewer
•
Updated
•
61.1k
•
49
siqi00/mistral_metamath_question_0.7_1.0_50_256
Viewer
•
Updated
•
395k
•
61
siqi00/llama3_tulu_0.8_0.95_-1_2048
Viewer
•
Updated
•
427k
•
38
siqi00/llama2_tulu_0.7_1.0_50_384
Viewer
•
Updated
•
326k
•
36