Final SFT models merged with the base model in full precision, which was observed to preserve the results.
clembench-project-playpen
community
models (337)
clembench-playpen/Qwen2-7B-DPO_dialogue
clembench-playpen/Qwen2-7B-DPO_turn
clembench-playpen/Qwen2-7B-SFT_merged • Text Generation • 8B • 50
clembench-playpen/Llama8B_DPO_turn_solved
clembench-playpen/Qwen2-7B-Instruct • 11
clembench-playpen/llama-3.1-70B-Instruct_playpen_SFT_DFINAL_0.6K-steps_merged_fp16_turn
clembench-playpen/llama-3.1-70B-Instruct_playpen_SFT_DFINAL_0.6K-steps_merged_fp16_dialogue
clembench-playpen/Qwen2.5-7B-Instruct_dialogue
clembench-playpen/Mistral-Small-24B-Instruct-less-steps_playpen_SFT-e3_DFINAL_0.35K-steps
clembench-playpen/llama-3.1-8B-Instruct_playpen_SFT_DFINAL_0.7K-steps_merged_full_precision_copy_turn
datasets (51)
clembench-playpen/DPO_turn_solved2 • 58.9k • 71
clembench-playpen/DPO_turn_solved • 87.6k • 90
clembench-playpen/DPO_dialogue • 10.1k • 41
clembench-playpen/DPO_turn • 87.6k • 99
clembench-playpen/SFT-Final-Dataset • 7.37k • 20
clembench-playpen/DPO_turn_allneg_old_and_new • 202k • 5
clembench-playpen/DPO_turn_allneg_old • 34k • 6
clembench-playpen/DPO_dialogue_1neg_old • 6.7k • 2
clembench-playpen/DPO_turn_allneg_old_6m • 34k • 8
clembench-playpen/DPO_dialogue_1neg_best_models_old_6m • 2.33k • 3