-
-
-
-
-
-
Inference Providers
Active filters:
dpo
NicholasCorrado/uf-rlced-conifer_tulu-2-7b-dpo-full
Text Generation
•
Updated
•
5
NicholasCorrado/uf-rlced-conifer-3-1-tinyllama-1.1b-chat-v1.0-dpo-full
Text Generation
•
Updated
•
16
NicholasCorrado/uf-tulu-2-7b-dpo-full
Text Generation
•
Updated
•
5
CharlesLi/OpenELM-1_1B-DPO-full-1-5
Text Generation
•
Updated
•
12
CharlesLi/OpenELM-1_1B-DPO-full-2-5
Text Generation
•
Updated
•
12
CharlesLi/OpenELM-1_1B-DPO-full-3-5
Text Generation
•
Updated
•
10
claudiubarbu/dpo
Text Generation
•
Updated
•
18
YYYYYYibo/approx_nash_again_1_iter_2
mradermacher/TwinLlama-3.1-8B-DPO-GGUF
Updated
•
68
bartowski/Fireball-3.1-8B-ORPO-GGUF
Text Generation
•
Updated
•
61
mradermacher/TwinLlama-3.1-8B-DPO3-GGUF
Updated
•
20
mradermacher/TwinLlama-3.1-8B-DPO2-GGUF
Updated
•
19
nomadrp/tq-llama3.1-gt2
LouisSanna/dpo-model-output
Text Generation
•
Updated
•
19
YYYYYYibo/approx_nash_again_1_iter_3
NicholasCorrado/uf-rlced-conifer-zephyr-7b-group-dpo-full
Text Generation
•
Updated
•
5
YYYYYYibo/two_agent_1_epoch_2_dpo_iter_6
mradermacher/uf-tulu-2-7b-dpo-full-GGUF
Updated
•
36
NicholasCorrado/uf-rlced-conifer_tulu-2-7b-group-dpo
Text Generation
•
Updated
•
5
QuantFactory/TwinLlama-3.1-8B-DPO-GGUF
Updated
•
25
•
3
QuantFactory/TwinLlama-3.1-8B-DPO2-GGUF
Updated
•
188
•
3
NicholasCorrado/tulu-2-7b-hh-dpo
Text Generation
•
Updated
•
6
NicholasCorrado/uf-rlced-conifer-zephyr-7b-group-dpo-no-clip-no-excess
Text Generation
•
Updated
•
25
NicholasCorrado/uf-rlced-conifer-zephyr-7b-group-dpo-no-clip
Text Generation
•
Updated
•
5
QuantFactory/TwinLlama-3.1-8B-DPO3-GGUF
Updated
•
71
•
3
bachephysicdun/HW2-dpo
Text Generation
•
Updated
•
8
bachephysicdun/HW2-orpo
Text Generation
•
Updated
•
8
NicholasCorrado/zephyr-7b-hh-dpo
Text Generation
•
Updated
•
6
SimaFarazi/gpt2-dpo
Text Generation
•
Updated
•
9
sumitxenon/HW2-dpo
Text Generation
•
Updated
•
6