-
-
-
-
-
-
Inference Providers
Active filters:
dpo
tsavage68/Na_M2_1000steps_1e7rate_01beta_cSFTDPO
Text Generation
•
Updated
•
5
tsavage68/Na_M2_1000steps_1e7rate_03beta_cSFTDPO
Text Generation
•
Updated
•
4
tsavage68/Na_M2_1000steps_1e7rate_05beta_cSFTDPO
Text Generation
•
Updated
•
4
tsavage68/Na_M2_200steps_1e6rate_01beta_cSFTDPO
Text Generation
•
Updated
•
5
tsavage68/Na_M2_100steps_1e7rate_03beta_cSFTDPO
Text Generation
•
Updated
•
4
SongTonyLi/SFT_D1chosenThenDPO_D2a_Instruct_argilla_math_results
Text Generation
•
Updated
•
5
Jatin313/tiny-chatbot-dpo
NicholasCorrado/zephyr-7b-uf-dpo-2e
Text Generation
•
Updated
•
26
bartowski/TwinLlama-3.1-8B-DPO3-GGUF
Text Generation
•
Updated
•
41
nomadrp/tq-aya101-6langs
NicholasCorrado/rlced-conifer-zephyr-7b-dpo-2e
Text Generation
•
Updated
•
25
tsavage68/Na_M2_1000steps_1e8rate_03beta_cSFTDPO
Text Generation
•
Updated
•
4
tsavage68/Na_M2_1000steps_1e6rate_05beta_cSFTDPO
Text Generation
•
Updated
•
4
tsavage68/Na_M2_1000steps_1e8rate_01beta_cSFTDPO
Text Generation
•
Updated
•
4
tsavage68/Na_M2_350steps_1e8rate_03beta_cSFTDPO
Text Generation
•
Updated
•
4
tsavage68/Na_M2_1000steps_1e8rate_05beta_cSFTDPO
Text Generation
•
Updated
•
5
tsavage68/Na_M2_300steps_1e8rate_01beta_cSFTDPO
Text Generation
•
Updated
•
4
NicholasCorrado/zephyr-7b-uf-rlced-conifer-group-dpo-2e
Text Generation
•
Updated
•
26
KoNqUeRoR3891/HW2-dpo
Text Generation
•
Updated
•
39
nomadrp/tq-aya101-gt2
nomadrp/tq-llama3.1-gt3
NicholasCorrado/zephyr-7b-uf-rlced-conifer-1e2e-group-dpo-2e
Text Generation
•
Updated
•
12
nomadrp/tq-llama3.1-sent-shlfd-gt3
QuantFactory/Lama-DPOlphin-8B-GGUF
Text Generation
•
Updated
•
215
•
2
LBK95/Llama-2-7b-hf-DPO-LookAhead5_FullEval_TTree1.4_TLoop0.7_TEval0.2_V1.0
Wenboz/zephyr-7b-wpo-lora
YYYYYYibo/gshf_ours_1_iter_2
Triangle104/NeuralDaredevil-8B-abliterated-Q4_K_M-GGUF
Triangle104/NeuralDaredevil-8B-abliterated-Q4_0-GGUF
Triangle104/NeuralDaredevil-8B-abliterated-Q4_K_S-GGUF