Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
Edit Models filters
Tasks
Libraries
Datasets
Languages
Licenses
Other
1
Misc
Reset Misc
DPO
Inference Endpoints
AutoTrain Compatible
text-generation-inference
4-bit precision
Mixture of Experts
Merge
Eval Results
8-bit precision
Misc with no match
custom_code
text-embeddings-inference
Carbon Emissions
Apply filters
Models
490
Full-text search
Edit filters
Sort: Trending
Active filters:
DPO
Clear all
Izhanjafry/Nous-Hermes-2-Mistral-7B-DPO-Q4_0-GGUF
Updated
Dec 29, 2024
•
118
Nagi-ovo/Llama-3-8B-DPO
Text Generation
•
Updated
24 days ago
•
35
Novaciano/TinyLlama-1b_DPO_Roleplay_NSFW-GGUF
Updated
29 days ago
•
104
tensorblock/Hermes-2-Theta-Llama-3-8B-32k-GGUF
Updated
29 days ago
•
320
mradermacher/Llama3-OpenBioLLM-8B-GGUF
Updated
26 days ago
•
381
mradermacher/Llama3-OpenBioLLM-8B-i1-GGUF
Updated
26 days ago
•
924
MilyaShams/SmolLM2-DPO-FT-smoltalk
Text Generation
•
Updated
23 days ago
•
4
MilyaShams/SmolLM2-DPO-FT-Instruct
Text Generation
•
Updated
23 days ago
•
6
Avibhi/Gemma2-2B-HindiTranslation-DPO
Updated
20 days ago
JHuel/Mistral-Nemo-Instruct-2407_DPO_qlora
Reinforcement Learning
•
Updated
8 days ago
Previous
1
...
15
16
17
Next