Edit Models filters

Model Tree

THUDM/LongReward-llama3.1-8b-DPO

Misc

Inference Endpoints

AutoTrain Compatible

text-generation-inference

4-bit precision

8-bit precision

text-embeddings-inference

Carbon Emissions

Mixture of Experts

Models

2

Full-text search

Active filters: THUDM/LongReward-llama3.1-8b-DPO

kromcomp/L3.1-LongReward-r128-LoRA

Updated Nov 23, 2024 • 1

kromcomp/L3.1-LongReward-r16-LoRA

Updated Dec 30, 2024 • 1