Models: 113

Active filters: reward model

nvidia/Nemotron-4-340B-Reward

Updated Jun 19, 2024 • 27 • 118

Qwen/Qwen2-Math-RM-72B

Text Classification • Updated Sep 18, 2024 • 39 • 5

nvidia/Llama-3.1-Nemotron-70B-Reward-HF

Updated Oct 15, 2024 • 422 • 82

Qwen/Qwen2.5-Math-PRM-72B

Text Classification • Updated Jan 17 • 405 • 71

berkeley-nest/Starling-LM-7B-alpha

Text Generation • Updated Mar 20, 2024 • 15.8k • 558

CallComply/Starling-LM-11B-alpha

Text Generation • Updated Mar 4, 2024 • 1.69k • 13

Nexusflow/Starling-LM-7B-beta

Text Generation • Updated Apr 3, 2024 • 3.37k • 343

johnsnowlabs/JSL-MedMNX-7B

Text Generation • Updated Apr 18, 2024 • 2.89k • 5

nvidia/Llama3-70B-SteerLM-RM

Updated Jun 19, 2024 • 12 • 43

Qwen/Qwen2.5-Math-RM-72B

Text Classification • Updated Oct 31, 2024 • 13.5k • 77

nvidia/Llama-3.1-Nemotron-70B-Reward

Updated Oct 15, 2024 • 32 • 71

Qwen/Qwen2.5-Math-7B-PRM800K

Text Classification • Updated Jan 17 • 2.47k • 14

Qwen/Qwen2.5-Math-PRM-7B

Text Classification • Updated Jan 17 • 32.1k • 61

prithivMLmods/PRM-Math-7B-Reasoner

Text Classification • Updated Jan 19 • 49 • 1

internlm/internlm-xcomposer2d5-7b-reward

Any-to-Any • Updated Jan 28 • 1.46k • 8

mradermacher/Starling-LM-11B-alpha-GGUF

Updated Feb 9 • 104 • 1

mradermacher/Starling-LM-11B-alpha-i1-GGUF

Updated Feb 10 • 234 • 1

nicholasKluge/RewardModelPT

Text Classification • Updated Jun 18, 2024 • 21

nicholasKluge/RewardModel

Text Classification • Updated Jun 18, 2024 • 45

Ablustrund/moss-rlhf-reward-model-7B-zh

Updated Jul 13, 2023 • 8 • 23

fnlp/moss-rlhf-reward-model-7B-en

Updated Jul 13, 2023 • 9

berkeley-nest/Starling-RM-7B-alpha

Updated Jul 30, 2024 • 28 • 102

LoneStriker/Starling-LM-7B-alpha-3.0bpw-h6-exl2

Text Generation • Updated Nov 27, 2023 • 4

LoneStriker/Starling-LM-7B-alpha-4.0bpw-h6-exl2

Text Generation • Updated Nov 27, 2023 • 5 • 1

LoneStriker/Starling-LM-7B-alpha-5.0bpw-h6-exl2

Text Generation • Updated Nov 27, 2023 • 5 • 2

LoneStriker/Starling-LM-7B-alpha-6.0bpw-h6-exl2

Text Generation • Updated Nov 27, 2023 • 5 • 1

LoneStriker/Starling-LM-7B-alpha-8.0bpw-h8-exl2

Text Generation • Updated Nov 27, 2023 • 7 • 2

TheBloke/Starling-LM-7B-alpha-GGUF

Updated Nov 28, 2023 • 2.1k • 94

TheBloke/Starling-LM-7B-alpha-AWQ

Text Generation • Updated Nov 28, 2023 • 19 • 9

second-state/Starling-LM-7B-alpha-GGUF

Text Generation • Updated Mar 20, 2024 • 62 • 3