Hugging Face
Models: 113
Sort: Trending
Active filters: reward model
nvidia/Nemotron-4-340B-Reward • Updated Jun 19, 2024 • 27 • 118
Qwen/Qwen2-Math-RM-72B • Text Classification • Updated Sep 18, 2024 • 39 • 5
nvidia/Llama-3.1-Nemotron-70B-Reward-HF • Updated Oct 15, 2024 • 422 • 82
Qwen/Qwen2.5-Math-PRM-72B • Text Classification • Updated Jan 17 • 405 • 71
berkeley-nest/Starling-LM-7B-alpha • Text Generation • Updated Mar 20, 2024 • 15.8k • 558
CallComply/Starling-LM-11B-alpha • Text Generation • Updated Mar 4, 2024 • 1.69k • 13
Nexusflow/Starling-LM-7B-beta • Text Generation • Updated Apr 3, 2024 • 3.37k • 343
johnsnowlabs/JSL-MedMNX-7B • Text Generation • Updated Apr 18, 2024 • 2.89k • 5
nvidia/Llama3-70B-SteerLM-RM • Updated Jun 19, 2024 • 12 • 43
Qwen/Qwen2.5-Math-RM-72B • Text Classification • Updated Oct 31, 2024 • 13.5k • 77
nvidia/Llama-3.1-Nemotron-70B-Reward • Updated Oct 15, 2024 • 32 • 71
Qwen/Qwen2.5-Math-7B-PRM800K • Text Classification • Updated Jan 17 • 2.47k • 14
Qwen/Qwen2.5-Math-PRM-7B • Text Classification • Updated Jan 17 • 32.1k • 61
prithivMLmods/PRM-Math-7B-Reasoner • Text Classification • Updated Jan 19 • 49 • 1
internlm/internlm-xcomposer2d5-7b-reward • Any-to-Any • Updated Jan 28 • 1.46k • 8
mradermacher/Starling-LM-11B-alpha-GGUF • Updated Feb 9 • 104 • 1
mradermacher/Starling-LM-11B-alpha-i1-GGUF • Updated Feb 10 • 234 • 1
nicholasKluge/RewardModelPT • Text Classification • Updated Jun 18, 2024 • 21
nicholasKluge/RewardModel • Text Classification • Updated Jun 18, 2024 • 45
Ablustrund/moss-rlhf-reward-model-7B-zh • Updated Jul 13, 2023 • 8 • 23
fnlp/moss-rlhf-reward-model-7B-en • Updated Jul 13, 2023 • 9
berkeley-nest/Starling-RM-7B-alpha • Updated Jul 30, 2024 • 28 • 102
LoneStriker/Starling-LM-7B-alpha-3.0bpw-h6-exl2 • Text Generation • Updated Nov 27, 2023 • 4
LoneStriker/Starling-LM-7B-alpha-4.0bpw-h6-exl2 • Text Generation • Updated Nov 27, 2023 • 5 • 1
LoneStriker/Starling-LM-7B-alpha-5.0bpw-h6-exl2 • Text Generation • Updated Nov 27, 2023 • 5 • 2
LoneStriker/Starling-LM-7B-alpha-6.0bpw-h6-exl2 • Text Generation • Updated Nov 27, 2023 • 5 • 1
LoneStriker/Starling-LM-7B-alpha-8.0bpw-h8-exl2 • Text Generation • Updated Nov 27, 2023 • 7 • 2
TheBloke/Starling-LM-7B-alpha-GGUF • Updated Nov 28, 2023 • 2.1k • 94
TheBloke/Starling-LM-7B-alpha-AWQ • Text Generation • Updated Nov 28, 2023 • 19 • 9
second-state/Starling-LM-7B-alpha-GGUF • Text Generation • Updated Mar 20, 2024 • 62 • 3