Active filters: rlhf
PKU-Alignment/beaver-7b-v1.0 • Reinforcement Learning • 146 • 10
lyogavin/Anima33B-DPO-Belle-1k • Text Generation • 1
lyogavin/Anima33B-DPO-Belle-1k-merged • Text Generation • 16 • 12
PKU-Alignment/beaver-7b-v1.0-reward • Reinforcement Learning • 2.06k • 16
PKU-Alignment/beaver-dam-7b • 1.36k • 6
PKU-Alignment/beaver-7b-v1.0-cost • Reinforcement Learning • 2k • 9
Ablustrund/moss-rlhf-reward-model-7B-zh • 8 • 23
fnlp/moss-rlhf-reward-model-7B-en
fnlp/moss-rlhf-sft-model-7B-en
fnlp/moss-rlhf-policy-model-7B-en
lightonai/alfred-40b-0723 • Text Generation • 42 • 45
kashif/stack-llama-2 • Text Generation • 2.01k • 15
barnybug/stack-llama-2-ggml • 12
vwxyzjn/starcoderbase-triviaqa • Text Generation • 25
lvwerra/starcoderbase-gsm8k • Text Generation • 20
ContextualAI/archangel_sft_pythia1-4b • Text Generation • 108
ContextualAI/archangel_sft_pythia2-8b • Text Generation • 17 • 1
ContextualAI/archangel_sft_pythia6-9b • Text Generation • 22
ContextualAI/archangel_sft_pythia12-0b • Text Generation • 14
ContextualAI/archangel_sft_llama7b • Text Generation • 112 • 1
ContextualAI/archangel_sft_llama13b • Text Generation • 107
ContextualAI/archangel_sft_llama30b • Text Generation • 6
ContextualAI/archangel_slic_llama30b • Text Generation • 6
ContextualAI/archangel_slic_pythia1-4b • Text Generation • 56
ContextualAI/archangel_slic_pythia2-8b • Text Generation • 7
ContextualAI/archangel_slic_pythia6-9b • Text Generation • 12
ContextualAI/archangel_slic_pythia12-0b • Text Generation • 8
ContextualAI/archangel_slic_llama7b • Text Generation • 13 • 1
ContextualAI/archangel_slic_llama13b • Text Generation • 12
ContextualAI/archangel_dpo_pythia1-4b • Text Generation • 117