-
-
-
-
-
-
Active filters:
beaver
PKU-Alignment/beaver-7b-v1.0
Reinforcement Learning
•
Updated
•
10
•
10
PKU-Alignment/beaver-7b-v1.0-reward
Reinforcement Learning
•
Updated
•
859
•
16
PKU-Alignment/beaver-dam-7b
Updated
•
1.66k
•
6
PKU-Alignment/beaver-7b-v1.0-cost
Reinforcement Learning
•
Updated
•
796
•
9
PKU-Alignment/alpaca-7b-reproduced
Updated
•
1.76k
•
5
ingmarnitze/yolov8_arcticbeavers
Object Detection
•
Updated
PKU-Alignment/beaver-7b-v2.0
Reinforcement Learning
•
Updated
•
6
PKU-Alignment/beaver-7b-v2.0-reward
Reinforcement Learning
•
Updated
•
90
PKU-Alignment/beaver-7b-v2.0-cost
Reinforcement Learning
•
Updated
•
5
PKU-Alignment/beaver-7b-v3.0
Reinforcement Learning
•
Updated
•
214
PKU-Alignment/beaver-7b-v3.0-reward
Reinforcement Learning
•
Updated
•
112
PKU-Alignment/beaver-7b-v3.0-cost
Reinforcement Learning
•
Updated
•
11
PKU-Alignment/beaver-7b-unified-reward
Reinforcement Learning
•
Updated
•
165
PKU-Alignment/beaver-7b-unified-cost
Reinforcement Learning
•
Updated
•
523
•
1
PKU-Alignment/alpaca-7b-reproduced-llama-2
Updated
•
113
•
1
PKU-Alignment/alpaca-8b-reproduced-llama-3
Updated
•
228
PKU-Alignment/alpaca-70b-reproduced-llama-3