Papers
AI & ML interests
R3 Model is all you need
Recent Activity
View all activity
models
66

rubricreward/LLaMA-3.2-3B-DPO-HelpSteer3-R3-Qwen3-14B-LoRA-4k
Text Generation
•
Updated
•
14

rubricreward/LLaMA-3.2-3B-DPO-HelpSteer3-R3-Qwen3-8B-14k
Text Generation
•
Updated
•
13

rubricreward/LLaMA-3.2-3B-DPO-HelpSteer3-R3-Qwen3-4B-14k
Text Generation
•
Updated
•
13

rubricreward/R3-DeepSeek-R1-Distill-Qwen-14B-LoRA-4k
15B
•
Updated
•
7

rubricreward/R3-DeepSeek-R1-Distill-Qwen-14B-LoRA-14k
15B
•
Updated
•
10

rubricreward/R3-DeepSeek-R1-Distill-Qwen-14B-14k
Text Generation
•
15B
•
Updated
•
10

rubricreward/R3-DeepSeek-R1-Distill-Qwen-14B-4k
Text Generation
•
15B
•
Updated
•
11

rubricreward/R3-Phi-4-reasoning-plus-LoRA-14k
15B
•
Updated
•
13

rubricreward/R3-Qwen3-14B-LoRA-14k
15B
•
Updated
•
15

rubricreward/R3-Qwen3-8B-LoRA-14k
Text Generation
•
8B
•
Updated
•
10
•
2
datasets
146
rubricreward/PolyGuardMix
Viewer
•
Updated
•
2.99M
•
121
rubricreward/mR3-Dataset-Filtered1
Viewer
•
Updated
•
179k
rubricreward/PolyGuardMix-filtered-tgt_prompt_tgt_thinking-filtered_correct
Viewer
•
Updated
•
624k
rubricreward/PolyGuardMix-filtered-tgt_prompt_en_thinking-filtered_correct
Viewer
•
Updated
•
631k
rubricreward/PolyGuardMix-filtered-en_prompt_en_thinking-filtered_correct
Viewer
•
Updated
•
638k
rubricreward/PolyGuardMix-filtered-tgt_prompt_tgt_thinking
Viewer
•
Updated
•
890k
rubricreward/PolyGuardMix-filtered-tgt_prompt_en_thinking
Viewer
•
Updated
•
904k
rubricreward/PolyGuardMix-filtered-en_prompt_en_thinking
Viewer
•
Updated
•
903k
rubricreward/HelpSteer3
Viewer
•
Updated
•
40.5k
•
115
rubricreward/HelpSteer3-tgt_prompt_tgt_thinking-filtered_correct
Viewer
•
Updated
•
12.7k
•
14