RLHF-And-Friends/RM-UltrafeedbackBinarized-Llama-3.2-3B-Instruct-Q4-LoRA8-Batch-16-Tok-1024 Updated 21 days ago
RLHF-And-Friends/RM-UltrafeedbackBinarized-Llama-3.2-1B-Instruct-Q4-LoRA8-Batch-16-Tok-1024 Updated 20 days ago
RLHF-And-Friends/RM-UltrafeedbackBinarized-Llama-3.1-8B-Instruct-Q4-LoRA8-Batch-16-Tok-1024 Updated 20 days ago