You need to agree to share your contact information to access this model

This repository is publicly accessible, but you have to accept the conditions to access its files and content.

Log in or Sign Up to review the conditions and access this model content.

Reasoning Llama model

Open Reasoning Llama model

Reference

Open-R1: a fully open reproduction of DeepSeek-R1

Uploaded model

  • Developed by: EpistemeAI
  • License: apache-2.0
  • Finetuned from model : unsloth/meta-llama-3.1-8b-instruct-bnb-4bit

This llama model was trained 2x faster with Unsloth and Huggingface's TRL library.

Downloads last month
68
Safetensors
Model size
8.03B params
Tensor type
BF16
·
Inference Providers NEW
This model is not currently available via any of the supported third-party Inference Providers, and the model is not deployed on the HF Inference API.

Model tree for EpistemeAI/Reasoning-Llama-3.1-CoT-RE1

Quantizations
2 models

Dataset used to train EpistemeAI/Reasoning-Llama-3.1-CoT-RE1