EpistemeAI
/

ReasoningCore-3B-R01

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

legolasyiu commited on Feb 9

Commit

b979cec

·

verified ·

1 Parent(s): 4df0cb9

Update README.md

Files changed (1) hide show

README.md +1 -1

README.md CHANGED Viewed

@@ -23,7 +23,7 @@ datasets:
 - **Model Developer:** EpitemeAI
 - **Model Architecture:**
-  ReasoningCore‑3B is an auto‑regressive language model built on an optimized transformer architecture. It incorporates specialized reasoning pathways and has been fine‑tuned using both supervised learning and reinforcement learning with human feedback (RLHF) to align with human expectations for clarity, accuracy, and safety in complex tasks.
 |                                | Training Data                                    | Params | Input Modalities      | Output Modalities            | Context Length | GQA | Shared Embeddings | Token Count    | Knowledge Cutoff  |
 |--------------------------------|--------------------------------------------------|--------|-----------------------|------------------------------|----------------|-----|-------------------|----------------|-------------------|

 - **Model Developer:** EpitemeAI
 - **Model Architecture:**
+  ReasoningCore‑3B is an auto‑regressive language model built on an optimized transformer architecture. It incorporates specialized reasoning pathways and has been fine‑tuned using Group Robust Preference Optimization(GRPO), and both supervised learning and reinforcement learning with human feedback (RLHF) to align with human expectations for clarity, accuracy, and safety in complex tasks.
 |                                | Training Data                                    | Params | Input Modalities      | Output Modalities            | Context Length | GQA | Shared Embeddings | Token Count    | Knowledge Cutoff  |
 |--------------------------------|--------------------------------------------------|--------|-----------------------|------------------------------|----------------|-----|-------------------|----------------|-------------------|