Update README.md
Browse files
README.md
CHANGED
@@ -10,6 +10,8 @@ March 23, 2024
|
|
10 |
Felix-8B is an experimental language model developed by Ontocord.ai, specializing in addressing lawfulness concerns under the Biden-Harris Executive Order on AI and the principles of the EU AI Act. This model has achieved one of the highest scores on the TruthfulQA benchmark compared to models of its size, showcasing its exceptional performance in providing accurate and reliable responses.
|
11 |
Felix-8B is **experimental and a research work product** and a DPO reinforcement learning version of [ontocord/sft-4e-exp2](https://huggingface.co/ontocord/sft-4e-exp2) which in turn is a fine-tuned version of [TencentARC/Mistral_Pro_8B_v0.1](https://huggingface.co/TencentARC/Mistral_Pro_8B_v0.1).
|
12 |
|
|
|
|
|
13 |

|
14 |
|
15 |
## Model Description
|
@@ -75,7 +77,7 @@ While Felix-8B has been designed with safety and truthfulness as its primary foc
|
|
75 |
|
76 |
Additionally, Felix-8B's performance may vary depending on the specific task and domain. Users should exercise caution when applying the model to sensitive or high-stakes situations and always verify the model's outputs with human oversight.
|
77 |
|
78 |
-
Felix-8B is **experimental and a research work product**.
|
79 |
|
80 |
## Ethical Considerations
|
81 |
|
|
|
10 |
Felix-8B is an experimental language model developed by Ontocord.ai, specializing in addressing lawfulness concerns under the Biden-Harris Executive Order on AI and the principles of the EU AI Act. This model has achieved one of the highest scores on the TruthfulQA benchmark compared to models of its size, showcasing its exceptional performance in providing accurate and reliable responses.
|
11 |
Felix-8B is **experimental and a research work product** and a DPO reinforcement learning version of [ontocord/sft-4e-exp2](https://huggingface.co/ontocord/sft-4e-exp2) which in turn is a fine-tuned version of [TencentARC/Mistral_Pro_8B_v0.1](https://huggingface.co/TencentARC/Mistral_Pro_8B_v0.1).
|
12 |
|
13 |
+
**If you find any issues please let us know in the Community section so we can improve the model.**
|
14 |
+
|
15 |

|
16 |
|
17 |
## Model Description
|
|
|
77 |
|
78 |
Additionally, Felix-8B's performance may vary depending on the specific task and domain. Users should exercise caution when applying the model to sensitive or high-stakes situations and always verify the model's outputs with human oversight.
|
79 |
|
80 |
+
Felix-8B is **experimental and a research work product**. We have found for example, the model may be overly verbose. Also, we are actively working on improving the training dataset. We have for example discovered some answers are too short in our dataset, or may have typos due to our use of a quantized generative model to create synthetic triples for DPO.
|
81 |
|
82 |
## Ethical Considerations
|
83 |
|