Yassinj/Llama-3.1-8B_medical

Question Answering

Yassinj committed on Jan 3

Commit 4bf0f10 · verified · 1 Parent(s): ba2b748

Update README.md

Files changed (1)

README.md +56 -4

@@ -7,8 +7,6 @@ base_model:
 pipeline_tag: question-answering
 ---
-# LLaMA 3.1-8B Fine-Tuned on ChatDoctor Dataset
 ## Model Overview
 This model is a fine-tuned version of the LLaMA 3.1-8B model, trained on a curated selection of 1,122 samples from the **ChatDoctor (HealthCareMagic-100k)** dataset. It has been optimized for tasks related to medical consultations.
@@ -50,8 +48,62 @@ The model was fine-tuned with the following hyperparameters:
 Validation was performed using a separate subset of the dataset. The final training and validation losses are as follows:
-![Training and Validation Loss](train-val-curve.png)
 ## Usage
 ### Loading the Model
-This model is hosted in **GGUF format** for optimal deployment. You can load and run the model using **LLaMA.cpp**.

 Validation was performed using a separate subset of the dataset. The final training and validation losses are as follows:
+<p align="center">
+  <img src="train-val-curve.png" alt="Training and Validation Loss" width="50%"/>
+</p>
+## Evaluation Results
+### Original Model
+- **ROUGE-1**: 0.1726
+- **ROUGE-2**: 0.0148
+- **ROUGE-L**: 0.0980
+### Fine-Tuned Model
+- **ROUGE-1**: 0.2177
+- **ROUGE-2**: 0.0337
+- **ROUGE-L**: 0.1249
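To put the two sets of scores side by side, the relative gain of the fine-tuned model over the original can be computed directly from the numbers listed above. The following is a minimal sketch using `awk`; the function name `rouge_gains` is hypothetical and the scores are copied from the lists, not recomputed.

```shell
# Relative ROUGE gain of the fine-tuned model over the original model.
# Columns in the inline table: metric, original score, fine-tuned score.
# (rouge_gains is a hypothetical helper name used only for this sketch.)
rouge_gains() {
    awk '{ printf "%s: +%.1f%%\n", $1, ($3 - $2) / $2 * 100 }' <<'EOF'
ROUGE-1 0.1726 0.2177
ROUGE-2 0.0148 0.0337
ROUGE-L 0.0980 0.1249
EOF
}

rouge_gains
```

This prints gains of roughly 26.1% (ROUGE-1), 127.7% (ROUGE-2), and 27.4% (ROUGE-L). The absolute scores remain low, so the relative figures are best read as a direction of change rather than a measure of answer quality.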
 ## Usage
 ### Loading the Model
+This model is distributed in **GGUF format** and can be loaded and run with **llama.cpp**.
+#### Steps to Use
+1. Clone and build the llama.cpp repository:
+    ```bash
+    git clone https://github.com/ggerganov/llama.cpp
+    cd llama.cpp
+    # Current llama.cpp builds with CMake; binaries are written to build/bin/
+    cmake -B build
+    cmake --build build --config Release
+    ```
+2. Download the model from Hugging Face:
+    ```bash
+    huggingface-cli login
+    wget https://huggingface.co/your-username/llama-3.1-8B-gguf/resolve/main/output_model.gguf
+    ```
+3. Run inference:
+    ```bash
+    # The CLI binary was renamed from `main` to `llama-cli`
+    ./build/bin/llama-cli -m output_model.gguf -p "What are the symptoms of a common cold?" -t 4 -n 100
+    ```
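The name and location of the llama.cpp command-line binary depend on the checkout and build method: CMake builds place `llama-cli` under `build/bin/`, Makefile builds placed it in the repository root, and older versions called it `main`. The sketch below checks those locations in order; `find_llama_cli` is a hypothetical helper, not part of llama.cpp.

```shell
# Hypothetical helper: print the path of the first llama.cpp CLI binary
# found in the given checkout directory (defaults to the current directory).
find_llama_cli() {
    dir="${1:-.}"
    for candidate in "$dir/build/bin/llama-cli" "$dir/llama-cli" "$dir/main"; do
        if [ -x "$candidate" ]; then
            printf '%s\n' "$candidate"
            return 0
        fi
    done
    # Nothing found: the repository has probably not been built yet.
    return 1
}

# Usage, with the model file name from the steps above:
#   cli="$(find_llama_cli .)" && "$cli" -m output_model.gguf -p "..." -t 4 -n 100
```

The function returns a non-zero status when no binary is found, so a calling script can stop with a clear message instead of failing on a missing executable.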
+### Quantization Details
+The model is quantized to **Q4_0** (4-bit weights with a per-block scale), which reduces file size and memory use and speeds up CPU inference at some cost in accuracy.
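As a rough guide to what "low memory" means here, the Q4_0 weight size can be estimated from the block layout: each block holds 32 weights in 18 bytes (16 bytes of 4-bit values plus a 2-byte fp16 scale), or 4.5 bits per weight. The sketch below applies that to an assumed parameter count of about 8.03 billion; both the count and the helper name `q4_0_size_gb` are assumptions for illustration, and the actual GGUF file will be somewhat larger because some tensors are typically kept at higher precision and the file carries metadata.

```shell
# Back-of-the-envelope size of Q4_0-quantized weights, in decimal gigabytes.
# Q4_0 packs 32 weights into 18 bytes: 16 bytes of 4-bit values + 2-byte scale.
# (q4_0_size_gb is a hypothetical helper name used only for this sketch.)
q4_0_size_gb() {
    params="$1"   # number of quantized weights
    awk -v n="$params" 'BEGIN { printf "%.2f\n", n * 18 / 32 / 1e9 }'
}

# Assumed parameter count for an 8B-class model (approximate).
q4_0_size_gb 8030000000
```

Under these assumptions the estimate comes to about 4.52 GB for the weights alone. Runtime memory also includes the KV cache and scratch buffers, which grow with context length.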
+## Limitations and Intended Use
+- **Not for Clinical Use**: This model is intended for educational purposes and general health advice. It should not replace professional medical consultation.
+- **Bias and Errors**: The model might exhibit biases present in the training data. Outputs should be interpreted with caution.
+## Acknowledgments
+- **Dataset**: ChatDoctor (HealthCareMagic-100k)
+- **Base Model**: LLaMA 3.1-8B
+- **Quantization Tools**: llama.cpp
+## Citation
+If you use this model, please cite:
+```
+@article{yourcitation,
+  title={Fine-tuned LLaMA 3.1-8B on ChatDoctor Dataset},
+  author={Your Name},
+  year={2025},
+  publisher={Hugging Face}
+}