deep-div
/

MediLlama-3.2

Text Generation

text-generation-inference

Model card Files Files and versions

InferenceLab commited on May 16

Commit

a256681

·

verified ·

1 Parent(s): e2bda88

Update README.md

Files changed (1) hide show

README.md +1 -29

README.md CHANGED Viewed

@@ -32,12 +32,6 @@ This model is a domain-adapted version of LLaMA 3.2 3B Instruct. It has been fin
 - **License:** Apache 2.0
 - **Finetuned from model:** meta-llama/Llama-3.2-3B-Instruct
-### Model Sources
-- **Repository:** [More Information Needed]
-- **Paper [optional]:** [More Information Needed]
-- **Demo [optional]:** [More Information Needed]
 ## Uses
 ### Direct Use
@@ -96,14 +90,11 @@ Tokenization using LLaMA tokenizer with special medical instruction formatting.
 #### Training Hyperparameters
 * **Training regime:** bf16 mixed precision
-* **Epochs:** 3
-* **Batch size:** 64
-* **Learning rate:** 2e-5
 #### Speeds, Sizes, Times
 * **Training time:** \~12 hours on 4×A100 GPUs
-* **Final model size:** \~3.1B parameters
 ## Evaluation
@@ -165,23 +156,6 @@ Explainability tools like LLaMA-MedLens (if available) are suggested to interpre
 * Unsloth
 * PyTorch 2.1
-## Citation
-**BibTeX:**
-```bibtex
-@misc{medillama_2025,
-  author       = {InferenceLab},
-  title        = {MediLlama-3.2: A Medical Chatbot Fine-Tuned from LLaMA 3.2},
-  year         = {2025},
-  publisher    = {HuggingFace},
-  howpublished = {\url{https://huggingface.co/InferenceLab/MediLlama-3.2}},
-}
-```
-**APA:**
-InferenceLab. (2025). *MediLlama-3.2: A Medical Chatbot Fine-Tuned from LLaMA 3.2*. Hugging Face. [https://huggingface.co/InferenceLab/MediLlama-3.2](https://huggingface.co/InferenceLab/MediLlama-3.2)
 ## Glossary
@@ -197,8 +171,6 @@ For collaborations, deployment help, or fine-tuning extensions, please contact t
 * InferenceLab Team
-## Model Card Contact
-* [[email protected]](mailto:[email protected])

 - **License:** Apache 2.0
 - **Finetuned from model:** meta-llama/Llama-3.2-3B-Instruct
 ## Uses
 ### Direct Use
 #### Training Hyperparameters
 * **Training regime:** bf16 mixed precision
+* **Learning rate:** 1e-5
 #### Speeds, Sizes, Times
 * **Training time:** \~12 hours on 4×A100 GPUs
 ## Evaluation
 * Unsloth
 * PyTorch 2.1
 ## Glossary
 * InferenceLab Team