smishr-18
/

Phi3-TheFinetunedOne

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

smishr-18 commited on Jul 26, 2024

Commit

1e58147

·

verified ·

1 Parent(s): 312839b

Update README.md

Files changed (1) hide show

README.md +18 -18

README.md CHANGED Viewed

@@ -7,28 +7,17 @@ language:
 - en
 ---
-# Model Card for Model ID
-<!-- Provide a quick summary of what the model is/does. -->
-## Model Details
-### Model Description
 <!-- Provide a longer summary of what this model is. -->
-This is the model card of a 🤗 transformers model that has been pushed on the Hub. This model card has been automatically generated.
-- **Developed by:** Shubh Mishra, 2024
-- **Model Type:** NLP
-- **Language(s) (NLP):** English
-- **License:** MIT
-- **Finetuned from model:** microsoft/Phi-3-mini-4k-instruct
-## Uses
 ```Python
 import transformers
@@ -77,4 +66,15 @@ sequences = pipeline(
 )
 print(sequences[0]['generated_text'])
-```

 - en
 ---
+### Phi3-DPO (The Finetuned One)
 <!-- Provide a longer summary of what this model is. -->
+DPO fine-tuned of microsoft/Phi-3-mini-4k-instruct (3.82B params) on Intel/orca_dpo_pairs preference dataset.
+**Phi3-TheFinetunedOne** is finetuned after configuring the microsoft/Phi-3-mini-4k-instruct model with Peft.
+Named after the Anime Character Saturo Gojo.
+<img src="https://cdn-uploads.huggingface.co/production/uploads/658f7b32dfca9fad61344f82/AiWqrbc0HXB7_DpDhZr4z.webp" alt="Image Description" width="400"/>
+## Usage
 ```Python
 import transformers
 )
 print(sequences[0]['generated_text'])
+```
+## Limitations
+Phi3-TheFinetunedOne was finetuned on T4 Colab GPU and could be fintuned with more adapters on
+devices with ```torch.cuda.get_device_capability()[0] >= 8``` or Ampere GPUs.
+- **Developed by:** Shubh Mishra, 2024
+- **Model Type:** NLP
+- **Language(s) (NLP):** English
+- **License:** MIT
+- **Finetuned from model:** microsoft/Phi-3-mini-4k-instruct