smishr-18 committed on
Commit
1e58147
·
verified ·
1 Parent(s): 312839b

Update README.md

Files changed (1)
  1. README.md +18 -18
README.md CHANGED
@@ -7,28 +7,17 @@ language:
7
  - en
8
  ---
9
 
10
- # Model Card for Model ID
11
-
12
- <!-- Provide a quick summary of what the model is/does. -->
13
-
14
-
15
-
16
- ## Model Details
17
-
18
- ### Model Description
19
 
20
  <!-- Provide a longer summary of what this model is. -->
21
 
22
- This is the model card of a 🤗 transformers model that has been pushed on the Hub. This model card has been automatically generated.
23
 
24
- - **Developed by:** Shubh Mishra, 2024
25
- - **Model Type:** NLP
26
- - **Language(s) (NLP):** English
27
- - **License:** MIT
28
- - **Finetuned from model:** microsoft/Phi-3-mini-4k-instruct
29
 
30
-
31
- ## Uses
32
 
33
  ```Python
34
  import transformers
@@ -77,4 +66,15 @@ sequences = pipeline(
77
  )
78
  print(sequences[0]['generated_text'])
79
 
80
- ```

7
  - en
8
  ---
9
 
10
+ ### Phi3-DPO (The Finetuned One)
11
 
12
  <!-- Provide a longer summary of what this model is. -->
13
 
14
+ A DPO fine-tune of microsoft/Phi-3-mini-4k-instruct (3.82B params) on the Intel/orca_dpo_pairs preference dataset.
15
+ **Phi3-TheFinetunedOne** was finetuned after configuring the microsoft/Phi-3-mini-4k-instruct model with PEFT.
16
+ Named after the anime character Satoru Gojo.
17
 
18
+ <img src="https://cdn-uploads.huggingface.co/production/uploads/658f7b32dfca9fad61344f82/AiWqrbc0HXB7_DpDhZr4z.webp" alt="Image Description" width="400"/>
19
 
20
+ ## Usage
 
21
 
22
  ```Python
23
  import transformers
 
66
  )
67
  print(sequences[0]['generated_text'])
68
 
69
+ ```
70
+
71
+ ## Limitations
72
+
73
+ Phi3-TheFinetunedOne was finetuned on a T4 Colab GPU and could be finetuned with more adapters on
74
+ devices with ```torch.cuda.get_device_capability()[0] >= 8``` (Ampere or newer GPUs).
75
+
76
+ - **Developed by:** Shubh Mishra, 2024
77
+ - **Model Type:** NLP
78
+ - **Language(s) (NLP):** English
79
+ - **License:** MIT
80
+ - **Finetuned from model:** microsoft/Phi-3-mini-4k-instruct
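
The hardware condition in the Limitations section can be checked before starting a run. The following is a minimal sketch, not part of the committed README: the helper names `is_ampere_or_newer` and `detect_capability` are hypothetical, and the only library call assumed is PyTorch's `torch.cuda.get_device_capability()`, which returns a `(major, minor)` tuple. The import is guarded so the script still runs on machines without PyTorch or CUDA.

```python
from typing import Optional, Tuple


def is_ampere_or_newer(capability: Optional[Tuple[int, int]]) -> bool:
    """Return True when the compute capability major version is 8 or higher."""
    return capability is not None and capability[0] >= 8


def detect_capability() -> Optional[Tuple[int, int]]:
    """Query the current GPU through PyTorch; return None without torch or CUDA."""
    try:
        import torch
    except ImportError:
        return None
    if not torch.cuda.is_available():
        return None
    return torch.cuda.get_device_capability()


if __name__ == "__main__":
    cap = detect_capability()
    # A T4 reports (7, 5); an A100 reports (8, 0).
    print(f"capability={cap}, ampere_or_newer={is_ampere_or_newer(cap)}")
```

On a T4 Colab runtime this check is expected to report `False`, which matches the limitation stated above.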