Shri2703 commited on
Commit
b8a1388
·
verified ·
1 Parent(s): 50dfff9

Upload README.md with huggingface_hub

Browse files
Files changed (1) hide show
  1. README.md +21 -3
README.md CHANGED
@@ -1,3 +1,21 @@
1
- ---
2
- license: apache-2.0
3
- ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+
2
+ ---
3
+ language: en
4
+ license: apache-2.0
5
+ base_model: Qwen/Qwen2.5-1.5B-Instruct
6
+ tags:
7
+ - qwen
8
+ - lora
9
+ - peft
10
+ - causal-lm
11
+ ---
12
+
13
+ # Qwen2.5-1.5B-Instruct Fine-tuned Model
14
+
15
+ This model is a fine-tuned version of [Qwen/Qwen2.5-1.5B-Instruct](https://huggingface.co/Qwen/Qwen2.5-1.5B-Instruct) using LoRA (Low-Rank Adaptation).
16
+
17
+ ## Training Details
18
+ - Model was trained for 2 epochs on a custom dataset
19
+ - Used 4-bit quantization for efficient training
20
+ - Used the LoRA+ technique with 16.0 ratio
21
+ - Trained with a batch size of 1 and gradient accumulation steps of 12