Shri2703
/

qwen2-5-1.5b-finetuned

Model card Files Files and versions Community

qwen2-5-1.5b-finetuned / README.md

Shri2703's picture

Upload README.md with huggingface_hub

b8a1388 verified about 1 month ago

|

history blame contribute delete

552 Bytes

metadata

language: en
license: apache-2.0
base_model: Qwen/Qwen2.5-1.5B-Instruct
tags:
  - qwen
  - lora
  - peft
  - causal-lm

Qwen2.5-1.5B-Instruct Fine-tuned Model

This model is a fine-tuned version of Qwen/Qwen2.5-1.5B-Instruct using LoRA (Low-Rank Adaptation).

Training Details

Model was trained for 2 epochs on a custom dataset
Used 4-bit quantization for efficient training
Used the LoRA+ technique with 16.0 ratio
Trained with a batch size of 1 and gradient accumulation steps of 12