Shri2703's picture
Upload README.md with huggingface_hub
b8a1388 verified
metadata
language: en
license: apache-2.0
base_model: Qwen/Qwen2.5-1.5B-Instruct
tags:
  - qwen
  - lora
  - peft
  - causal-lm

Qwen2.5-1.5B-Instruct Fine-tuned Model

This model is a fine-tuned version of Qwen/Qwen2.5-1.5B-Instruct using LoRA (Low-Rank Adaptation).

Training Details

  • Model was trained for 2 epochs on a custom dataset
  • Used 4-bit quantization for efficient training
  • Used the LoRA+ technique with 16.0 ratio
  • Trained with a batch size of 1 and gradient accumulation steps of 12