metadata
base_model: unsloth/Llama-3.2-1B-Instruct
tags:
- text-generation-inference
- transformers
- unsloth
- llama
- gguf
license: apache-2.0
language:
- en
π§ Llama 3.2 - 1B Instruct | Toolcalling Test Finetuned Model (ADROIT NOT USING ANYMORE DEVELOPED FOR TESTING ONLY)
- Developed by: nawazadroit
- License: Apache-2.0
- Finetuned from base model:
unsloth/Llama-3.2-1B-Instruct
π§ͺ Purpose
This is a simple tool-calling test model, finetuned specifically on a custom dataset (dataset.json) related to scheme application workflows. It is designed for structured assistant-like behavior with basic tool invocation capabilities, especially in the domain of public healthcare scheme assistance (e.g., applying to MJPJAY / PM-JAY schemes in government hospitals).
π Features
- β Lightweight 1B parameter model
- β Uses Huggingface's TRL library for reward modeling and instruction tuning
- β Ideal for testing local tool-calling setups
- β Compatible with GGUF format for efficient inference
π Dataset Info
The model was trained on a JSON dataset (dataset.json) containing multi-turn dialogues structured for:
- Verifying patient eligibility
- Applying health schemes
- Extracting document information
- Calling specific tools via structured API-like responses
π Links
- π Hugging Face Transformers
- π¬ TRL Library