π§ Llama 3.2 - 1B Instruct | Toolcalling Test Finetuned Model (ADROIT NOT USING ANYMORE DEVELOPED FOR TESTING ONLY)
- Developed by: nawazadroit
- License: Apache-2.0
- Finetuned from base model:
unsloth/Llama-3.2-1B-Instruct
π§ͺ Purpose
This is a simple tool-calling test model, finetuned specifically on a custom dataset (dataset.json
) related to scheme application workflows. It is designed for structured assistant-like behavior with basic tool invocation capabilities, especially in the domain of public healthcare scheme assistance (e.g., applying to MJPJAY / PM-JAY schemes in government hospitals).
π Features
- β Lightweight 1B parameter model
- β Uses Huggingface's TRL library for reward modeling and instruction tuning
- β Ideal for testing local tool-calling setups
- β Compatible with GGUF format for efficient inference
π Dataset Info
The model was trained on a JSON dataset (dataset.json
) containing multi-turn dialogues structured for:
- Verifying patient eligibility
- Applying health schemes
- Extracting document information
- Calling specific tools via structured API-like responses
π Links
- π Hugging Face Transformers
- π¬ TRL Library
- Downloads last month
- 1
Hardware compatibility
Log In
to view the estimation
8-bit
Inference Providers
NEW
This model isn't deployed by any Inference Provider.
π
Ask for provider support
Model tree for nawazadroit/vaidya1st_toolcalling
Base model
meta-llama/Llama-3.2-1B-Instruct
Finetuned
unsloth/Llama-3.2-1B-Instruct