Pythia Quantized Model for Sentiment Analysis
=============================================
This repository hosts a quantized version of the Pythia model, fine-tuned for sentiment analysis tasks.
The model has been optimized for efficient deployment while maintaining accuracy comparable to the
full-precision model, making it suitable for resource-constrained environments.
Model Details
-------------
* **Model Architecture:** Pythia-410m
* **Task:** Sentiment Analysis
* **Dataset:** IMDb Reviews
* **Quantization:** Float16
* **Fine-tuning Framework:** Hugging Face Transformers
The quantized model achieves comparable performance to the full-precision model while reducing memory usage and inference time.
Usage
-----
### Installation
```bash
pip install transformers torch
```
### Loading the Model
```python
from transformers import AutoTokenizer, AutoModelForCausalLM

model_name = "AventIQ-AI/pythia-410m"
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(model_name, low_cpu_mem_usage=True)

# Example usage
text = "This product is amazing!"
inputs = tokenizer(text, return_tensors="pt")
outputs = model.generate(**inputs, max_new_tokens=20)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```
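Because this is a causal language model, sentiment is read off from the generated continuation. The exact prompt template used during fine-tuning is not documented in this card; the snippet below is a minimal sketch, assuming a simple `Review: ... Sentiment:` style prompt, of how the generated text could be mapped to a positive/negative label. The `classify_sentiment` helper is hypothetical.

```python
# Minimal sketch, assuming a "Review: ... Sentiment:" prompt format
# (the actual fine-tuning prompt template is not documented in this card).
def classify_sentiment(review: str) -> str:
    prompt = f"Review: {review}\nSentiment:"
    inputs = tokenizer(prompt, return_tensors="pt")
    outputs = model.generate(**inputs, max_new_tokens=5)
    completion = tokenizer.decode(outputs[0], skip_special_tokens=True)
    # Keep only what the model appended after the prompt and normalise it.
    answer = completion[len(prompt):].strip().lower()
    return "positive" if "positive" in answer else "negative"

print(classify_sentiment("This product is amazing!"))  # expected: positive
```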
Performance Metrics
-------------------
* **Accuracy:** 0.56
* **F1 Score:** 0.56
* **Precision:** 0.68
* **Recall:** 0.56
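For reference, metrics of this kind can be reproduced with scikit-learn once predictions have been collected on the IMDb test split. The snippet below is a minimal sketch, not the exact evaluation script used here; `y_true` and `y_pred` are hypothetical label lists (0 = negative, 1 = positive), and the weighted averaging is an assumption.

```python
# Minimal sketch of the metric computation (not the exact evaluation script).
# y_true / y_pred are hypothetical integer labels: 0 = negative, 1 = positive.
from sklearn.metrics import accuracy_score, f1_score, precision_score, recall_score

y_true = [1, 0, 1, 1]   # hypothetical gold labels
y_pred = [1, 0, 0, 1]   # hypothetical model predictions

print("Accuracy :", accuracy_score(y_true, y_pred))
print("F1 Score :", f1_score(y_true, y_pred, average="weighted"))
print("Precision:", precision_score(y_true, y_pred, average="weighted"))
print("Recall   :", recall_score(y_true, y_pred, average="weighted"))
```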
Fine-Tuning Details
-------------------
### Dataset
The IMDb Reviews dataset was used, containing both positive and negative sentiment examples.
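The dataset is available through the Hugging Face `datasets` package (`pip install datasets`). The snippet below is a small sketch of how it can be loaded and inspected; the exact preprocessing used for fine-tuning is not documented here.

```python
# Sketch: load the IMDb Reviews dataset (labels: 0 = negative, 1 = positive).
# The exact preprocessing pipeline used for fine-tuning is not documented here.
from datasets import load_dataset

imdb = load_dataset("imdb")
print(imdb)  # train/test splits of 25k labelled reviews each
print(imdb["train"][0]["text"][:200], imdb["train"][0]["label"])
```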
### Training
* Number of epochs: 3
* Batch size: 8
* Evaluation strategy: epoch
* Learning rate: 2e-5
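These settings map onto the Hugging Face `Trainer` API roughly as shown below. This is a minimal sketch under stated assumptions, not the exact fine-tuning script used for this checkpoint: the base checkpoint name, output path, tokenization, and plain language-model objective on raw reviews are all assumptions.

```python
# Minimal sketch of the fine-tuning configuration (not the exact script
# used for this checkpoint). Base checkpoint, output path, and the plain
# language-modelling objective on raw reviews are assumptions.
from datasets import load_dataset
from transformers import (AutoModelForCausalLM, AutoTokenizer,
                          DataCollatorForLanguageModeling, Trainer,
                          TrainingArguments)

model_name = "EleutherAI/pythia-410m"  # assumed base checkpoint
tokenizer = AutoTokenizer.from_pretrained(model_name)
tokenizer.pad_token = tokenizer.eos_token  # Pythia's tokenizer has no pad token
model = AutoModelForCausalLM.from_pretrained(model_name)

def tokenize(batch):
    return tokenizer(batch["text"], truncation=True, max_length=256)

imdb = load_dataset("imdb").map(tokenize, batched=True)
collator = DataCollatorForLanguageModeling(tokenizer, mlm=False)

training_args = TrainingArguments(
    output_dir="./pythia-410m-imdb",   # hypothetical output path
    num_train_epochs=3,
    per_device_train_batch_size=8,
    evaluation_strategy="epoch",
    learning_rate=2e-5,
)

trainer = Trainer(
    model=model,
    args=training_args,
    train_dataset=imdb["train"],
    eval_dataset=imdb["test"],
    data_collator=collator,
)
trainer.train()
```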
### Quantization
Post-training quantization was applied using PyTorch's built-in quantization framework to reduce the model size and improve inference efficiency.
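Since the model details list Float16 quantization, the conversion likely amounts to casting the fine-tuned weights to half precision. The snippet below is a minimal sketch of such a float16 post-training conversion; the exact quantization recipe and output path used for this checkpoint are not documented beyond what is stated above.

```python
# Minimal sketch of a float16 post-training conversion; the exact
# quantization recipe used for this checkpoint is not documented
# beyond what is stated in this card.
import torch
from transformers import AutoModelForCausalLM

model_name = "AventIQ-AI/pythia-410m"

# Load the fine-tuned weights in half precision and save the result.
model_fp16 = AutoModelForCausalLM.from_pretrained(model_name, torch_dtype=torch.float16)
model_fp16.save_pretrained("./pythia-410m-imdb-fp16")  # hypothetical output path
```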
Repository Structure
--------------------
```
.
β”œβ”€β”€ model/               # Quantized model files
β”œβ”€β”€ tokenizer/           # Tokenizer configuration and vocabulary files
β”œβ”€β”€ model.safetensors    # Fine-tuned model weights
└── README.md            # Model documentation
```
Limitations
-----------
* The model may not generalize well to domains outside the fine-tuning dataset.
* Quantization may result in minor accuracy degradation compared to full-precision models.
Contributing
------------
Contributions are welcome! Feel free to open an issue or submit a pull request if you have suggestions or improvements.