---
language:
  - en
  - zh
  - de
  - es
  - ru
  - fr
  - ja
  - ko
  - pt
  - tr
  - pl
  - it
  - nl
  - sv
tags:
  - whisper
  - openvino
  - int8
  - intel-igpu
  - speech-recognition
  - automatic-speech-recognition
  - unicorn-amanuensis
license: apache-2.0
pipeline_tag: automatic-speech-recognition
---

# Whisper Base INT8 - Optimized for Intel iGPU 🚀

This is an INT8 quantized version of OpenAI's Whisper base model, specifically optimized for Intel integrated GPUs.

## 🎯 Key Features

  • ~3.7x smaller than FP32 (75MB vs 280MB)
  • 2-4x faster inference on Intel iGPU
  • INT8 asymmetric quantization
  • 100% weights quantized to INT8
  • OpenVINO 2024.0+ compatible

## 📊 Performance

| Metric | Original | INT8 | Improvement |
|--------|----------|------|-------------|
| Model Size | 280MB | 75MB | 3.7x smaller |
| Inference Speed | 1.0x | 2-4x | 2-4x faster |
| Memory Bandwidth | 100% | 30-50% | 50-70% reduction |

## 🎮 Optimized for Intel Hardware

  • Intel Arc Graphics (A770, A750, A380)
  • Intel Iris Xe Graphics (12th Gen+)
  • Intel UHD Graphics (11th Gen+)

## 📄 License

Apache 2.0

## 🦄 Part of Unicorn Amanuensis

Professional speech-to-text (STT) suite: https://github.com/Unicorn-Commander/Unicorn-Amanuensis