--- license: other license_name: willisdiarize-v1-quantized-license license_link: LICENSE --- # WillisDiarize-v1-quantized ## Description WillisDiarize-v1-quantized is an ensemble model for diarization correction as a post-processing step. It was fine-tuned using the Mistral-7B-Instruct v0.2 foundational model. This is the quantized version of the original model. During fine-tuning, three separate automated speech recognition tools (namely AWS, Azure, and WhisperX) were used to generate the transcripts used. All fine-tuning was done on the Fisher corpus, a dataset of approximately 12,000 recorded conversations and their transcripts. For a full description of model development and performance testing, please read our [preprint](https://arxiv.org/pdf/2406.04927), Efstathiadis et al. (2024). WillisDiarize is free to use for non-commercial purposes; see [here](https://huggingface.co/bklynhlth/WillisDiarize-v1-quantized/blob/main/LICENSE) for the full license text. If you are interested in using this model commercially, please getintouch@brooklyn.health. ## Usage Install OpenWillis to easily access this model on both [AWS cloud](https://github.com/bklynhlth/openwillis/wiki/WillisDiarize-with-AWS-v1.0) or any high-performance [GPU machine](https://github.com/bklynhlth/openwillis/wiki/WillisDiarize-v1.0). Follow the installation steps [here](https://github.com/bklynhlth/openwillis/wiki/Getting-started). ## Citation ``` @article{bklynhlth/WillisDiarize, title={LLM-based speaker diarization correction: A generalizable approach}, author={Efstathiadis et al.}, journal={arXiv preprint arXiv:2406.04927}, year={2024} } ```