Update README.md
Browse files
README.md
CHANGED
@@ -4,4 +4,45 @@ language:
|
|
4 |
- en
|
5 |
tags:
|
6 |
- medical
|
7 |
-
---
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
4 |
- en
|
5 |
tags:
|
6 |
- medical
|
7 |
+
---
|
8 |
+
|
9 |
+
# Model Card for Fine-Tuned Whisper Model: Hypernasality Identification
|
10 |
+
|
11 |
+
This model card details the Whisper model, fine-tuned to identify hypernasality in patients from audio recordings. The model is developed to assist in the diagnosis and monitoring of speech disorders.
|
12 |
+
|
13 |
+
## Model Details
|
14 |
+
|
15 |
+
### Model Description
|
16 |
+
- **Developed by**: The Data Science team at the Vanderbilt Data Science Institute
|
17 |
+
- **Funded by**: This project was supported by healthcare-focused research grants and institutional funding.
|
18 |
+
- **Shared by**: The model is shared by Vanderbilt University with the aim to assist clinicians and speech therapists.
|
19 |
+
- **Model type**: Deep Learning, Neural Network (Fine-Tuned Whisper Model)
|
20 |
+
- **Language(s)** (NLP): The model primarily focuses on English language audio samples but has the potential to be adapted for multilingual use.
|
21 |
+
- **License**: The model is released under an open-source license for non-commercial use, details of which can be found in the repository.
|
22 |
+
- **Finetuned from model**: The base model is OpenAI's Whisper, which has been fine-tuned using a specialized dataset for hypernasality detection.
|
23 |
+
|
24 |
+
|
25 |
+
### Model Sources
|
26 |
+
- **Repository**: https://github.com/jaewoocho/Transformers_whisper_cleft/tree/main?tab=readme-ov-file
|
27 |
+
- **Demo** : https://huggingface.co/spaces/jcho02/Transformers_whisper_cleft
|
28 |
+
|
29 |
+
## Uses
|
30 |
+
|
31 |
+
### Direct Use
|
32 |
+
- This model can be directly used by healthcare professionals to analyze patient speech samples for signs of hypernasality, aiding in diagnosis and treatment planning.
|
33 |
+
|
34 |
+
### Downstream Use
|
35 |
+
- Potential use in automated speech therapy tools or in research settings for studying speech disorders.
|
36 |
+
|
37 |
+
### Out-of-Scope Use
|
38 |
+
- Not intended for non-medical use or as a sole diagnostic tool without clinical oversight.
|
39 |
+
|
40 |
+
## Bias, Risks, and Limitations
|
41 |
+
- The model might have limitations in accurately identifying hypernasality across diverse accents and languages. User discretion is advised, and results should be considered alongside clinical assessments.
|
42 |
+
|
43 |
+
## Recommendations
|
44 |
+
- Healthcare professionals should use this tool as a supplement to traditional diagnostic methods. Continuous updating with diverse datasets is recommended to improve accuracy.
|
45 |
+
|
46 |
+
## How to Get Started with the Model
|
47 |
+
- Use the following code snippet to get started with the model:
|
48 |
+
- https://github.com/jaewoocho/Transformers_whisper_cleft/tree/main?tab=readme-ov-file
|