jakeBland commited on
Commit
0ab05ea
·
verified ·
1 Parent(s): a0b79e4

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +18 -7
README.md CHANGED
@@ -4,30 +4,41 @@ license: apache-2.0
4
  base_model: facebook/wav2vec2-xls-r-300m
5
  tags:
6
  - generated_from_trainer
 
 
 
7
  model-index:
8
- - name: test12
9
  results: []
10
  ---
11
 
12
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
13
  should probably proofread and complete it, then remove this comment. -->
14
 
15
- [<img src="https://raw.githubusercontent.com/wandb/assets/main/wandb-github-badge-28.svg" alt="Visualize in Weights & Biases" width="200" height="32"/>](https://wandb.ai/jakebland-bland-ai/vm_new/runs/bn1z5b94)
16
- # test12
17
 
18
- This model is a fine-tuned version of [facebook/wav2vec2-xls-r-300m](https://huggingface.co/facebook/wav2vec2-xls-r-300m) on an unknown dataset.
19
 
20
  ## Model description
21
 
22
- More information needed
23
 
24
  ## Intended uses & limitations
25
 
26
- More information needed
 
 
 
 
 
27
 
28
  ## Training and evaluation data
29
 
30
- More information needed
 
 
 
 
31
 
32
  ## Training procedure
33
 
 
4
  base_model: facebook/wav2vec2-xls-r-300m
5
  tags:
6
  - generated_from_trainer
7
+ - speech-recognition
8
+ - audio-classification
9
+ - voicemail-detection
10
  model-index:
11
+ - name: wav2vec-vm-finetune
12
  results: []
13
  ---
14
 
15
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
16
  should probably proofread and complete it, then remove this comment. -->
17
 
18
+ # wav2vec-vm-finetune
 
19
 
20
+ This model is a fine-tuned version of [facebook/wav2vec2-xls-r-300m](https://huggingface.co/facebook/wav2vec2-xls-r-300m) for **voicemail detection**. It is trained on a dataset of call recordings to distinguish between **voicemail greetings** and **live human responses**.
21
 
22
  ## Model description
23
 
24
+ This model builds on **wav2vec2-xls-r-300m**, a self-supervised speech model trained on large-scale multilingual data. We fine-tuned it on the first two seconds of a call. T**98% accuracy**.
25
 
26
  ## Intended uses & limitations
27
 
28
+ - Automated voicemail detection in AI-powered call assistants.
29
+ - Filtering voicemail responses in customer service and sales call automation.
30
+
31
+ - Only trianed on the English language.
32
+ - Assumes the voicemail track is isolated and contains no audio from the caller.
33
+ - Designed for the first two seconds of audio when calling a voicemail.
34
 
35
  ## Training and evaluation data
36
 
37
+ The model was trained on a proprietary dataset of call recordings, labeled as:
38
+ - **Live human responses**
39
+ - **Voicemail greetings**
40
+
41
+ The dataset includes diverse voicemail recordings across multiple types to improve generalization.
42
 
43
  ## Training procedure
44