cahya's picture
add model files
0bf8220
|
raw
history blame
181 Bytes
metadata
license: apache-2.0

Whisper large audio captioning

This model is a finetuned whisper-large-v2 model with 1M audio samples from the dataset mitermix/audiosnippets