griko
/

height_reg_svr_ecapa_voxceleb

Audio Classification

height-estimation

speaker-characteristics

speaker-recognition

Model card Files Files and versions Community

griko commited on Nov 17, 2024

Commit

da371ad

·

verified ·

1 Parent(s): 350d4c0

Upload folder using huggingface_hub

Files changed (1) hide show

README.md +2 -2

README.md CHANGED Viewed

@@ -27,7 +27,7 @@ This model combines the SpeechBrain ECAPA-TDNN speaker embedding model with an S
   - TIMIT test set: 6.02 cm Mean Absolute Error (MAE)
 ## Training Data
-The model was trained on VoxCeleb2 dataset:
 - Audio preprocessing:
   - Converted to WAV format, single channel, 16kHz sampling rate, 256 kp/s bitrate
   - Applied SileroVAD for voice activity detection, taking the first voiced segment
@@ -43,7 +43,7 @@ pip install git+https://github.com/griko/voice-height-regression.git
 ## Usage
 ```python
-from height_regressor import HeightRegressionPipeline
 # Load the pipeline
 regressor = HeightRegressionPipeline.from_pretrained(

   - TIMIT test set: 6.02 cm Mean Absolute Error (MAE)
 ## Training Data
+The model was trained on height enriched VoxCeleb2 dataset (for details read the paper):
 - Audio preprocessing:
   - Converted to WAV format, single channel, 16kHz sampling rate, 256 kp/s bitrate
   - Applied SileroVAD for voice activity detection, taking the first voiced segment
 ## Usage
 ```python
+from voice_height_regressor import HeightRegressionPipeline
 # Load the pipeline
 regressor = HeightRegressionPipeline.from_pretrained(