Setup
Install Packages
python3 -m venv env
source env/bin/activate
pip install -e .
git submodule update --init --recursive
pip install -e umx
Download VocalSet Dataset
wget "https://zenodo.org/record/1442513/files/VocalSet1-2.zip?download=1"
mv "VocalSet1-2.zip?download=1" VocalSet.zip
unzip VocalSet.zip
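Where wget and unzip are not available, the same download can be sketched in Python with the standard library only. The URL is the one from the wget command above; the helper names download and extract are illustrative and not part of this repository.

```python
import urllib.request
import zipfile
from pathlib import Path

# URL copied from the wget command above.
VOCALSET_URL = "https://zenodo.org/record/1442513/files/VocalSet1-2.zip?download=1"


def download(url, dest):
    """Download url to dest, skipping the download if dest already exists."""
    dest = Path(dest)
    if not dest.exists():
        urllib.request.urlretrieve(url, str(dest))
    return dest


def extract(archive, out_dir):
    """Unzip archive into out_dir and return the names of the archive members."""
    with zipfile.ZipFile(archive) as zf:
        zf.extractall(out_dir)
        return zf.namelist()


# Usage (performs a large download, so it is left commented out):
# archive = download(VOCALSET_URL, "VocalSet.zip")
# extract(archive, ".")
```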
Training
Steps
- Change the Wandb and data root variables in shell_vars.sh and source shell_vars.sh
python scripts/train.py +exp=default
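A forgotten source shell_vars.sh is an easy mistake to make, so a small pre-flight check can help. The sketch below assumes that shell_vars.sh exports DATASET_ROOT, WANDB_PROJECT and WANDB_ENTITY; these names are taken from the export lines later in this README, and the function missing_vars is hypothetical.

```python
import os

# Assumption: these are the variables shell_vars.sh is expected to export.
# The names come from the export lines later in this README.
REQUIRED_VARS = ("DATASET_ROOT", "WANDB_PROJECT", "WANDB_ENTITY")


def missing_vars(env=None, required=REQUIRED_VARS):
    """Return the required variable names that are unset or empty in env."""
    env = os.environ if env is None else env
    return [name for name in required if not env.get(name)]


# Usage:
# missing = missing_vars()
# if missing:
#     raise SystemExit("Run 'source shell_vars.sh' first; missing: " + ", ".join(missing))
```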
Experiments
Training parameters can be configured in cfg/exp/default.yaml. Here are some descriptions:
- num_kept_effects={[min, max]}: range of Kept effects to apply to each file. Inclusive.
- num_removed_effects={[min, max]}: range of Removed effects to apply to each file. Inclusive.
- model={model}: architecture to use (see 'Models').
- effects_to_keep={[effect]}: Effects to apply but not remove (see 'Effects').
- effects_to_remove={[effect]}: Effects to remove (see 'Effects').
- accelerator=null/'gpu': Use GPU (1 device) (default: null).
- render_files=True/False: Render files. Disable to skip rendering stage (default: True).
- render_root={path/to/dir}: Root directory to render files to (default: DATASET_ROOT).
These can also be specified on the command line. See cfg/exp/default.yaml for an example.
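When launching many runs, the command-line overrides can be assembled programmatically. The following is a minimal sketch: the helpers format_value, build_command and to_shell are hypothetical, and the key=value and [a,b] syntax mirrors the commands shown in this README rather than anything read from the training script itself.

```python
import shlex


def format_value(value):
    """Render a Python value in the key=value style used in this README."""
    if value is None:
        return "null"
    if isinstance(value, (list, tuple)):
        return "[" + ",".join(format_value(v) for v in value) + "]"
    return str(value)  # bools become True/False, as written above


def build_command(overrides, exp="default"):
    """Return the train.py argument list for a dict of parameter overrides."""
    args = ["python", "scripts/train.py", f"+exp={exp}"]
    args += [f"{key}={format_value(val)}" for key, val in overrides.items()]
    return args


def to_shell(overrides, exp="default"):
    """Return the same command as one shell-quoted string."""
    return shlex.join(build_command(overrides, exp))


# Usage:
# print(to_shell({"model": "demucs", "num_removed_effects": [0, 2],
#                 "effects_to_remove": ["reverb", "delay"], "accelerator": "gpu"}))
```

Quoting through shlex.join keeps the square brackets from being interpreted by the shell, which is the same reason the list arguments in the classification command further down are wrapped in double quotes.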
Models
umx
demucs
tcn
dcunet
dptnet
Effects
chorus
compressor
distortion
reverb
delay
Run inference on directory
Assumes directory is structured as
- root
    - clean
        - file1.wav
        - file2.wav
        - file3.wav
    - effected
        - file1.wav
        - file2.wav
        - file3.wav
Change the root path in shell_vars.sh and source shell_vars.sh
python scripts/chain_inference.py +exp=chain_inference_custom
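Before running inference, the directory layout described above can be checked with a short script. This sketch assumes that clean and effected files are paired by identical file names, as the listing suggests; the function check_inference_root is illustrative and not part of the repository.

```python
from pathlib import Path


def check_inference_root(root):
    """Return (paired, clean_only, effected_only) lists of .wav file names.

    Assumption: a clean file and its effected version share the same name.
    """
    root = Path(root)
    clean = {p.name for p in (root / "clean").glob("*.wav")}
    effected = {p.name for p in (root / "effected").glob("*.wav")}
    return (
        sorted(clean & effected),   # present in both directories
        sorted(clean - effected),   # missing an effected counterpart
        sorted(effected - clean),   # missing a clean counterpart
    )


# Usage:
# paired, clean_only, effected_only = check_inference_root("/path/to/root")
# if clean_only or effected_only:
#     print("Unpaired files:", clean_only + effected_only)
```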
Misc.
By default, files are rendered to input_dir / processed / {string_of_effects} / {train|val|test}.
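As an illustration of that layout, the rendered directory for a given split can be composed with pathlib. The function render_dir is hypothetical, and the effects string is passed in as-is because this README does not specify how it is built.

```python
from pathlib import Path

SPLITS = ("train", "val", "test")


def render_dir(input_dir, effects_string, split):
    """Return input_dir / processed / effects_string / split as a Path."""
    if split not in SPLITS:
        raise ValueError(f"split must be one of {SPLITS}, got {split!r}")
    return Path(input_dir) / "processed" / effects_string / split


# Usage (effects_string is whatever directory name the rendering stage produced):
# for split in SPLITS:
#     print(render_dir("/path/to/input_dir", effects_string, split))
```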
Download datasets:
python scripts/download.py vocalset guitarset idmt-smt-guitar idmt-smt-bass idmt-smt-drums
To run audio effects classification:
python scripts/train.py model=classifier "effects_to_use=[compressor, distortion, reverb, chorus, delay]" "effects_to_remove=[]" max_kept_effects=5 max_removed_effects=0 shuffle_kept_effects=True shuffle_removed_effects=True accelerator='gpu' render_root=/scratch/RemFX render_files=True
srun --comment harmonai --partition=g40 --gpus=1 --cpus-per-gpu=12 --job-name=harmonai --pty bash -i
source env/bin/activate
rsync -aP /fsx/home-csteinmetz1/data/EffectSet_cjs.tar /scratch
tar -xvf EffectSet_cjs.tar
mv scratch/EffectSet_cjs ./EffectSet_cjs
export DATASET_ROOT="/admin/home-csteinmetz1/data/remfx-data"
export WANDB_PROJECT="RemFX"
export WANDB_ENTITY="cjstein"
python scripts/train.py +exp=5-5.yaml model=cls_vggish render_files=False logs_dir=/scratch/cjs-log datamodule.batch_size=64
python scripts/train.py +exp=5-5.yaml model=cls_panns_pt render_files=False logs_dir=/scratch/cjs-log datamodule.batch_size=64
python scripts/train.py +exp=5-5.yaml model=cls_wav2vec2 render_files=False logs_dir=/scratch/cjs-log datamodule.batch_size=64
python scripts/train.py +exp=5-5.yaml model=cls_wav2clip render_files=False logs_dir=/scratch/cjs-log datamodule.batch_size=64
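The four runs above differ only in the model argument, so they can be generated in a loop. This is a sketch that reproduces the command lines exactly as written; the names MODELS, SHARED_ARGS and training_commands are illustrative.

```python
# Model names and shared arguments copied from the four commands above.
MODELS = ("cls_vggish", "cls_panns_pt", "cls_wav2vec2", "cls_wav2clip")

SHARED_ARGS = (
    "render_files=False",
    "logs_dir=/scratch/cjs-log",
    "datamodule.batch_size=64",
)


def training_commands(models=MODELS, exp="5-5.yaml", shared=SHARED_ARGS):
    """Return one train.py command line per model."""
    return [
        " ".join(["python", "scripts/train.py", f"+exp={exp}", f"model={m}", *shared])
        for m in models
    ]


# Usage: print the commands, or run them one after another with subprocess.
# import shlex, subprocess
# for cmd in training_commands():
#     subprocess.run(shlex.split(cmd), check=True)
```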
Installing HEAR models
wav2clip
pip install hearbaseline
pip install git+https://github.com/hohsiangwu/wav2clip-hear.git
pip install git+https://github.com/qiuqiangkong/HEAR2021_Challenge_PANNs
wget https://zenodo.org/record/6332525/files/hear2021-panns_hear.pth