RemFX
Christian J. Steinmetz

Setup

Install Packages

  1. python3 -m venv env
  2. source env/bin/activate
  3. pip install -e .
  4. git submodule update --init --recursive
  5. pip install -e umx

Download VocalSet Dataset

  1. wget -O VocalSet.zip https://zenodo.org/record/1442513/files/VocalSet1-2.zip?download=1
  2. unzip VocalSet.zip

Training

Steps

  1. Set the wandb and data root variables in shell_vars.sh, then run source shell_vars.sh
  2. python scripts/train.py +exp=default

Experiments

Training parameters can be configured in cfg/exp/default.yaml. Here are descriptions of the main options:

  • num_kept_effects={[min, max]} inclusive range of kept effects to apply to each file
  • num_removed_effects={[min, max]} inclusive range of removed effects to apply to each file
  • model={model} architecture to use (see 'Models')
  • effects_to_keep={[effect]} effects to apply but not remove (see 'Effects')
  • effects_to_remove={[effect]} effects to remove (see 'Effects')
  • accelerator=null/'gpu' use GPU (1 device) (default: null)
  • render_files=True/False render files; disable to skip the rendering stage (default: True)
  • render_root={path/to/dir} root directory to render files to (default: DATASET_ROOT)

These can also be specified on the command line; see cfg/exp/default.yaml for an example.
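As a rough illustration of how these key=value command-line overrides behave, here is a minimal sketch in plain Python. It mimics, but does not reproduce, the Hydra machinery the project actually uses; the default values shown are taken from the option descriptions above.

```python
# Sketch of key=value overrides layered on top of config defaults,
# mimicking (not reproducing) Hydra-style command-line overrides.

def apply_overrides(defaults, overrides):
    """Return a new config dict with "key=value" override strings applied."""
    cfg = dict(defaults)
    for item in overrides:
        key, _, value = item.partition("=")
        if key not in cfg:
            raise KeyError(f"unknown config key: {key}")
        cfg[key] = value
    return cfg

# Defaults mirroring the option list above.
defaults = {
    "model": "umx",
    "accelerator": None,        # default: null
    "render_files": "True",     # default: True
    "render_root": "DATASET_ROOT",
}

# e.g. python scripts/train.py +exp=default model=demucs accelerator='gpu'
cfg = apply_overrides(defaults, ["model=demucs", "accelerator=gpu"])
print(cfg["model"], cfg["accelerator"])
```

The real configuration additionally supports list-valued overrides such as effects_to_remove=[reverb, delay], which this sketch does not model.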

Models

  • umx
  • demucs
  • tcn
  • dcunet
  • dptnet

Effects

  • chorus
  • compressor
  • distortion
  • reverb
  • delay

Run inference on directory

Assumes the directory is structured as follows:

  • root
    • clean
      • file1.wav
      • file2.wav
      • file3.wav
    • effected
      • file1.wav
      • file2.wav
      • file3.wav
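A quick sanity check for this layout can be sketched as follows. This helper is not part of the repository; the requirement that clean/ and effected/ contain matching .wav filenames is an assumption based on the structure above.

```python
from pathlib import Path

def check_inference_dir(root):
    """Verify that clean/ and effected/ exist and contain matching .wav files."""
    root = Path(root)
    clean = {p.name for p in (root / "clean").glob("*.wav")}
    effected = {p.name for p in (root / "effected").glob("*.wav")}
    if not clean:
        raise ValueError("no .wav files found in clean/")
    if clean != effected:
        # symmetric difference: files present in one directory but not the other
        raise ValueError(f"mismatched files: {sorted(clean ^ effected)}")
    return sorted(clean)
```

Running this before chain_inference.py catches missing or misnamed pairs early rather than partway through inference.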

Set the root path in shell_vars.sh, then run source shell_vars.sh, and run:

python scripts/chain_inference.py +exp=chain_inference_custom

Misc.

By default, files are rendered to input_dir / processed / {string_of_effects} / {train|val|test}.
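The render path above can be sketched as below. Note that how the effect names are joined into {string_of_effects} (the underscore separator here) is an assumption for illustration, not the repository's actual implementation.

```python
from pathlib import Path

def render_dir(input_dir, effects, split):
    """Build the render output path: input_dir/processed/{string_of_effects}/{split}.
    The join of effect names into one string is assumed (underscore-separated)."""
    assert split in ("train", "val", "test")
    string_of_effects = "_".join(effects)  # assumed separator
    return Path(input_dir) / "processed" / string_of_effects / split

print(render_dir("/data/remfx", ["reverb", "delay"], "train"))
```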

Download datasets:

python scripts/download.py vocalset guitarset idmt-smt-guitar idmt-smt-bass idmt-smt-drums

To run audio effects classification:

python scripts/train.py model=classifier "effects_to_use=[compressor, distortion, reverb, chorus, delay]" "effects_to_remove=[]" max_kept_effects=5 max_removed_effects=0 shuffle_kept_effects=True shuffle_removed_effects=True accelerator='gpu' render_root=/scratch/RemFX render_files=True
srun --comment harmonai --partition=g40 --gpus=1 --cpus-per-gpu=12 --job-name=harmonai --pty bash -i
source env/bin/activate
rsync -aP /fsx/home-csteinmetz1/data/EffectSet_cjs.tar /scratch
tar -xvf EffectSet_cjs.tar
mv scratch/EffectSet_cjs ./EffectSet_cjs

export DATASET_ROOT="/admin/home-csteinmetz1/data/remfx-data"
export WANDB_PROJECT="RemFX"
export WANDB_ENTITY="cjstein"

python scripts/train.py +exp=5-5.yaml model=cls_vggish render_files=False logs_dir=/scratch/cjs-log datamodule.batch_size=64
python scripts/train.py +exp=5-5.yaml model=cls_panns_pt render_files=False logs_dir=/scratch/cjs-log datamodule.batch_size=64
python scripts/train.py +exp=5-5.yaml model=cls_wav2vec2 render_files=False logs_dir=/scratch/cjs-log datamodule.batch_size=64
python scripts/train.py +exp=5-5.yaml model=cls_wav2clip render_files=False logs_dir=/scratch/cjs-log datamodule.batch_size=64

Installing HEAR models

wav2clip

pip install hearbaseline
pip install git+https://github.com/hohsiangwu/wav2clip-hear.git

PANNs

pip install git+https://github.com/qiuqiangkong/HEAR2021_Challenge_PANNs
wget https://zenodo.org/record/6332525/files/hear2021-panns_hear.pth