Skip to content

Skonor/dla2023_ss

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

32 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Overview

This repo contains framework for SS training with Spex+ (classification head included)

Installation guide

pip install -r ./requirements.txt

To load checkpoints run:

python scripts/load_checkpoints.py

Dataset creation:

  1. Install train-clean-100 and test-clean parts from librispeech
python scripts/load_librispeech.py
  1. Creaete 10k (100 speakers) mixes for training and 1k mixes for evaluation
python scripts/create_mixes.py

This will create train and val directories in directory in data/datasets/LibriMixes, each containig 3 directories mix, refs, targets with audio files.

Training

To reproduce training do the following (All training was done on kaggle, so you will need to change paths in config)

  1. Train for 20k steps (batch size = 3)
python train.py -c hw_asr/configs/SpexPlus_config/config.json
  1. Finetune model for 20k steps with the same config and dataset:
python train.py -c hw_asr/configs/SpexPlus_config/config.json -f saved/checkpoints/spex_plus_20k/spex_plus_20k/model_weights.pth

(Here I used checkpoint loaded by load_checkpoints.py script. You can change -f path to your local weights destination)

Evaluation

For evaluating models on custom dataset do the following:

  1. Load checkpoints from training:
python scripts/load_checkpoints.py

This will create checkpoints dirs is saved/models/ contaning model weigths file and training config.

  1. Run test.py:
python test.py /librispeech_clean.json -r saved/checkpoints/spex_plus_finetuned/model_weights.pth -t <your_directory>

This will print out SI-SDR and CER metrics for each of prediction methods

Results

For Spex+ model trained for 20k steps we get the following results:

data SI-SDR PESQ
eval 8.24 1.92
public test 7.35 1.38

And after training for another 20k strps:

data SI-SDR PESQ
eval 9.70 2.07
public test 8.83 1.54

Credits

This repository is based on a heavily modified fork of pytorch-template repository.

About

No description, website, or topics provided.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published