Repository for converting trained music source separation models to CoreML format for efficient on-device inference on Apple hardware.
Thanks to ZFTurbo for the original model and inference implementations.
Models currently supported for CoreML conversion:

- MDX23C (based on the KUIELab TFC TDF v3 architecture). Key: `mdx23c`
- Demucs4HT [Paper]. Key: `htdemucs`
- Band Split RoFormer [Paper, Repository]. Key: `bs_roformer`
- Mel-Band RoFormer [Paper, Repository]. Key: `mel_band_roformer`
- SCNet [Paper, Official Repository, Unofficial Repository]. Key: `scnet`
Note: Thanks to @lucidrains for recreating the RoFormer models based on papers.
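The keys above are passed as `--model_type` to the conversion scripts to select an architecture. Internally this amounts to a key-to-constructor lookup; the sketch below illustrates that dispatch with hypothetical class and function names (the real scripts also build each model from its YAML config):

```python
# Hypothetical registry mapping model_type keys to architectures.
# Placeholder classes stand in for the real model implementations.
class MDX23C: pass
class HTDemucs: pass
class BSRoformer: pass
class MelBandRoformer: pass
class SCNet: pass

MODEL_REGISTRY = {
    "mdx23c": MDX23C,
    "htdemucs": HTDemucs,
    "bs_roformer": BSRoformer,
    "mel_band_roformer": MelBandRoformer,
    "scnet": SCNet,
}

def build_model(model_type: str):
    """Instantiate the architecture selected by --model_type."""
    try:
        return MODEL_REGISTRY[model_type]()
    except KeyError:
        raise ValueError(f"Unknown model_type: {model_type!r}; "
                         f"expected one of {sorted(MODEL_REGISTRY)}")
```

An unknown key fails fast with the list of valid choices, which is also how the CLI scripts surface typos in `--model_type`.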
To convert a trained model to CoreML format:

- Run the model conversion script:

```bash
python model_coreml_conversion.py \
    --model_type <model_type> \
    --config_path <config_path> \
    --checkpoint <checkpoint_path>
```

- Since iSTFT is not supported by CoreML, you must export it separately:

```bash
python istft_coreml_conversion.py \
    --model_type <model_type> \
    --config_path <config_path> \
    --checkpoint <checkpoint_path>
```

Example for a Mel-Band RoFormer vocal model:

```bash
# Convert the main model
python model_coreml_conversion.py \
    --model_type mel_band_roformer \
    --config_path configs/config_mel_band_roformer_vocals.yaml \
    --checkpoint results/model.ckpt

# Convert the iSTFT component separately
python istft_coreml_conversion.py \
    --model_type mel_band_roformer \
    --config_path configs/config_mel_band_roformer_vocals.yaml \
    --checkpoint results/model.ckpt
```

To test your converted CoreML models:
```bash
python test_coreml_conversion.py <path_to_model.mlpackage> <path_to_istft.mlpackage>
```

For example:

```bash
python test_coreml_conversion.py model.mlpackage istft.mlpackage
```

For regular inference without CoreML (to test the modules), you can run:

```bash
python inference_coreml.py \
    --model_type mdx23c \
    --config_path configs/config_mdx23c_musdb18.yaml \
    --start_check_point results/last_mdx23c.ckpt \
    --input_folder input/wavs/ \
    --store_dir separation_results/
```

This uses the same arguments as the original `inference.py` script.
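Separation models operate on fixed-size windows, so inference over a full-length track is usually done by splitting it into overlapping chunks and blending the per-chunk outputs back together. The sketch below shows this generic overlap-add pattern with triangular crossfade weights; it is an illustration of the technique, not this repository's exact chunking scheme:

```python
def separate_long_audio(x, separate_chunk, chunk=8, hop=4):
    """Apply a fixed-size model `separate_chunk` over a long signal
    `x` (a list of samples) with weighted overlap-add blending."""
    out = [0.0] * len(x)
    wsum = [0.0] * len(x)
    # Triangular weights so overlapping chunks crossfade smoothly.
    win = [min(n + 1, chunk - n) for n in range(chunk)]
    start = 0
    while start < len(x):
        seg = x[start:start + chunk]
        seg = seg + [0.0] * (chunk - len(seg))  # zero-pad the final chunk
        y = separate_chunk(seg)                 # fixed-size model call
        for n in range(min(chunk, len(x) - start)):
            out[start + n] += win[n] * y[n]
            wsum[start + n] += win[n]
        start += hop
    # Normalize by the accumulated weights so the blend is exact.
    return [o / w for o, w in zip(out, wsum)]
```

Because the output is normalized by the accumulated window weights, an identity `separate_chunk` reproduces the input exactly, which is a handy sanity check for any chunking scheme.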
- CoreML models are optimized for on-device inference on Apple hardware (iOS, macOS).
- The iSTFT component must be exported separately due to CoreML limitations.
- For fastest runtime performance, consider implementing iSTFT directly in Swift or C++. The iSTFT conversion provided here is for convenience.
- Make sure you have the necessary dependencies installed for CoreML conversion.
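On the note about implementing iSTFT natively: the operation is just an inverse FFT per frame followed by overlap-add, which ports directly to Swift (Accelerate/vDSP) or C++. Below is a compact pure-Python reference of the analysis/synthesis pair, assuming a periodic Hann window at 50% overlap (whose shifted copies sum to a constant, so plain overlap-add reconstructs the interior of the signal). It is a didactic sketch using a naive DFT, not the repository's exported iSTFT:

```python
import cmath
import math

def stft(x, n_fft=8, hop=4):
    """Naive STFT with a periodic Hann analysis window."""
    win = [0.5 - 0.5 * math.cos(2 * math.pi * n / n_fft) for n in range(n_fft)]
    frames = []
    for start in range(0, len(x) - n_fft + 1, hop):
        seg = [x[start + n] * win[n] for n in range(n_fft)]
        frames.append([sum(seg[n] * cmath.exp(-2j * math.pi * k * n / n_fft)
                           for n in range(n_fft)) for k in range(n_fft)])
    return frames

def istft(frames, n_fft=8, hop=4):
    """Inverse STFT: per-frame inverse DFT, then overlap-add.
    Hann at 50% overlap sums to 1, so interior samples are exact."""
    out = [0.0] * (hop * (len(frames) - 1) + n_fft)
    for i, frame in enumerate(frames):
        for n in range(n_fft):
            s = sum(frame[k] * cmath.exp(2j * math.pi * k * n / n_fft)
                    for k in range(n_fft)) / n_fft
            out[i * hop + n] += s.real
    return out
```

A native port would replace the inner DFT loops with a vendor FFT (vDSP on Apple platforms) while keeping the same windowing and overlap-add structure.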
- `configs/config_*.yaml` - configuration files for models
- `models/*` - set of available models for training and inference
- `dataset.py` - dataset which creates new samples for training
- `gui-wx.py` - GUI interface for code
- `inference.py` - process folder with music files and separate them
- `train.py` - main training code
- `train_accelerate.py` - experimental training code to use with the `accelerate` module. Speed up for MultiGPU.
- `utils.py` - common functions used by train/valid
- `valid.py` - validation of model with metrics
- `ensemble.py` - useful script to ensemble results of different models to make results better (see docs)
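Ensembling as done by `ensemble.py` combines the outputs of several models; the common baseline is a weighted sample-wise average of the separated waveforms. A minimal sketch of that idea with plain lists (a hypothetical helper; see the docs for the script's actual algorithms and options):

```python
def ensemble_waveforms(waveforms, weights=None):
    """Weighted sample-wise average of equal-length waveforms,
    e.g. the same stem separated by several different models."""
    if weights is None:
        weights = [1.0] * len(waveforms)
    total = sum(weights)
    n = len(waveforms[0])
    assert all(len(w) == n for w in waveforms), "waveform lengths must match"
    return [sum(wt * wav[i] for wt, wav in zip(weights, waveforms)) / total
            for i in range(n)]
```

Weighting lets a stronger model dominate the blend while weaker models still smooth out its artifacts.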
Look here: List of Pre-trained models
If you have trained good models, please share them. You can post the config and model weights in this issue.
Look here: Dataset types
Look here: Augmentations
Look here: GUI documentation or see tutorial on Youtube
```bibtex
@misc{solovyev2023benchmarks,
  title={Benchmarks and leaderboards for sound demixing tasks},
  author={Roman Solovyev and Alexander Stempkovskiy and Tatiana Habruseva},
  year={2023},
  eprint={2305.07489},
  archivePrefix={arXiv},
  primaryClass={cs.SD}
}
```