Skip to content

Latest commit

 

History

History
54 lines (45 loc) · 1.88 KB

README.md

File metadata and controls

54 lines (45 loc) · 1.88 KB

Neural FCA for Blind Source Separation

This repository provides inference scripts for Neural FCA proposed in our paper: Neural Full-rank Spatial Covariance Analysis for Blind Source Separation.

Please cite as:

@article{bando2021neural,
  title={Neural Full-Rank Spatial Covariance Analysis for Blind Source Separation},
  author={Bando, Yoshiaki and Sekiguchi, Kouhei and Masuyama, Yoshiki and Nugraha, Aditya Arie and Fontaine, Mathieu and Yoshii, Kazuyoshi},
  journal={IEEE Signal Processing Letters},
  volume={28},
  pages={1670--1674},
  year={2021},
  publisher={IEEE}
}

Environments

Neural FCA was developed with Python 3.8 and the following requirements:

pip install -r requirements.txt

Pre-trained model

The pre-trained model used in the paper for separating speech mixtures can be downloaded from the following URL:

wget https://github.com/yoshipon/spl2021_neural-fca/releases/download/spl2021/model.zip
unzip model.zip

Source separation

  1. The pre-trained model can separate four-channel two-speech mixtures as follows:
python neural-fca/separate.py model/ input.wav output.wav

The model will predict three sources assuming two target sources and one noise source.

  1. If you want to perform the separation without inference-time parameter updates, run the following command:
python neural-fca/separate.py model/ input.wav output.wav --n_iter=0
  1. You can obtain neural-fca.png showing mixture and source spectrograms by:
python neural-fca/separate.py model/ input.wav output.wav --n_iter=0 --plot

License

This repository is released under the MIT License. The pre-trained model is released under the Creative Commons BY-NC-ND 4.0 License.

Contact

Yoshiaki Bando, [email protected]

National Institute of Advanced Industrial Science and Technology, Japan