Grais, Emad Mounir and Erdoğan, Hakan (2013) Spectro-temporal post-enhancement using MMSE estimation in NMF based single-channel source separation. In: 14th Annual Conference of the International Speech Communication Association (InterSpeech 2013), Lyon, France
PDF
1119_Paper_final.pdf
Download (264kB)
1119_Paper_final.pdf
Download (264kB)
Abstract
We propose to use minimum mean squared error (MMSE) estimates to enhance the signals that are separated by nonnegative matrix factorization (NMF). In single channel source separation (SCSS), NMF is used to train a set of basis vectors for each source from their training spectrograms. Then NMF is used to decompose the mixed signal spectrogram as a weighted linear combination of the trained basis vectors from which estimates of each corresponding source can be obtained. In this work, we deal with the spectrogram of each separated signal as a 2D distorted signal that needs to be restored. A multiplicative distortion model is assumed where the logarithm of the true signal distribution is modeled with a Gaussian mixture model (GMM) and the distortion is modeled as having a log-normal distribution. The parameters of the GMM are learned from training data whereas the distortion parameters are learned online from each separated signal. The initial source estimates are improved and replaced with their MMSE estimates under this new probabilistic framework. The experimental results show that using the proposed MMSE estimation technique as a post enhancement after NMF improves the quality of the separated signal.
Item Type: | Papers in Conference Proceedings |
---|---|
Uncontrolled Keywords: | Single channel source separation, nonnegative matrix factorization, Minimum mean square error estimates, and Gaussian mixture models |
Subjects: | T Technology > T Technology (General) T Technology > TK Electrical engineering. Electronics Nuclear engineering |
Divisions: | Faculty of Engineering and Natural Sciences > Academic programs > Electronics Faculty of Engineering and Natural Sciences |
Depositing User: | Emad Mounir Grais Girgis |
Date Deposited: | 10 Oct 2013 14:27 |
Last Modified: | 26 Apr 2022 09:10 |
URI: | https://research.sabanciuniv.edu/id/eprint/21798 |