Unsupervised learning for monaural source separation using maximization-minimization algorithm with time-frequency deconvolution

Woo, WL; Gao, B; Bouridane, A; Ling, BW-K; Chin, CS

doi:10.3390/s18051371

Unsupervised learning for monaural source separation using maximization-minimization algorithm with time-frequency deconvolution

Lookup NU author(s): Dr Wai Lok Woo, Dr Bin Gao, Professor Cheng Chin ORCiD

Downloads

Published version [.pdf]

Licence

This work is licensed under a Creative Commons Attribution 4.0 International License (CC BY 4.0).

Abstract

© 2018 by the authors. Licensee MDPI, Basel, Switzerland. This paper presents an unsupervised learning algorithm for sparse nonnegative matrix factor time-frequency deconvolution with optimized fractional β-divergence. The β-divergence is a group of cost functions parametrized by a single parameter β. The Itakura-Saito divergence, Kullback-Leibler divergence and Least Square distance are special cases that correspond to β = 0, 1, 2, respectively. This paper presents a generalized algorithm that uses a flexible range of β that includes fractional values. It describes a maximization-minimization (MM) algorithm leading to the development of a fast convergence multiplicative update algorithm with guaranteed convergence. The proposed model operates in the time-frequency domain and decomposes an information-bearing matrix into two-dimensional deconvolution of factor matrices that represent the spectral dictionary and temporal codes. The deconvolution process has been optimized to yield sparse temporal codes through maximizing the likelihood of the observations. The paper also presents a method to estimate the fractional β value. The method is demonstrated on separating audio mixtures recorded from a single channel. The paper shows that the extraction of the spectral dictionary and temporal codes is significantly more efficient by using the proposed algorithm and subsequently leads to better source separation performance. Experimental tests and comparisons with other factorization methods have been conducted to verify its efficacy.

Publication metadata

Author(s): Woo WL, Gao B, Bouridane A, Ling BW-K, Chin CS

Publication type: Article

Publication status: Published

Journal: Sensors

Year: 2018

Volume: 18

Issue: 5

Online publication date: 27/04/2018

Acceptance date: 24/04/2018

Date deposited: 30/05/2018

ISSN (electronic): 1424-8220

Publisher: MDPIAG

URL: https://doi.org/10.3390/s18051371

DOI: 10.3390/s18051371

Altmetrics

Altmetrics provided by Altmetric

ePrints

Unsupervised learning for monaural source separation using maximization-minimization algorithm with time-frequency deconvolution

Downloads

Licence

Abstract

Publication metadata

Altmetrics

Share