Browse by author
Lookup NU author(s): Dr Nicola Lazzarini
Full text for this publication is not currently held within this repository. Alternative links are provided below where available.
© 2015 Elsevier B.V.Many classification problems must deal with imbalanced datasets where one class - the majority class - outnumbers the other classes. Standard classification methods do not provide accurate predictions in this setting since classification is generally biased towards the majority class. The minority classes are oftentimes the ones of interest (e.g., when they are associated with pathological conditions in patients), so methods for handling imbalanced datasets are critical.Using several different datasets, this paper evaluates the performance of state-of-the-art classification methods for handling the imbalance problem in both binary and multi-class datasets. Different strategies are considered, including the one-class and dimension reduction approaches, as well as their fusions. Moreover, some ensembles of classifiers are tested, in addition to stand-alone classifiers, to assess the effectiveness of ensembles in the presence of imbalance. Finally, a novel ensemble of ensembles is designed specifically to tackle the problem of class imbalance: the proposed ensemble does not need to be tuned separately for each dataset and outperforms all the other tested approaches.To validate our classifiers we resort to the KEEL-dataset repository, whose data partitions (training/test) are publicly available and have already been used in the open literature: as a consequence, it is possible to report a fair comparison among different approaches in the literature.Our best approach (MATLAB code and datasets not easily accessible elsewhere)
Author(s): Nanni L, Fantozzi C, Lazzarini N
Publication type: Article
Publication status: Published
Journal: Neurocomputing
Year: 2015
Volume: 158
Pages: 48-61
Print publication date: 22/06/2015
Online publication date: 13/02/2015
Acceptance date: 27/01/2015
ISSN (print): 0925-2312
ISSN (electronic): 1872-8286
Publisher: Elsevier
URL: http://doi.org/10.1016/j.neucom.2015.01.068
DOI: 10.1016/j.neucom.2015.01.068
Altmetrics provided by Altmetric