NU author(s): Professor Hongsheng Dai
This work is licensed under a Creative Commons Attribution 4.0 International License (CC BY 4.0).
The rapid growth of data volume and velocity challenges traditional classification methods, because so much data can no longer be stored in memory, and developing online classification methods is becoming increasingly important. In recent years, online classification approaches have been subject to several assumptions, such as an independent and consistent distribution of initial values. In this paper, we present a general online Bayesian classification algorithm, based on the Monte Carlo fusion method, that can be adapted to handle streaming data. Rather than storing the entire raw data, we only update the posterior with historical data and the current batch of data. In addition, the proposed method can be applied to a wide range of situations with a high degree of accuracy, even in the case of data imbalances and discrepancies between sub-posterior distributions. The proposed algorithm was validated and evaluated through a comprehensive simulation study.
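The idea of updating a posterior batch by batch, without keeping the raw stream, can be illustrated with a minimal sketch. This is not the paper's Monte Carlo fusion algorithm: it swaps in a conjugate Beta-Bernoulli model, where the update is exact and closed-form, purely to show the bookkeeping. The class `BetaPosterior`, the function `stream_posterior`, the uniform Beta(1, 1) prior, and the toy batches are all assumptions made for illustration.

```python
from dataclasses import dataclass
from typing import Iterable, Sequence


@dataclass
class BetaPosterior:
    """Beta(alpha, beta) belief about a Bernoulli success probability.

    Hypothetical stand-in model; the uniform Beta(1, 1) prior is an assumption.
    """
    alpha: float = 1.0
    beta: float = 1.0

    def update(self, batch: Sequence[int]) -> "BetaPosterior":
        # Only the counts from the current batch are needed; the batch
        # itself can be discarded once the posterior has been updated.
        successes = sum(batch)
        failures = len(batch) - successes
        return BetaPosterior(self.alpha + successes, self.beta + failures)

    def mean(self) -> float:
        return self.alpha / (self.alpha + self.beta)


def stream_posterior(batches: Iterable[Sequence[int]]) -> BetaPosterior:
    """Fold the batches into the posterior one at a time."""
    posterior = BetaPosterior()
    for batch in batches:
        posterior = posterior.update(batch)
    return posterior


# Toy stream of 0/1 labels arriving in three batches (made-up data).
batches = [[1, 0, 1], [1, 1], [0, 0, 1, 1]]
online = stream_posterior(batches)

# Reference: the posterior computed from all the data at once.
full = BetaPosterior().update([x for b in batches for x in b])
```

In this conjugate case `online` and `full` coincide, so nothing is lost by discarding each batch after its update. For models without a closed-form update, combining the previous posterior with the current batch's sub-posterior requires an approximate or sampling-based step, which is the role the abstract assigns to Monte Carlo fusion.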
Author(s): Shi E, Xie J, Hu S, Sun K, Dai H, Jiang B, Kong L, Li L
Publication type: Article
Publication status: Published
Journal: Journal of Nonparametric Statistics
Year: 2024
Pages: ePub ahead of Print
Online publication date: 09/07/2024
Acceptance date: 23/05/2024
Date deposited: 13/06/2024
ISSN (print): 1048-5252
ISSN (electronic): 1029-0311
Publisher: Taylor & Francis
URL: https://doi.org/10.1080/10485252.2024.2368631
DOI: 10.1080/10485252.2024.2368631
ePrints DOI: 10.57711/p31x-zn59