Browse by author
Lookup NU author(s): Dr Varun OjhaORCiD
Full text for this publication is not currently held within this repository. Alternative links are provided below where available.
© 2021, The Author(s), under exclusive license to Springer Nature Switzerland AG. This research is concerned with taking user input in the form of speech data to classify and then predict which region of the United Kingdom the user is from and their gender. This research was conducted on regional accents, data preprocessing, Fourier transforms, and deep learning modeling. Due to lack of publicly available datasets for this type of research, a dataset was created from scratch (12 regions with a 1:1 gender ratio). In this paper, we propose modeling the human’s voice accent and voice gender recognition as a classification task. We used a deep convolution neural network, and experimentally developed an architecture that maximized the classification accuracy of the mentioned tasks simultaneously. We also tested the model on publicly available spoken digit detests. We find that the gender classification is relatively easier to predict with high accuracy than the accent in our proposed multi-class classification model. Accent classification was found difficult because of the regional accent’s overlapping that prevents it from being classified with high accuracy.
Author(s): Shergill JS, Pravin C, Ojha V
Publication type: Conference Proceedings (inc. Abstract)
Publication status: Published
Conference Name: 20th International Conference on Hybrid Intelligent Systems (HIS 2020)
Year of Conference: 2021
Pages: 62-72
Print publication date: 17/04/2021
Online publication date: 16/04/2021
Acceptance date: 02/04/2018
ISSN: 2194-5357
Publisher: Springer
URL: https://doi.org/10.1007/978-3-030-73050-5_7
DOI: 10.1007/978-3-030-73050-5_7
Library holdings: Search Newcastle University Library for this item
Series Title: Advances in Intelligent Systems and Computing
ISBN: 9783030730499