Please use this identifier to cite or link to this item:
http://localhost:8080/xmlui/handle/123456789/8206
Title: | Speech Gender Recognition Using a Multilayer Feature Extraction Method |
Authors: | Abdulmohsin, H. A. Hasan, S. S. Al-Khateeb, Belal |
Keywords: | Automatic speech recognition Speech gender recognition Backpropagation NN GMM Common voice dataset |
Issue Date: | 1-Jan-2022 |
Publisher: | IEEE |
Abstract: | Human speech contains paralinguistic properties used in automatic speech recognition (ASR) systems. These properties are used in manyASRapplications such as gender recognition, which is the main goal of this paper. Gender recognition has been the target of many researchers since recognizing the human gender (female or male) is essential in many applications especially in security applications. Through this work, an ASR has been proposed and implemented. The main goal of any ASR system is to determine the best features that can address the required recognition. The features deployed in this work are smoothness, pitch, the first two formants and spectral centroid variability (SCV). The new approach proposed in this work was using the analysis of variance (ANOVA) as a feature selector to choose the best combination of features that can lead to the best classification accuracy, and then apply the decision tree feature selection algorithm to choose the best group of features. Then use backpropagation neural network (NN), Gaussian mixture models (GMM) and SVM as separate classifiers. The common voice dataset was used as benchmark dataset through all experiments of this work. The best result gained with respect to the three genders was 74.87% using the pitch and the first two formant features and classified by NN. The best result gained with respect to the two genders (female and male) was 97.71% using the pitch, and the first two formant features are classified by NN. |
URI: | http://localhost:8080/xmlui/handle/123456789/8206 |
Appears in Collections: | قسم علوم الحاسبات |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
111111111111.pdf | 237.39 kB | Adobe PDF | View/Open |
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.