An accurate automated speaker counting architecture based on James Webb Pattern
Article
Article Title | An accurate automated speaker counting architecture based on James Webb Pattern |
---|---|
ERA Journal ID | 32032 |
Article Category | Article |
Authors | Barua, Prabal Datta, Yildiz, Arif Metehan, Canpolat, Nida, Keles, Tugce, Dogan, Sengul, Baygin, Mehmet, Tuncer, Ilknur, Tuncer, Turker, Tan, Ru-San, Fujita, Hamido and Acharya, U. Rajendra |
Journal Title | Engineering Applications of Artificial Intelligence |
Journal Citation | 119 |
Article Number | 105821 |
Number of Pages | 12 |
Year | 2023 |
Publisher | Elsevier |
Place of Publication | United Kingdom |
ISSN | 0952-1976; 1873-6769 |
Digital Object Identifier (DOI) | https://doi.org/10.1016/j.engappai.2023.105821 |
Web Address (URL) | https://www.sciencedirect.com/science/article/pii/S0952197623000052 |
Abstract | Speaker counting is an important research area in sound forensics. Few speaker counting papers exist in the literature because datasets are challenging to collect. This work aims to collect a new overlapping speech signal dataset for speaker counting and to propose a novel feature engineering model. In this work, textural feature extraction is based on the iconic James Webb Space Telescope; hence, the pattern is named James Webb Pattern (JWPat). A new speaker counting speech dataset comprising 3,121 speeches divided into 32 classes (the class number corresponds to the number of speakers) was collected. A new framework that mimics a deep learning model has been proposed to classify the collected speech classes. The proposed feature engineering model is self-organized and uses various mother wavelet functions to generate features at both low and high levels. We obtained the best classification accuracy, 86.74%, using the symlet4 mother wavelet function. Using our proposed framework, eight classification results were calculated, with accuracies ranging from 75.94% to 86.74%. This range spans more than 10 percentage points and demonstrates the effect of the mother wavelet function on classification performance. Moreover, the feature extraction capability of the mirror of the James Webb telescope has been demonstrated. Our proposed method yielded 86.74% accuracy on a large dataset, which indicates the success of the proposed model. |
Keywords | Iterative neighborhood component analysis; James Webb pattern; Sound forensics; Speaker counting; Unbalanced tree discrete wavelet transform |
Contains Sensitive Content | Does not contain sensitive content |
ANZSRC Field of Research 2020 | 400306. Computational physiology |
Public Notes | Files associated with this item cannot be displayed due to copyright restrictions. |
Funder | Firat Üniversitesi |
Byline Affiliations | Ngee Ann Polytechnic, Singapore; Singapore University of Social Sciences (SUSS), Singapore; Asia University, Taiwan; School of Business; University of Technology Sydney; Firat University, Turkey; Ardahan University, Turkiye; Government office in Elazig, Turkiye; National Heart Centre, Singapore; Duke-NUS Medical School, Singapore; HUTECH University of Technology, Vietnam; University of Granada, Spain; Iwate Prefectural University, Japan |
https://research.usq.edu.au/item/yyw60/an-accurate-automated-speaker-counting-architecture-based-on-james-webb-pattern
Related outputs
Automated hip dysplasia detection using novel FlexiLBPHOG model with ultrasound images
Key, Sefa, Kurum, Huseyin, Esmez, Omer, Hafeez-Baig, Abdul, Hajiyeva, Rena, Dogan, Sengul and Tuncer, Turker. 2025. "Automated hip dysplasia detection using novel FlexiLBPHOG model with ultrasound images." Ain Shams Engineering Journal. 16 (1). https://doi.org/10.1016/j.asej.2024.103235
Artificial Intelligence-Based Suicide Prevention and Prediction: A Systematic Review (2019-2023)
Atmakuru, Anirudh, Shahini, Alen, Chakraborty, Subrata, Seoni, Silvia, Salvi, Massimo, Hafeez-Baig, Abdul, Rashid, Sadaf, Tan, Ru San, Barua, Prabal Datta, Molinari, Filippo and Acharya, U Rajendra. 2025. "Artificial Intelligence-Based Suicide Prevention and Prediction: A Systematic Review (2019-2023)." Information Fusion. 114. https://doi.org/10.1016/j.inffus.2024.102673
Explainable deeply-fused nets electricity demand prediction model: Factoring climate predictors for accuracy and deeper insights with probabilistic confidence interval and point-based forecasts
Ghimire, Sujan, AL-Musaylh, Mohanad S., Nguyen-Huy, Thong, Deo, Ravinesh C., Acharya, Rajendra, Casillas-Perez, David, Yaseen, Zaher Mundher and Salcedo-sanz, Sancho. 2025. "Explainable deeply-fused nets electricity demand prediction model: Factoring climate predictors for accuracy and deeper insights with probabilistic confidence interval and point-based forecasts." Applied Energy. 378 (Part A). https://doi.org/10.1016/j.apenergy.2024.124763
AttentionPoolMobileNeXt: An automated construction damage detection model based on a new convolutional neural network and deep feature engineering models
Aydin, Mehmet, Barua, Prabal Datta, Chadalavada, Sreenivasulu, Dogan, Sengul, Tuncer, Turker, Chakraborty, Subrata and Acharya, Rajendra U.. 2025. "AttentionPoolMobileNeXt: An automated construction damage detection model based on a new convolutional neural network and deep feature engineering models." Multimedia Tools and Applications. 84 (4), pp. 1821-1843. https://doi.org/10.1007/s11042-024-19163-2
Directed Lobish-based explainable feature engineering model with TTPat and CWINCA for EEG artifact classification
Tuncer, Turker, Dogan, Sengul, Baygin, Mehmet, Tasci, Irem, Mungen, Bulent, Tasci, Burak, Barua, Prabal Datta and Acharya, U.R.. 2024. "Directed Lobish-based explainable feature engineering model with TTPat and CWINCA for EEG artifact classification." Knowledge-Based Systems. 305. https://doi.org/10.1016/j.knosys.2024.112555
Retinal Health Screening Using Artificial Intelligence with Digital Fundus Images: A Review of the Last Decade (2012-2023)
Islam, Saad, Deo, Ravinesh C., Barua, Prabal Datta, Soar, Jeffrey, Yu, Ping and Acharya, U. Rajendra. 2024. "Retinal Health Screening Using Artificial Intelligence with Digital Fundus Images: A Review of the Last Decade (2012-2023)." IEEE Access. 12, pp. 176630-176685. https://doi.org/10.1109/ACCESS.2024.3477420
Automated EEG-based language detection using directed quantum pattern technique
Dogan, Sengul, Tuncer, Turker, Barua, Prabal Datta and Acharya, U.R.. 2024. "Automated EEG-based language detection using directed quantum pattern technique." Applied Soft Computing. 167 (Part A). https://doi.org/10.1016/j.asoc.2024.112301
A Novel Hybrid Model for Automatic Non-Small Cell Lung Cancer Classification Using Histopathological Images
Katar, Oguzhan, Yildirim, Ozal, Tan, Ru-San and Acharya, U Rajendra. 2024. "A Novel Hybrid Model for Automatic Non-Small Cell Lung Cancer Classification Using Histopathological Images." Diagnostics. 14 (22). https://doi.org/10.3390/diagnostics14222497
Synthetic Data Generation via Generative Adversarial Networks in Healthcare: A Systematic Review of Image- and Signal-Based Studies
Akpinar, Muhammed Halil, Sengur, Abdulkadir, Salvi, Massimo, Seoni, Silvia, Faust, Oliver, Mir, Hasan, Molinari, Filippo and Acharya, U. Rajendra. 2024. "Synthetic Data Generation via Generative Adversarial Networks in Healthcare: A Systematic Review of Image- and Signal-Based Studies." IEEE Open Journal of Engineering in Medicine and Biology. 6, pp. 183-192. https://doi.org/10.1109/OJEMB.2024.3508472
RECOMED: A comprehensive pharmaceutical recommendation system
Zomorodi, Mariam, Ghodsollahee, Ismail, Martin, Jennifer H, Talley, Nicholas J, Salari, Vahid, Pławiak, Paweł, Rahimi, Kazem and Acharya, U.R.. 2024. "RECOMED: A comprehensive pharmaceutical recommendation system." Artificial Intelligence in Medicine. 157. https://doi.org/10.1016/j.artmed.2024.102981
Artificial intelligence in assessing cardiovascular diseases and risk factors via retinal fundus images: A review of the last decade
Abdollahi, Mirsaeed, Jafarizadeh, Ali, Ghafouri-Asbagh, Amirhosein, Sobhi, Navid, Pourmoghtader, Keysan, Pedrammehr, Siamak, Asadi, Houshyar, Tan, Ru-San, Alizadehsani, Roohallah and Acharya, U. Rajendra. 2024. "Artificial intelligence in assessing cardiovascular diseases and risk factors via retinal fundus images: A review of the last decade." WIREs Data Mining and Knowledge Discovery. 14 (6). https://doi.org/10.1002/widm.1560
Early prediction of sudden cardiac death using multimodal fusion of ECG Features extracted from Hilbert–Huang and wavelet transforms with explainable vision transformer and CNN models
Telangore, Hardik, Azad, Victor, Sharma, Manish, Bhurane, Ankit, Tan, Ru San and Acharya, U. Rajendra. 2024. "Early prediction of sudden cardiac death using multimodal fusion of ECG Features extracted from Hilbert–Huang and wavelet transforms with explainable vision transformer and CNN models." Computer Methods and Programs in Biomedicine. 257. https://doi.org/10.1016/j.cmpb.2024.108455
A Pragmatic Approach to Fetal Monitoring via Cardiotocography Using Feature Elimination and Hyperparameter Optimization
Hardalac, Firat, Akmal, Haad, Ayturan, Kubilay, Acharya, U. Rajendra and Tan, Ru-San. 2024. "A Pragmatic Approach to Fetal Monitoring via Cardiotocography Using Feature Elimination and Hyperparameter Optimization." Interdisciplinary Sciences: Computational Life Sciences. 16 (4), pp. 882-906. https://doi.org/10.1007/s12539-024-00647-6
Automated System for the Detection of Heart Anomalies Using Phonocardiograms: A Systematic Review
Gudigar, Anjan, Raghavendra, U., Maithri, M., Samanth, Jyothi, Inamdar, Mahesh Anil, Vidhya, V., Vicnesh, Jahmunah, Prabhu, Mukund A., Tan, Ru-San, Yeong, Chai Hong, Molinari, Filippo and Acharya, U. R.. 2024. "Automated System for the Detection of Heart Anomalies Using Phonocardiograms: A Systematic Review." IEEE Access. 12, pp. 138399-138428. https://doi.org/10.1109/ACCESS.2024.3465511