Mental Health Diagnosis From Voice Data Using Convolutional Neural Networks and Vision Transformers
Article
Article Title | Mental Health Diagnosis From Voice Data Using Convolutional Neural Networks and Vision Transformers |
---|---|
ERA Journal ID | 9234 |
Article Category | Article |
Authors | Islam, Rafiul, Ahad, Md. Taimur, Ahmed, Faruk, Song, Bo and Li, Yan |
Journal Title | Journal of Voice |
Number of Pages | 12 |
Year | 2024 |
Publisher | Elsevier |
Place of Publication | United States |
ISSN | 0892-1997 |
1873-4588 | |
Digital Object Identifier (DOI) | https://doi.org/10.1016/j.jvoice.2024.10.010 |
Web Address (URL) | https://www.sciencedirect.com/science/article/pii/S0892199724003539 |
Abstract | Integrating Convolutional Neural Networks and Vision Transformers in voice analysis has unveiled a new horizon in mental health identification. Human voice, a powerful indicator of mental health, was the focus of this study. Human voice data representing stable and unstable conditions were gathered from various mental health institutions in Bangladesh. The results of the experiment suggest that the proposed model achieved 91% accuracy, precision of 92% for the “Unstable” category and 90% for the “Stable” category, and recall of 91% for the “Stable” category and 92% for the “Unstable” category. In addition, a high F1 score of 91% was achieved. This study significantly contributes to computer-aided diagnosis in mental health by using deep learning (DL) to diagnose mental well-being. Our research underscores the substantial impact of DL on the advancement of mental health care, instilling hope for a brighter future in mental health care. |
Keywords | Machine learning; Mental stability; Mental health diagnostics; Convolutional Neural Network; Vision Transformer; Voice analysis; Integrated model; Early detection |
Contains Sensitive Content | Does not contain sensitive content |
ANZSRC Field of Research 2020 | 400399. Biomedical engineering not elsewhere classified |
461104. Neural networks | |
Public Notes | Files associated with this item cannot be displayed due to copyright restrictions. |
Byline Affiliations | University of Dhaka, Bangladesh |
School of Mathematics, Physics and Computing | |
School of Engineering |
https://research.usq.edu.au/item/zq75w/mental-health-diagnosis-from-voice-data-using-convolutional-neural-networks-and-vision-transformers
4
total views0
total downloads4
views this month0
downloads this month