Domain Adapting Deep Reinforcement Learning for Real-world Speech Emotion Recognition
Article
| Article Title | Domain Adapting Deep Reinforcement Learning for Real-world Speech Emotion Recognition |
|---|---|
| ERA Journal ID | 210567 |
| Article Category | Article |
| Authors | Rajapakshe, Thejan, Rana, Rajib, Khalifa, Sara and Schuller, Björn W. |
| Journal Title | IEEE Access |
| Journal Citation | 12, pp. 193101-193114 |
| Number of Pages | 14 |
| Year | 2024 |
| Publisher | IEEE (Institute of Electrical and Electronics Engineers) |
| Place of Publication | United States |
| ISSN | 2169-3536 |
| Digital Object Identifier (DOI) | https://doi.org/10.1109/ACCESS.2024.3519761 |
| Web Address (URL) | https://ieeexplore.ieee.org/document/10806705 |
| Abstract | Speech-emotion recognition (SER) enables computers to engage with people in an emotionally intelligent way. The inability to adapt an existing model to a new domain is one of the significant limitations of SER methods. To overcome this challenge, domain adaptation techniques have been developed to transfer the knowledge learnt by a model across domains. Although existing domain adaptation techniques have improved the performance of SER models across domains, there is a need to improve their ability to adapt to real-world situations where models can self-tune while deployed. This paper presents a deep reinforcement learning-based strategy (RL-DA) for adapting a pre-trained SER model to a real-world setting by interacting with the environment and collecting continuous feedback. The proposed RL-DA technique is evaluated on SER tasks, including cross-corpus and cross-language domain adaptation scenarios. Our evaluation results show that RL-DA achieves significant improvements of 11% and 14% in testing accuracy over a fully supervised baseline for cross-corpus and cross-language scenarios, respectively, in the real-world setting. This technique also outperforms the baseline model’s performance for both speaker independent and speaker dependent SER tasks. |
| Keywords | Reinforcement learning; speech emotion recognition; domain adaptation |
| Contains Sensitive Content | Does not contain sensitive content |
| ANZSRC Field of Research 2020 | 461103. Deep learning |
| Byline Affiliations | School of Mathematics, Physics and Computing |
| Queensland University of Technology | |
| University of Augsburg, Germany | |
| Technical University of Munich, Germany |
https://research.usq.edu.au/item/zqwvq/domain-adapting-deep-reinforcement-learning-for-real-world-speech-emotion-recognition
Download files
Published Version
| Domain_Adapting_Deep_Reinforcement_Learning_for_Real-World_Speech_Emotion_Recognition.pdf | ||
| License: CC BY 4.0 | ||
| File access level: Anyone | ||
54
total views60
total downloads1
views this month3
downloads this month