Domain Adapting Deep Reinforcement Learning for Real-world Speech Emotion Recognition
Article
Article Title | Domain Adapting Deep Reinforcement Learning for Real-world Speech Emotion Recognition |
---|---|
ERA Journal ID | 210567 |
Article Category | Article |
Authors | Rajapakshe, Thejan, Rana, Rajib, Khalifa, Sara and Schuller, Björn W. |
Journal Title | IEEE Access |
Journal Citation | 12, pp. 193101-193114 |
Number of Pages | 14 |
Year | 2024 |
Publisher | IEEE (Institute of Electrical and Electronics Engineers) |
Place of Publication | United States |
ISSN | 2169-3536 |
Digital Object Identifier (DOI) | https://doi.org/10.1109/ACCESS.2024.3519761 |
Web Address (URL) | https://ieeexplore.ieee.org/document/10806705 |
Abstract | Speech-emotion recognition (SER) enables computers to engage with people in an emotionally intelligent way. The inability to adapt an existing model to a new domain is one of the significant limitations of SER methods. To overcome this challenge, domain adaptation techniques have been developed to transfer the knowledge learnt by a model across domains. Although existing domain adaptation techniques have improved the performance of SER models across domains, there is a need to improve their ability to adapt to real-world situations where models can self-tune while deployed. This paper presents a deep reinforcement learning-based strategy (RL-DA) for adapting a pre-trained SER model to a real-world setting by interacting with the environment and collecting continuous feedback. The proposed RL-DA technique is evaluated on SER tasks, including cross-corpus and cross-language domain adaptation scenarios. Our evaluation results show that RL-DA achieves significant improvements of 11% and 14% in testing accuracy over a fully supervised baseline for cross-corpus and cross-language scenarios, respectively, in the real-world setting. This technique also outperforms the baseline model’s performance for both speaker independent and speaker dependent SER tasks. |
Keywords | Reinforcement learning; speech emotion recognition; domain adaptation |
Contains Sensitive Content | Does not contain sensitive content |
ANZSRC Field of Research 2020 | 461103. Deep learning |
Byline Affiliations | School of Mathematics, Physics and Computing |
Queensland University of Technology | |
University of Augsburg, Germany | |
Technical University of Munich, Germany |
https://research.usq.edu.au/item/zqwvq/domain-adapting-deep-reinforcement-learning-for-real-world-speech-emotion-recognition
Download files
Published Version
Domain_Adapting_Deep_Reinforcement_Learning_for_Real-World_Speech_Emotion_Recognition.pdf | ||
License: CC BY 4.0 | ||
File access level: Anyone |
14
total views1
total downloads14
views this month1
downloads this month