On Multimodal Emotion Recognition for Human-Chatbot Interaction in the Wild

Computer Science

N. Kovačević, M. Gross, et al.

A study conducted by Nikola Kovačević, Markus Gross, Christian Holz, and Rafael Wampfler collected multimodal text, audio, and video from 99 participants interacting with a GPT-3-based chatbot over three weeks, uncovering a strong domain gap between human-human and human-chatbot emotion signals and showing that personalizing the model to the user can boost recognition performance by up to 38% for user emotions and up to 41% for perceived chatbot emotions.

Abstract
The field of natural language generation is swiftly evolving, giving rise to powerful conversational characters for use in different applications such as entertainment, education, and healthcare. A central aspect of these applications is providing personalized interactions, driven by the ability of the characters to recognize and adapt to user emotions. Current emotion recognition models primarily rely on datasets collected from actors or in controlled laboratory settings focusing on human-human interactions, which hinders their adaptability to real-world applications for conversational agents. In this work, we unveil the complexity of human-chatbot emotion recognition in the wild. We collected a multimodal dataset consisting of text, audio, and video recordings from 99 participants while they conversed with a GPT-3-based chatbot over three weeks. Using different transformer-based multimodal emotion recognition networks, we provide evidence for a strong domain gap between human-human interaction and human-chatbot interaction that is attributed to the subjective nature of self-reported emotion labels, the reduced activation and expressivity of the face, and the inherent subtlety of emotions in such settings, emphasizing the challenges of recognizing user emotions in real-world contexts. We show how personalizing our model to the user increases the model performance by up to 38% (user emotions) and up to 41% (perceived chatbot emotions), highlighting the potential of personalization for overcoming the observed domain gap.
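The personalization result described in the abstract — adapting a generic emotion classifier to an individual user with a small amount of that user's labeled data — can be illustrated with a toy sketch. This is not the paper's transformer pipeline: it is a minimal NumPy softmax-regression head over concatenated ("late-fused") modality features, with synthetic data in which each user has an invented idiosyncratic offset. All dimensions, names, and numbers here are assumptions for illustration only.

```python
import numpy as np

rng = np.random.default_rng(0)

# Invented dimensions: per-modality feature sizes and emotion classes.
D_TEXT, D_AUDIO, D_VIDEO = 8, 6, 4
D = D_TEXT + D_AUDIO + D_VIDEO  # fused feature size (concatenation)
N_CLASSES = 3                   # e.g. negative / neutral / positive

def softmax(z):
    z = z - z.max(axis=-1, keepdims=True)
    e = np.exp(z)
    return e / e.sum(axis=-1, keepdims=True)

class LinearHead:
    """Softmax-regression head over fused (concatenated) modality features."""
    def __init__(self, d_in=D, n_classes=N_CLASSES):
        self.W = np.zeros((d_in, n_classes))
        self.b = np.zeros(n_classes)

    def predict_proba(self, X):
        return softmax(X @ self.W + self.b)

    def fit(self, X, y, lr=0.5, epochs=300):
        """Full-batch cross-entropy gradient descent. Calling fit again on
        one user's data fine-tunes (personalizes) the head in place."""
        Y = np.eye(self.W.shape[1])[y]
        for _ in range(epochs):
            err = self.predict_proba(X) - Y
            self.W -= lr * (X.T @ err) / len(X)
            self.b -= lr * err.mean(axis=0)

    def accuracy(self, X, y):
        return float((self.predict_proba(X).argmax(axis=1) == y).mean())

# Synthetic world: a shared mapping from features to emotion, plus a
# per-user offset modeling how differently each user expresses emotions.
true_W = rng.normal(size=(D, N_CLASSES))

def make_data(n, user_shift):
    X = rng.normal(size=(n, D))  # fused text|audio|video features
    y = ((X + user_shift) @ true_W).argmax(axis=1)
    return X, y

# Train a generic model on population data (no per-user shift).
generic = LinearHead()
X_pop, y_pop = make_data(2000, np.zeros(D))
generic.fit(X_pop, y_pop)

# An unseen user whose expressive style differs from the population.
user_shift = rng.normal(size=D) * 2.0
X_test, y_test = make_data(500, user_shift)

# Personalize: copy the generic head and fine-tune on a little user data.
personal = LinearHead()
personal.W, personal.b = generic.W.copy(), generic.b.copy()
X_user, y_user = make_data(100, user_shift)
personal.fit(X_user, y_user)

print(f"generic accuracy on user:      {generic.accuracy(X_test, y_test):.2f}")
print(f"personalized accuracy on user: {personal.accuracy(X_test, y_test):.2f}")
```

The sketch only mirrors the high-level idea: a model trained on aggregate data misreads a user whose signals deviate from the population, and a few user-specific labels recover much of that loss — analogous to (but far simpler than) the personalization gains the paper reports.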
Publisher
Proceedings of the International Conference on Multimodal Interaction (ICMI '24)
Published On
Nov 04, 2024
Authors
Nikola Kovačević, Markus Gross, Christian Holz, Rafael Wampfler
Tags
multimodal emotion recognition
human-chatbot interaction
GPT-3
domain gap
personalization
transformer models
in-the-wild dataset