logo
ResearchBunny Logo
Implementing machine learning techniques for continuous emotion prediction from uniformly segmented voice recordings

Psychology

Implementing machine learning techniques for continuous emotion prediction from uniformly segmented voice recordings

H. Diemerling, L. Stresemann, et al.

Discover a groundbreaking method for predicting emotions from short audio samples! Researchers Hannes Diemerling, Leonie Stresemann, Tina Braun, and Timo von Oertzen have leveraged advanced machine learning techniques to achieve accuracy that rivals human evaluative benchmarks. Dive into the world of real-time emotion detection!

00:00
00:00
~3 min • Beginner • English
Abstract
Introduction: Emotional recognition from audio recordings is a rapidly advancing field, with significant implications for artificial intelligence and human-computer interaction. This study introduces a novel method for detecting emotions from short, 1.5 s audio samples, aiming to improve accuracy and efficiency in emotion recognition technologies. Methods: We utilized 1,510 unique audio samples from two databases in German and English to train our models. We extracted various features for emotion prediction, employing Deep Neural Networks (DNN) for general feature analysis, Convolutional Neural Networks (CNN) for spectrogram analysis, and a hybrid model combining both approaches (C-DNN). The study addressed challenges associated with dataset heterogeneity, language differences, and the complexities of audio sample trimming. Results: Our models demonstrated accuracy significantly surpassing random guessing, aligning closely with human evaluative benchmarks. This indicates the effectiveness of our approach in recognizing emotional states from brief audio clips. Discussion: Despite the challenges of integrating diverse datasets and managing short audio samples, our findings suggest considerable potential for this methodology in real-time emotion detection from continuous speech. This could contribute to improving the emotional intelligence of AI and its applications in various areas.
Publisher
Frontiers in Psychology
Published On
Authors
Hannes Diemerling, Leonie Stresemann, Tina Braun, Timo von Oertzen
Tags
emotion prediction
machine learning
audio analysis
Deep Neural Networks
real-time detection
Listen, Learn & Level Up
Over 10,000 hours of research content in 25+ fields, available in 12+ languages.
No more digging through PDFs, just hit play and absorb the world's latest research in your language, on your time.
listen to research audio papers with researchbunny