logo
ResearchBunny Logo
Addressing the Blind Spots in Spoken Language Processing

Computer Science

Addressing the Blind Spots in Spoken Language Processing

A. Moryossef

Discover how Amit Moryossef explores the overlooked dimensions of human communication in NLP, emphasizing the importance of nonverbal cues! This research innovatively integrates techniques from sign language processing to enhance automatic gesture recognition, bridging the digital gap of text-based analysis and real-world interactions.... show more
Abstract
This paper explores the critical but often overlooked role of non-verbal cues, including co-speech gestures and facial expressions, in human communication and their implications for Natural Language Processing (NLP). We argue that understanding human communication requires a more holistic approach that goes beyond textual or spoken words to include nonverbal elements. Borrowing from advances in sign language processing, we propose the development of universal automatic gesture segmentation and transcription models to transcribe these non-verbal cues into textual form. Such a methodology aims to bridge the blind spots in spoken language understanding, enhancing the scope and applicability of NLP models. Through motivating examples, we demonstrate the limitations of relying solely on text-based models. We propose a computationally efficient and flexible approach for incorporating non-verbal cues, which can seamlessly integrate with existing NLP pipelines. We conclude by calling upon the research community to contribute to the development of universal transcription methods and to validate their effectiveness in capturing the complexities of real-world, multi-modal interactions.
Publisher
This is not specified in the provided text
Published On
Jan 01, 2023
Authors
Amit Moryossef
Tags
Natural Language Processing
nonverbal cues
gesture segmentation
sign language processing
multimodal interactions
communication analysis
Listen, Learn & Level Up
Over 10,000 hours of research content in 25+ fields, available in 12+ languages.
No more digging through PDFs, just hit play and absorb the world's latest research in your language, on your time.
listen to research audio papers with researchbunny