Computer ScienceProceedings of the 1st Workshop on Computational Humor (CHum)

Text Is Not All You Need: Multimodal Prompting Helps LLMs Understand Humor

A. Baluja

LLMs excel at text but can miss the punchline—humor often depends on pronunciation and intonation. This study presents a simple multimodal prompting method that supplies an LLM with both the joke text and a TTS-generated spoken form, improving humor explanations across datasets. Research conducted by Ashwin Baluja (Northwestern University).... show more

General Summary Metrics

Abstract

While Large Language Models (LLMs) have demonstrated impressive natural language understanding capabilities across various text-based tasks, understanding humor has remained a persistent challenge. Humor is frequently multimodal, relying not only on the meaning of the words, but also their pronunciations, and even the speaker's intonations. In this study, we explore a simple multimodal prompting approach to humor understanding and explanation. We present an LLM with both the text and the spoken form of a joke, generated using an off-the-shelf text-to-speech (TTS) system. Using multimodal cues improves the explanations of humor compared to textual prompts across all tested datasets.

Publisher

Proceedings of the 1st Workshop on Computational Humor (CHum)

Published On

Jan 19, 2025

Authors

Ashwin Baluja

DOI

https://doi.org/10.48550/arXiv.2412.05315

Explore these studies to deepen your understanding

Adjacent work that informs or extends this paper's methodology and findings.

Computer Science

Attention Is All You Need

A. Vaswani, N. Shazeer, et al.

Psychology

Not all mindfulness is equal: certain facets of mindfulness have important implications for well-being and mental health across the lifespan

N. J. Johnson, R. J. Smith, et al.

Food Science and Technology

Obesity, but not high-fat diet, is associated with bone loss that is reversed via CD4+CD25+Foxp3+ Tregs-mediated gut microbiome of non-obese mice

W. Song, Q. Sheng, et al.

Political Science

A rather wild imagination: who is and who is not a migrant in the Czech media and society?

M. G. Bartoszewicz and O. Eibl

Listen, Learn & Level Up

Over 10,000 hours of research content in 25+ fields, available in 22+ languages.

No more digging through PDFs, just hit play and absorb the world's latest research in your language, on your time.

listen to research audio papers with researchbunny