Computer ScienceFindings of the Association for Computational Linguistics: NAACL 2025

Target-Augmented Shared Fusion-based Multimodal Sarcasm Explanation Generation

P. Goel, D. S. Chauhan, et al.

Sarcasm often hides its target — and TURBO uncovers it. This study introduces TURBO, a Target-aUgmented shaRed fusion-Based sarcasm explanation model that leverages a novel shared-fusion mechanism to fuse image and caption and explicitly models the sarcasm target to generate clearer explanations. Evaluated on MORE+, TURBO outperforms baselines by an average +3.3%, and human assessments find its explanations superior; the authors also examine LLMs in zero/one-shot settings and note their limitations. This research was conducted by Palaash Goel, Dushyant Singh Chauhan, and Md Shad Akhtar.... show more

General Summary Metrics

Abstract

Sarcasm is a linguistic phenomenon that intends to ridicule a target (e.g., entity, event, or person) in an inherent way. Multimodal Sarcasm Explanation (MuSE) aims at revealing the intended irony in a sarcastic post using a natural language explanation. Though important, existing systems overlooked the significance of the target of sarcasm in generating explanations. In this paper, we propose a Target-aUgmented shaRed fusion-Based sarcasm explanation model, aka. TURBO. We design a novel shared-fusion mechanism to leverage the inter-modality relationships between an image and its caption. TURBO assumes the target of the sarcasm and guides the multimodal shared fusion mechanism in learning intricacies of the intended irony for explanations. We evaluate our proposed TURBO model on the MORE+ dataset. Comparison against multiple baselines and state-of-the-art models signifies the performance improvement of TURBO by an average margin of +3.3%. Moreover, we explore LLMs in zero and one-shot settings for our task and observe that LLM-generated explanation, though remarkable, often fails to capture the critical nuances of the sarcasm. Furthermore, we supplement our study with extensive human evaluation on TURBO's generated explanations and find them out to be comparatively better than other systems.

Publisher

Findings of the Association for Computational Linguistics: NAACL 2025

Published On

Authors

Palaash Goel, Dushyant Singh Chauhan, Md Shad Akhtar

DOI

https://doi.org/10.48550/arXiv.2502.07391

Explore these studies to deepen your understanding

Adjacent work that informs or extends this paper's methodology and findings.

Computer Science

Fusion-based quantum computation

S. Bartolucci, P. Birchall, et al.

Engineering and Technology

A neuromorphic physiological signal processing system based on VO₂ memristor for next-generation human-machine interface

R. Yuan, P. J. Tiw, et al.

Psychology

Effectiveness of Augmented and Virtual Reality-Based Interventions in Improving Knowledge, Attitudes, Empathy and Stigma Regarding People with Mental Illnesses-A Scoping Review

T. J.l., X. H., et al.

Engineering and Technology

Quantum random number generation based on a perovskite light emitting diode

J. Argillander, A. Alarcón, et al.

Listen, Learn & Level Up

Over 10,000 hours of research content in 25+ fields, available in 22+ languages.

No more digging through PDFs, just hit play and absorb the world's latest research in your language, on your time.

listen to research audio papers with researchbunny