logo
ResearchBunny Logo
Semantic noise in the Winograd Schema Challenge of pronoun disambiguation

Linguistics and Languages

Semantic noise in the Winograd Schema Challenge of pronoun disambiguation

S. D. Jager

This intriguing paper by S. de Jager reveals that pronoun disambiguation within Winograd Schemas is not as effortlessly executed by humans as previously assumed. By uncovering the concept of semantic noise, it highlights the pitfalls of oversimplification in NLP, shedding light on how our understanding of commonsense knowledge may be more complex than we think.... show more
Abstract
The Winograd Schema Challenge (WSC) of pronoun disambiguation is a Natural Language Processing (NLP) task designed to test to what extent the reading comprehension capabilities of language models (LMs) can be compared to those of human subjects. It is generally assumed across the NLP literature that human subjects are capable of resolving this task because of their acquired commonsense knowledge, thus setting a commonsense benchmark for LMs, one which has even been proposed as an alternative to the Turing test. In the context of complex natural language communications, Shannon and Weaver observed that the act of semantic interpretation is subject to semantic noise (Shannon and Weaver, 1964 (1949)). Semantic noise is a constraint that ensues from terms exhibiting variable interpretations across contexts, presenting a challenge to the resolution of tasks such as the WSC. However, the main argument of this paper is that rather than seeing semantic noise as a challenge to otherwise unambiguous communication, it can also be understood as a functional quality of natural language, given that it results in the conceptual negotiation of terms. Failing to theoretically attend to this linguistic matter of fact leads to unintended problems in instances where NLP applications are offered as unbiased or objectively applicable solutions. To address this, this article offers a renewed and original analysis of a series of Winograd Schemas, in order to demonstrate how they are not as straightforwardly solvable by human subjects as is commonly claimed across the NLP literature. The methodology employed is that of historical contextualisation in information theory, and qualitative cultural analysis drawing on examples from a wide variety of recent NLP literature.
Publisher
Humanities and Social Sciences Communications
Published On
Apr 11, 2023
Authors
S. de Jager
Tags
Natural Language Processing
Winograd Schemas
pronoun disambiguation
semantic noise
information theory
commonsense knowledge
NLP applications
Listen, Learn & Level Up
Over 10,000 hours of research content in 25+ fields, available in 12+ languages.
No more digging through PDFs, just hit play and absorb the world's latest research in your language, on your time.
listen to research audio papers with researchbunny