Psychology

How young children integrate information sources to infer the meaning of words

M. Bohn, M. H. Tessler, et al.

Discover how children develop a rich vocabulary before school by integrating various information sources, as revealed by groundbreaking research from Manuel Bohn, Michael Henry Tessler, Megan Merrick, and Michael C. Frank. Their innovative Bayesian inference model sheds light on children's word learning and offers a fresh perspective on language acquisition.

00:00

Playback language: English

Index

Introduction

Humans possess unparalleled communicative abilities, with language forming the foundation of culture and society. A critical aspect of language learning is referent identification – determining the word-object relationship. This isn't achieved through a single cue but rather inferentially, by considering the speaker's intentions and the social context. Children utilize various mechanisms to process social-contextual information, including expecting speakers to use new words for unknown objects, to speak about relevant or novel objects, and to relate their words to the ongoing conversation. However, these mechanisms are often studied in isolation, ignoring the complex interplay of information during natural social interaction. This research aims to address the critical gap in understanding how children arbitrate between multiple sources of information to accurately infer speaker intent and successfully learn words. Existing research shows preschoolers use social-contextual information when learning words, but lacks a clear explanation of the *process* of integration. This study presents a theory of this integration process, proposing that it's a Bayesian inference system encompassing different information sources: expectations of cooperative and informative communication, shared common ground, and existing semantic knowledge. The study will use this model to predict and explain children's behavior in word-learning scenarios.

Literature Review

Early research on word learning focused on individual mechanisms children use to interpret and learn words from social-contextual information. Studies highlighted the use of multiple information sources like the speaker's perspective and semantics. While these studies demonstrated the use of multiple information sources, they lacked a detailed specification of the integration process. Existing social-pragmatic theories assume information integration as part of social inference, but without a clear definition of the process. This study bridges this gap by offering a quantitative theory of information integration.

Methodology

The study uses a rational-integration model based on Bayesian inference to describe the integration of three information sources: (1) expectations of cooperative communication; (2) shared common ground (conversational context); and (3) semantic knowledge. The model formalizes the computational steps of this social inference process, treating word learning as the outcome of a social inference process. To test the model's predictive and explanatory power, a word-learning experiment was conducted involving 368 children (2–5 years old) interacting with storybook speakers on a tablet computer. The experiment systematically manipulated the three information sources, creating scenarios where information sources were either aligned or in conflict. The experiment used storybook speakers displayed on a tablet to simulate social interactions, replicating previous findings using live interactions. Three experiments were conducted: Experiment 1 estimated children's sensitivity to informativeness and semantic knowledge; Experiment 2 estimated sensitivity to common ground; and Experiment 3 tested children's ability to integrate all three information sources in 24 unique conditions. The model's predictions were generated from the parameters derived from Experiments 1 and 2 (parameter-free a priori predictions), then compared to the data from Experiment 3. To evaluate the model's explanatory power, it was compared to three lesioned models (ignoring one of the three information sources) and two alternative integration models (biased-integration and developmental-bias). Generalized linear mixed models were used to analyze the effect of the manipulations on children's behavior, while Bayesian model comparison was used to evaluate the models' explanatory power.

Key Findings

The rational-integration model accurately predicted children's word-learning behavior across the entire age range (2–5 years), explaining 79% of the variance in Experiment 3 data. This supports the model's assumption that children integrate all three information sources. Model comparison revealed that the rational-integration model significantly outperformed the lesioned models (Bayes Factors ranging from 3.9 x 10⁹ to 4.8 x 10¹¹), indicating that children did not ignore any information sources. Comparison with the biased-integration model, which assumes that children weight different inferences separately, showed that the rational-integration model provided a significantly better fit to the data (BF₁₀ = 2.1 × 10¹⁰). Similarly, the developmental-bias model, which assumes the weighting changes with age, was also outperformed by the rational-integration model (BF₁₀ = 1.4 × 10⁶). These findings strongly suggest that children employ a fully integrated, rather than a modular, process for integrating information sources during word learning. The rational-integration model showed a better fit to the data in both congruent (all sources point to the same referent) and incongruent (sources point to different referents) conditions, particularly evident in younger children. Overall, the rational-integration model explained 87% of the variance in the data when parameters were constrained by all three experiments, emphasizing its superior performance over other models.

Discussion

This research demonstrates that young children, even from early in development, successfully integrate multiple information sources during language learning. The findings support a rational-integration model based on Bayesian inference, suggesting that these sources contribute to a single integrated social inference process. This contrasts with the ‘bag of tricks’ view, which proposes independent, separately developing mechanisms. The study's model provides an explicit computational description of this inference process and its development, suggesting that increased sensitivity to individual information sources, rather than changes in the integration process itself, drives developmental changes in word learning. The model's success in predicting and explaining behavior in both congruent and incongruent conditions highlights the importance of considering the interplay of various information sources in word learning. The consistent application of Bayesian inference across different ages suggests a developmental continuity in how information is integrated, even though children's sensitivity to each information source changes. The general framework underlying this model, applicable to adult language comprehension, suggests the integration of pragmatic inference is deeply rooted in social cognition.

Conclusion

This study provides strong evidence for a rational-integration model of word learning in young children, demonstrating the ability to integrate multiple information sources using Bayesian inference. The model's success in prediction and explanation surpasses simpler models, indicating a unified, rather than modular, integration process that remains consistent across development. Future research should explore cross-cultural variations in sensitivity to information sources while maintaining the core integration mechanism. Further investigation into the computational processes underlying common ground reasoning and the integration of multi-dimensional processes is also needed. Extending the model to incorporate non-verbal cues will further enhance our understanding of the complexity of word learning.

Limitations

The study's participants were primarily from a diverse, high socioeconomic status population in San José, California, potentially limiting the generalizability of the findings to other populations. The use of tablet-based storybooks, while replicating previous findings with live interactions, may not fully capture the dynamics of real-life social interactions. The model simplifies some aspects of word learning, such as the unidimensional representation of complex processes like common ground and speaker informativeness. These simplifications, while allowing for computational tractability, may not fully capture the nuances of these processes.

Related Publications

Explore these studies to deepen your understanding of the subject.

Linguistics and Languages

Sound-meaning associations allow listeners to infer the meaning of foreign language words

S. Hayakawa and V. Marian

Medicine and Health

Assessment of COVID-19 as the underlying cause of death among children and young people aged 0 to 19 years in the US

F. S, W. C, et al.

Social Work

The fingerprints of misinformation: how deceptive content differs from reliable sources in terms of cognitive effort and appeal to emotions

C. Carrasco-farré

Business

How to really quantify the economic value of customer information in corporate databases

C. Lamela-orcasitas and J. García-madariaga

Listen, Learn & Level Up

Over 10,000 hours of research content in 25+ fields, available in 12+ languages.

No more digging through PDFs, just hit play and absorb the world's latest research in your language, on your time.

listen to research audio papers with researchbunny