Shared functional specialization in transformer-based language models and the human brain

S. Kumar, T. R. Sumers, et al.

Discover how transformer-based language models such as BERT align with human brain activity during language processing. This research by Sreejan Kumar and colleagues reveals significant correlations between the models' internal computations and activity in specific brain regions, suggesting shared computational principles that bridge machine learning and neuroscience.

Abstract
When processing language, the brain is thought to deploy specialized computations to construct meaning from complex linguistic structures. Recently, artificial neural networks based on the Transformer architecture have revolutionized the field of natural language processing. Transformers integrate contextual information across words via structured circuit computations. Prior work has focused on the internal representations ("embeddings") generated by these circuits. In this paper, we instead analyze the circuit computations directly: we deconstruct these computations into the functionally-specialized "transformations" that integrate contextual information across words. Using functional MRI data acquired while participants listened to naturalistic stories, we first verify that the transformations account for considerable variance in brain activity across the cortical language network. We then demonstrate that the emergent computations performed by individual, functionally-specialized "attention heads" differentially predict brain activity in specific cortical regions. These heads fall along gradients corresponding to different layers and context lengths in a low-dimensional cortical space.
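To make the workflow described in the abstract concrete, here is a minimal sketch, not the authors' released code: it captures the per-head attention outputs ("transformations") from BERT using forward hooks in the HuggingFace transformers library, then fits a ridge-regression encoding model to predict brain responses. The fMRI array (`bold_responses`), the example sentence, the train/test split, and all parameter values are hypothetical placeholders chosen only to keep the example runnable.

```python
# Minimal sketch of an encoding-model workflow in the spirit of the paper.
# The fMRI data below are random placeholders; real analyses would align
# per-word features to BOLD time series and cross-validate over stories.
import numpy as np
import torch
from transformers import BertTokenizer, BertModel
from sklearn.linear_model import Ridge
from sklearn.model_selection import train_test_split

tokenizer = BertTokenizer.from_pretrained("bert-base-uncased")
model = BertModel.from_pretrained("bert-base-uncased")
model.eval()

# Capture the output of each layer's self-attention block: the concatenation
# of attention-weighted value vectors across heads (the per-head
# "transformations" before the output projection).
captured = {}
def make_hook(layer_idx):
    def hook(module, inputs, outputs):
        captured[layer_idx] = outputs[0].detach()  # (batch, seq, hidden)
    return hook

for l, layer in enumerate(model.encoder.layer):
    layer.attention.self.register_forward_hook(make_hook(l))

text = "After a while he remembered the story his grandmother used to tell."
inputs = tokenizer(text, return_tensors="pt")
with torch.no_grad():
    model(**inputs)

# Stack features for every token: (seq_len, n_layers * hidden_size).
features = torch.cat(
    [captured[l].squeeze(0) for l in sorted(captured)], dim=-1
).numpy()

# Hypothetical fMRI responses (e.g., voxels or parcels) per token, random
# here just so the sketch runs end to end.
n_voxels = 100
bold_responses = np.random.randn(features.shape[0], n_voxels)

X_train, X_test, y_train, y_test = train_test_split(
    features, bold_responses, test_size=0.2, random_state=0
)
encoder = Ridge(alpha=10.0).fit(X_train, y_train)
print("Encoding-model R^2 (toy data):", encoder.score(X_test, y_test))
```

Splitting the captured feature matrix by head (each layer's hidden dimension is the concatenation of its attention heads) and fitting a separate encoding model per head is, in spirit, how head-wise predictions for specific cortical regions could be compared.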
Publisher
Nature Communications
Published On
Jun 29, 2024
Authors
Sreejan Kumar, Theodore R. Sumers, Takateru Yamakoshi, Ariel Goldstein, Uri Hasson, Kenneth A. Norman, Thomas L. Griffiths, Robert D. Hawkins, Samuel A. Nastase
Tags
transformer models
BERT
brain activity
language processing
computational principles
attention heads
cortical language network