logo
ResearchBunny Logo
Measuring linguistic complexity in Chinese: An information-theoretic approach

Linguistics and Languages

Measuring linguistic complexity in Chinese: An information-theoretic approach

X. Liu, F. Li, et al.

Explore the groundbreaking study by Xun Liu, Feng Li, and Wei Xiao, which applies an information-theoretic approach using Kolmogorov complexity to analyze Chinese linguistic complexity. With a corpus of 60 million characters, their research reveals significant correlations and insights into morpheme richness and topic prominence. Discover how this approach compares to nine European languages and sheds light on the proficiency of Chinese L1/L2 speakers!

00:00
00:00
Playback language: English
Abstract
This study applies an information-theoretic approach, Kolmogorov complexity, to measure Chinese linguistic complexity. A corpus of approximately 60 million characters was used to calculate morphological, syntactical, and overall Kolmogorov complexity metrics, along with 18 other existing metrics. Results show significant correlations between the Kolmogorov metrics and the existing metrics, indicating reliability. Comparisons with nine European languages and Chinese L1/L2 speakers of varying proficiencies demonstrate the validity of the Kolmogorov approach in capturing key linguistic features of Chinese, such as morpheme richness and topic prominence.
Publisher
Humanities and Social Sciences Communications
Published On
Jul 30, 2024
Authors
Xun Liu, Feng Li, Wei Xiao
Tags
Kolmogorov complexity
Chinese linguistics
morphological metrics
syntactical metrics
language comparison
language proficiency
linguistic features
Listen, Learn & Level Up
Over 10,000 hours of research content in 25+ fields, available in 12+ languages.
No more digging through PDFs, just hit play and absorb the world's latest research in your language, on your time.
listen to research audio papers with researchbunny