Mathematics
Elitism in mathematics and inequality
H. H. Chang and F. Fu
The study investigates whether the Fields Medal fulfills its original aim to elevate underrepresented mathematicians or predominantly rewards an entrenched elite. It situates mathematics within sociological frameworks of elitism and bias (e.g., Matthew and Matilda effects), and examines how prize-giving shapes the definition of “elite.” Using the Mathematics Genealogy Project (MGP), the authors explore mentorship networks surrounding Fields Medalists and how lingo-ethnic identity and national contexts influence access to elite recognition. The purpose is to characterize structural and relational mechanisms of elitism beyond citation practices and to evaluate pluralism and mobility within the elite circles of mathematics.
Foundational sociological work (Merton; de Solla Price) described “invisible colleges” and biases in science, including Matthew and Matilda effects (Rossiter). Zuckerman’s analysis of Nobel laureates introduced “accumulation of advantage,” showing early-life and institutional factors that reinforce elite status. Network-science approaches have since analyzed collaboration networks, hiring practices, and departmental prestige (e.g., Zeng et al.; Clauset et al.; Myers et al.). Within mathematics, Rossi et al. proposed a genealogy index linking mentorship and success; Gargiulo et al. enriched MGP to study historical evolution by country and discipline; Malmgren et al. examined mentorship effects on protégés’ performance. Prize-focused studies (Ma & Uzzi; Wagner et al.) tied network structures to multiple prize winners, but typically via co-authorship networks and without addressing ethnicity or gender. The present work integrates genealogical networks with lingo-ethnic inference to address identity-linked access to elite status in mathematics.
Data and graph construction: The authors use the Mathematics Genealogy Project (MGP) database (≈244,792 mathematicians; 264,263 advisor–advisee edges). Nodes are mathematicians; directed edges point from advisor to advisee. Fields Medalists are identified, and pairwise shortest paths between medalists are computed. The elite subgroup is defined as the union of these shortest paths, yielding a fully connected, minimal graph connecting all medalists (Elite nodes: 1,804; Elite edges: 2,419; Fields Medalists: 60). Analyses performed in NetworkX. Attribute data include PhD year, institution, and country of degree (mostly complete for elite; missing values hand-validated).
Identity classification: Because MGP lacks identity metadata, the authors infer lingo-ethnic identity using ethnicolr, an LSTM-based name classifier trained on Wikipedia and U.S. Census data (reported accuracy ~78–81% across 12 categories). The Wikipedia-trained model was chosen for broader international coverage. A “power ratio” (PR) quantifies over- or under-representation by identity at top institutions: PR = P(Fields Medalist ∩ Identity at institution) / P(Fields Medalist). Top institutions are the 50 most prominent within the elite community.
Flow analysis: To study interactions between groups, mathematicians are aggregated into meso-categories by country (of doctoral degree) and by inferred lingo-ethnic identity. Edges between meso-categories inherit directionality from advisor–advisee links; edge weights count the number of such relations. For countries, advisor movement implies migration if the advisor’s PhD country differs from their advisee’s PhD country; for identities, flows capture mentorship across identity categories.
Ternary diagram construction: From the meso-network adjacency matrix M, self-flow SF_i = M_ii; in-flow IF_i = sum over j≠i of M_ij; out-flow OF_i = sum over j≠i of M_ji. Values are normalized (IF_i + OF_i + SF_i = 1) and represented as points on a simplex, then mapped to 2D via planar rotations for visualization. These metrics summarize each meso-category’s mentoring dynamics (propensity to mentor within identity, to mentor others, or to receive mentoring from others).
- Scale and elite extraction: From 244,792 mathematicians and 264,263 edges, the elite network (union of shortest paths among 60 Fields Medalists) contains 1,804 mathematicians and 2,419 edges.
- Historical migration patterns: Pre-WWII elite concentrations in France and Germany; Japanese elites trained in Germany during Meiji modernization and returned to Japan. The Holocaust precipitated major outflows from Germany to the U.S. and Europe by 1932. Large outflows from Russia after the Cold War; post-WWII Italian diaspora.
- Net flow among key countries: France is a major net exporter of elite mathematicians to the U.S.; the USA–Germany chord indicates net U.S. export to Germany; generally, the U.S. exports more to most countries but imports a greater share of elites compared with the general population.
- Flow structure at the elite level vs general case: At the elite level, the U.S. shows high self-flow and in-flow (contrasting with high self-flow and out-flow in the general MGP population). Many countries are net importers of elites; traditional strongholds (Western Europe, former Soviet Union) display high self-flow and export.
- Pluralism trends: Germany maintains consistently high elite pluralism except during WWII; Japan shows recent increases in non-Japanese elites; U.S. pluralism and elite volume increase over time; Russian elite presence declines post-Soviet dissolution. Pluralism measures are lower bounds due to name ambiguity (e.g., Anglophone names spanning multiple ethnicities).
- Representation by identity (all vs elite community vs medalists): Overrepresented at elite/medalist levels: British/Anglo, French, Japanese, East European, Nordic. Underrepresented: East Asian and Germanic; Arabic names are virtually absent among medalists and underrepresented in the elite. Examples: French medalists ≈14% vs ≈8% in general; East Asian ≈14% in general but ≈5% among medalist community and ≈5% among medalists.
- Identity-based flow dynamics: Japanese identities show the strongest self-flow at elite levels (contrasting with non-self-flowing pattern when considering all mathematicians), suggesting elite-level reinforcement. Aggregated European identities exhibit high self-reinforcing behavior, whereas Asian and African/Arabic identities show fewer self-loops, countering assumptions that minority groups cluster via homophily.
- Elite genealogy concentration: All 60 Fields Medalists trace to 9 connected advisor lineages; the largest component contains 44/60 medalists, rooted in lineages from Leibniz and d’Alembert (including figures like Poisson, Hilbert, Schwartz). Within five generations after Laurent Schwartz, seven Fields Medalists appear (e.g., the Schwartz–Grothendieck–Deligne lineage).
- Example inferred country–identity pairings among elites since 1990 (sample): United States–Anglo (85), United States–East Asian (45), United Kingdom–Anglo (32), United States–East European (32), France–French (30), Russia–East European (19), United States–Indian (18), United States–Italian (18).
- Case study—Japan: Post-WWII reintegration efforts via ICM/IMU and inclusive policies correspond with increased density of elite mathematicians in Japan; Japan has produced three Fields Medalists and exhibits strong elite-level self-reinforcement.
The findings reveal that elite recognition in mathematics is shaped by relational structures in mentorship genealogies and by prize-giving practices. While the Fields Medal can reinforce existing elite lineages—evidenced by the concentration of medalists in a few genealogical components and strong self-flow among traditional strongholds—it has also contributed to integrating marginalized national communities, exemplified by Japan’s post-WWII rise within elite networks. Pluralism has increased in major countries, particularly the U.S. and Germany, yet persistent underrepresentation of Arabic, African, and East Asian identities at the medalist and elite levels indicates barriers remain. The U.S. simultaneously exports broadly and disproportionately attracts elites, reflecting its institutional pull at the highest levels. Identity-based flow patterns suggest that elite reinforcement (self-flow) is not uniformly driven by homophily across minority groups; rather, structural opportunities and institutional histories determine whether identities self-reinforce at elite tiers. Overall, the evidence supports a nuanced view: prize committees and international bodies can reshape definitions of elite inclusion, but without deliberate attention, entrenched genealogical and institutional advantages perpetuate inequality.
This study integrates network analysis of the Mathematics Genealogy Project with neural name-based identity inference to characterize elitism in mathematics around the Fields Medal. It documents both self-reinforcing elite structures (large genealogical components producing many medalists) and instances where prize-giving and international coordination aided integration (notably Japan after WWII). Despite increased pluralism in some countries, Arabic, African, and East Asian identities remain underrepresented at elite and medalist levels. The approach demonstrates how academic genealogy can serve as a diagnostic for equity and systemic bias, informing committees and institutions aiming to broaden access to elite recognition. Future work could refine identity inference with richer, consent-based metadata, extend analyses to other disciplines and prizes, and model policy interventions (e.g., committee practices, mobility programs) to evaluate their effects on pluralism and elite accessibility.
Identity labels are inferred from names via an LSTM classifier (≈78–81% accuracy across 12 categories), risking misclassification, especially across culturally overlapping naming conventions; this yields conservative pluralism estimates (e.g., Anglophone names spanning diverse ethnicities). MGP coverage may carry Western biases, affecting baseline representation. The elite definition is social (shortest-path union connecting Fields Medalists) rather than performance-based, which may omit influential non-medalist elites. Some attribute data required manual validation, and country-level flow interpretation conflates advisor and student migration. Visualizations (e.g., ternary diagrams) summarize complex dynamics and may obscure temporal heterogeneity.
Related Publications
Explore these studies to deepen your understanding of the subject.

