logo
ResearchBunny Logo
Leveraging Language Representation for Materials Exploration and Discovery

Engineering and Technology

Leveraging Language Representation for Materials Exploration and Discovery

J. Qu, Y. R. Xie, et al.

Discover a groundbreaking materials discovery framework that employs natural language embeddings from language models to enhance the exploration of crystal structures. This innovative approach allows for similarity analysis and multi-task learning, paving the way for diverse prototype structures in thermoelectrics. This exciting research was conducted by Jiaxing Qu, Yuxuan Richard Xie, Kamil M. Ciesielski, Claire E. Porter, Eric S. Toberer, and Elif Ertekin.

00:00
00:00
~3 min • Beginner • English
Abstract
Data-driven approaches to materials exploration and discovery are building momentum due to emerging advances in machine learning. However, parsimonious representations of crystals for navigating the vast materials search space remain limited. To address this limitation, we introduce a materials discovery framework that utilizes natural language embeddings from language models as representations of compositional and structural features. The contextual knowledge encoded in these language representations conveys information about material properties and structures, enabling both similarity analysis to recall relevant candidates based on a query material and multi-task learning to share information across related properties. Applying this framework to thermoelectrics, we demonstrate diversified recommendations of prototype crystal structures and identify under-studied material spaces. Validation through first-principles calculations and experiments confirms the promise of the framework for materials design as high-performance thermoelectrics. Language-based representations offer versatile and adaptable approaches to materials exploration.
Publisher
Nature Materials
Published On
Mar 21, 2024
Authors
Jiaxing Qu, Yuxuan Richard Xie, Kamil M. Ciesielski, Claire E. Porter, Eric S. Toberer, Elif Ertekin
Tags
materials discovery
natural language embeddings
crystal structures
thermoelectrics
property prediction
similarity analysis
first-principles calculations
Listen, Learn & Level Up
Over 10,000 hours of research content in 25+ fields, available in 12+ languages.
No more digging through PDFs, just hit play and absorb the world's latest research in your language, on your time.
listen to research audio papers with researchbunny