logo
ResearchBunny Logo
Abstract
Data provides a foundation for machine learning, which has accelerated data-driven materials design. The scientific literature contains a large amount of high-quality, reliable data, and automatically extracting data from the literature continues to be a challenge. We propose a natural language processing pipeline to capture both chemical composition and property data that allows analysis and prediction of superalloys. Within 3 h, 2531 records with both composition and property are extracted from 14,425 articles, covering γ solvus temperature, density, solidus, and liquidus temperatures. A data-driven model for γ solvus temperature is built to predict unexplored Co-based superalloys with high γ solvus temperatures within a relative error of 0.81%. We test the predictions via synthesis and characterization of three alloys. A web-based toolkit as an online open-source platform is provided and expected to serve as the basis for a general method to search for targeted materials using data extracted from the literature.
Publisher
npj Computational Materials
Published On
Jan 19, 2022
Authors
Weiren Wang, Xue Jiang, Shaohan Tian, Pei Liu, Depeng Dang, Yanjing Su, Turab Lookman, Jianxin Xie
Tags
Machine Learning
Data Extraction
Superalloys
Natural Language Processing
Predictive Modeling
Material Design
Listen, Learn & Level Up
Over 10,000 hours of research content in 25+ fields, available in 12+ languages.
No more digging through PDFs—just hit play and absorb the world's latest research in your language, on your time.
listen to research audio papers with researchbunny