Medicine and HealthNature Communications
Data leakage inflates prediction performance in connectome-based machine learning models
M. Rosenblatt, L. Tejavibulya, et al.
This research by Matthew Rosenblatt, Link Tejavibulya, Rongtao Jiang, Stephanie Noble, and Dustin Scheinost delves into the critical issue of data leakage in neuroimaging predictive modeling. By examining five types of leakage across four datasets, the study unveils how feature selection and repeated subject leakage can dramatically skew prediction outcomes, particularly in smaller datasets. Discover the nuances of leakage's impact and its significance for achieving valid results!
Related Publications
Explore these studies to deepen your understanding
Adjacent work that informs or extends this paper's methodology and findings.
Medicine and Health
Network-based machine learning in colorectal and bladder organoid models predicts anti-cancer drug efficacy in patients
J. Kong, H. Lee, et al.
Education
Driving STEM learning effectiveness: dropout prediction and intervention in MOOCs based on one novel behavioral data analysis approach
X. Xia and W. Qi
Psychology
Building machine learning prediction models for well-being using predictors from the exposome and genome in a population cohort
D. H. M. Pelt, P. C. Habets, et al.
Medicine and Health
Improved metabolomic data-based prediction of depressive symptoms using nonlinear machine learning with feature selection
Y. Takahashi, M. Ueki, et al.

