Computer ScienceJMIR Medical Informatics
Reliability of Supervised Machine Learning Using Synthetic Data in Health Care: Model to Preserve Privacy for Data Sharing
D. Rankin, M. Black, et al.
This study, conducted by Debbie Rankin, Michaela Black, Raymond Bond, Jonathan Wallace, Maurice Mulvenna, and Gorka Epelde, reveals insights about the performance of machine learning models trained on synthetic healthcare data. It shows that while synthetic data can be useful, real data still holds a significant edge in accuracy, particularly with tree-based models. The research underlines the balance between privacy and data utility.
Related Publications
Explore these studies to deepen your understanding
Adjacent work that informs or extends this paper's methodology and findings.
Computer Science
Using the interest theory of rights and Hohfeldian taxonomy to address a gap in machine learning methods for legal document analysis
A. Izzidien
Medicine and Health
Predictive model of castration resistance in advanced prostate cancer by machine learning using genetic and clinical data: KYUCOG-1401-A study
M. Shiota, S. Nemoto, et al.
Medicine and Health
Machine learning-based prediction of in-hospital death for patients with takotsubo syndrome: The InterTAK-ML model
O. D. Filippo, V. L. Cammann, et al.
Earth Sciences
A machine learning estimator trained on synthetic data for real-time earthquake ground-shaking predictions in Southern California
M. Monterrubio-velasco, S. Callaghan, et al.

