Computer ScienceJMIR Medical Informatics
Reliability of Supervised Machine Learning Using Synthetic Data in Health Care: Model to Preserve Privacy for Data Sharing
D. Rankin, M. Black, et al.
This study, conducted by Debbie Rankin, Michaela Black, Raymond Bond, Jonathan Wallace, Maurice Mulvenna, and Gorka Epelde, reveals insights about the performance of machine learning models trained on synthetic healthcare data. It shows that while synthetic data can be useful, real data still holds a significant edge in accuracy, particularly with tree-based models. The research underlines the balance between privacy and data utility.
Related Publications
Explore these studies to deepen your understanding
Adjacent work that informs or extends this paper's methodology and findings.
Medicine and Health
Predictive model of castration resistance in advanced prostate cancer by machine learning using genetic and clinical data: KYUCOG-1401-A study
M. Shiota, S. Nemoto, et al.
Computer Science
Using the interest theory of rights and Hohfeldian taxonomy to address a gap in machine learning methods for legal document analysis
A. Izzidien
Medicine and Health
Short-term local predictions of COVID-19 in the United Kingdom using dynamic supervised machine learning algorithms
X. Wang, Y. Dong, et al.
Engineering and Technology
Improved Fault Classification and Localization in Power Transmission Networks Using VAE-Generated Synthetic Data and Machine Learning Algorithms
M. A. Khan, B. Asad, et al.

