logo
ResearchBunny Logo
Open source and reproducible and inexpensive infrastructure for data challenges and education

Medicine and Health

Open source and reproducible and inexpensive infrastructure for data challenges and education

P. E. Dewitt, M. A. Rebull, et al.

Unlock the potential of research data sharing with a groundbreaking study by Peter E. DeWitt, Margaret A. Rebull, and Tellen D. Bennett. Discover how a cost-effective, reproducible workflow was created using GitHub, open-source languages, and Docker to democratize data challenges in pediatric traumatic brain injury. Take a look at the results of their innovative approach!

00:00
00:00
~3 min • Beginner • English
Abstract
Data sharing is necessary to maximize the actionable knowledge generated from research data. Data challenges can encourage secondary analyses of datasets. Data challenges in biomedicine often rely on advanced cloud-based computing infrastructure and expensive industry partnerships. Examples include challenges that use Google Cloud virtual machines and the Sage Bionetworks Dream Challenges platform. Such robust infrastructures can be financially prohibitive for investigators without substantial resources. Given the potential to develop scientific and clinical knowledge and the NIH emphasis on data sharing and reuse, there is a need for inexpensive and computationally lightweight methods for data sharing and hosting data challenges. To fill that gap, we developed a workflow that allows for reproducible model training, testing, and evaluation. We leveraged public GitHub repositories, open-source computational languages, and Docker technology. In addition, we conducted a data challenge using the infrastructure we developed. In this manuscript, we report on the infrastructure, workflow, and data challenge results. The infrastructure and workflow are likely to be useful for data challenges and education.
Publisher
Scientific Data
Published On
Jan 02, 2024
Authors
Peter E. DeWitt, Margaret A. Rebull, Tellen D. Bennett
Tags
data sharing
reproducible workflow
GitHub
Docker
pediatric traumatic brain injury
Listen, Learn & Level Up
Over 10,000 hours of research content in 25+ fields, available in 12+ languages.
No more digging through PDFs, just hit play and absorb the world's latest research in your language, on your time.
listen to research audio papers with researchbunny