Targeted high-throughput DNA sequencing is crucial for genomics and molecular diagnostics. Oligonucleotide probes used for enrichment exhibit varying hybridization kinetics, leading to non-uniform coverage. This paper introduces a deep learning model (DLM) to predict Next-Generation Sequencing (NGS) depth from DNA probe sequences. The DLM, incorporating a bidirectional recurrent neural network, uses DNA nucleotide identities and unpaired nucleotide probabilities. Evaluated on three NGS panels (SNP, lncRNA, and synthetic), the DLM accurately predicts sequencing depth, showing high accuracy in cross-validation and independent testing. The model also effectively predicts DNA hybridization and strand displacement kinetic rate constants.
Publisher
Nature Communications
Published On
Jul 19, 2021
Authors
Jinny X. Zhang, Boyan Yordanov, Alexander Gaunt, Michael X. Wang, Peng Dai, Yuan-Jyue Chen, Kerou Zhang, John Z. Fang, Neil Dalchau, Jiaming Li, Andrew Phillips, David Yu Zhang
Tags
DNA sequencing
deep learning
hybridization kinetics
next-generation sequencing
genomics
molecular diagnostics
Related Publications
Explore these studies to deepen your understanding of the subject.