logo
ResearchBunny Logo
Deep learning of a bacterial and archaeal universal language of life enables transfer learning and illuminates microbial dark matter

Biology

Deep learning of a bacterial and archaeal universal language of life enables transfer learning and illuminates microbial dark matter

A. Hoarfrost, A. Aptekmann, et al.

Discover the groundbreaking work of A. Hoarfrost, A. Aptekmann, G. Farfañuk, and Y. Bromberg as they unveil LookingGlass, a deep learning model that provides crucial insights into uncultured microbial genomes. This innovative approach identifies novel enzymes and predicts their optimal conditions, illuminating the mysteries of microbial dark matter.

00:00
00:00
~3 min • Beginner • English
Abstract
The majority of microbial genomes have yet to be cultured, and most proteins identified in microbial genomes or environmental sequences cannot be functionally annotated. As a result, current computational approaches to describe microbial systems rely on incomplete reference databases that cannot adequately capture the functional diversity of the microbial tree of life, limiting our ability to model high-level features of biological sequences. Here we present LookingGlass, a deep learning model encoding contextually-aware, functionally and evolutionarily relevant representations of short DNA reads, that distinguishes reads of disparate function, homology, and environmental origin. We demonstrate the ability of LookingGlass to be fine-tuned via transfer learning to perform a range of diverse tasks: to identify novel oxidoreductases, to predict enzyme optimal temperature, and to recognize the reading frames of DNA sequence fragments. LookingGlass enables functionally relevant representations of otherwise unknown and unannotated sequences, shedding light on the microbial dark matter that dominates life on Earth.
Publisher
Nature Communications
Published On
Nov 16, 2022
Authors
A. Hoarfrost, A. Aptekmann, G. Farfañuk, Y. Bromberg
Tags
deep learning
microbial genomes
oxidoreductases
transfer learning
functional annotations
DNA sequences
enzyme prediction
Listen, Learn & Level Up
Over 10,000 hours of research content in 25+ fields, available in 12+ languages.
No more digging through PDFs, just hit play and absorb the world's latest research in your language, on your time.
listen to research audio papers with researchbunny