logo
ResearchBunny Logo
Deciphering microbial gene function using natural language processing

Biology

Deciphering microbial gene function using natural language processing

D. Miller, A. Stern, et al.

Discover the cutting-edge research by Danielle Miller, Adi Stern, and David Burstein, which utilizes deep learning techniques inspired by natural language processing to unveil the functions of uncharacterized microbial genes. Their innovative method achieved remarkable accuracy, particularly in identifying novel defense systems, and has opened new avenues in microbial interaction and defense research.... show more
Abstract
Revealing the function of uncharacterized genes is a fundamental challenge in an era of ever-increasing volumes of sequencing data. Here, we present a concept for tackling this challenge using deep learning methodologies adopted from natural language processing (NLP). We repurpose NLP algorithms to model "gene semantics" based on a biological corpus of more than 360 million microbial genes within their genomic context. We use the language models to predict functional categories for 56,617 genes and find that out of 1369 genes associated with recently discovered defense systems, 98% are inferred correctly. We then systematically evaluate the "discovery potential" of different functional categories, pinpointing those with the most genes yet to be characterized. Finally, we demonstrate our method's ability to discover systems associated with microbial interaction and defense. Our results highlight that combining microbial genomics and language models is a promising avenue for revealing gene functions in microbes.
Publisher
Nature Communications
Published On
Sep 29, 2022
Authors
Danielle Miller, Adi Stern, David Burstein
Tags
microbial genes
deep learning
natural language processing
gene embeddings
functional categories
defense systems
genomics
Listen, Learn & Level Up
Over 10,000 hours of research content in 25+ fields, available in 12+ languages.
No more digging through PDFs, just hit play and absorb the world's latest research in your language, on your time.
listen to research audio papers with researchbunny