logo
ResearchBunny Logo
Deciphering microbial gene function using natural language processing

Biology

Deciphering microbial gene function using natural language processing

D. Miller, A. Stern, et al.

Discover the cutting-edge research by Danielle Miller, Adi Stern, and David Burstein, which utilizes deep learning techniques inspired by natural language processing to unveil the functions of uncharacterized microbial genes. Their innovative method achieved remarkable accuracy, particularly in identifying novel defense systems, and has opened new avenues in microbial interaction and defense research.

00:00
00:00
Playback language: English
Abstract
This paper presents a novel approach for identifying the function of uncharacterized microbial genes using deep learning techniques adapted from natural language processing (NLP). The researchers created a biological corpus of over 360 million microbial genes, modeling gene families as 'words' and genomic regions as 'sentences'. Using this corpus, they trained a word2vec model to generate gene embeddings, which were then used to predict functional categories for 56,617 genes. The method demonstrated high accuracy, particularly for recently discovered defense systems (98% accuracy). Furthermore, the researchers identified functional categories with high 'discovery potential' and showcased its application in discovering systems associated with microbial interaction and defense, highlighting the potential of this combined genomics-NLP approach.
Publisher
Nature Communications
Published On
Sep 29, 2022
Authors
Danielle Miller, Adi Stern, David Burstein
Tags
microbial genes
deep learning
natural language processing
gene embeddings
functional categories
defense systems
genomics
Listen, Learn & Level Up
Over 10,000 hours of research content in 25+ fields, available in 12+ languages.
No more digging through PDFs, just hit play and absorb the world's latest research in your language, on your time.
listen to research audio papers with researchbunny