logo
ResearchBunny Logo
An end-to-end deep learning framework for translating mass spectra to de-novo molecules

Chemistry

An end-to-end deep learning framework for translating mass spectra to de-novo molecules

E. E. Litsa, V. Chenthamarakshan, et al.

Discover how Eleni E. Litsa and her team have developed Spec2Mol, a groundbreaking deep learning model that decodes mass spectra into molecular structures. This innovative approach outperforms traditional methods, paving the way for identifying novel molecules and advancing chemical research.

00:00
00:00
~3 min • Beginner • English
Abstract
Elucidating the structure of a chemical compound is a fundamental task in chemistry with applications in multiple domains including drug discovery, precision medicine, and biomarker discovery. The common practice for elucidating the structure of a compound is to obtain a mass spectrum and subsequently retrieve its structure from spectral databases. However, these methods fail for novel molecules that are not present in the reference database. We propose Spec2Mol, a deep learning architecture for molecular structure recommendation given mass spectra alone. Spec2Mol is inspired by the Speech2Text deep learning architectures for translating audio signals into text. Our approach is based on an encoder-decoder architecture. The encoder learns the spectra embeddings, while the decoder, pre-trained on a massive dataset of chemical structures for translating between different molecular representations, reconstructs SMILES sequences of the recommended chemical structures. We have evaluated Spec2Mol by assessing the molecular similarity between the recommended structures and the original structure. Our analysis showed that Spec2Mol is able to identify the presence of key molecular substructures from its mass spectrum, and shows on par performance, when compared to existing fragmentation tree methods particularly when test structure information is not available during training or present in the reference database.
Publisher
Communications Chemistry
Published On
Jun 23, 2023
Authors
Eleni E. Litsa, Vijil Chenthamarakshan, Payel Das, Lydia E. Kavraki
Tags
mass spectra
molecular structure
deep learning
spectral databases
SMILES sequences
encoder-decoder architecture
chemical compounds
Listen, Learn & Level Up
Over 10,000 hours of research content in 25+ fields, available in 12+ languages.
No more digging through PDFs, just hit play and absorb the world's latest research in your language, on your time.
listen to research audio papers with researchbunny