Multilingual translation for zero-shot bio-medical classification using BioTranslator

Biology

H. Xu, A. Woicik, et al.

BioTranslator is a multilingual translation method developed by Hanwen Xu, Addie Woicik, Hoifung Poon, Russ B. Altman, and Sheng Wang. It lets scientists move beyond controlled vocabularies by translating text descriptions of new biological concepts into non-text biological data instances, supporting the identification of novel cell types, protein function prediction, and drug target identification.
Abstract
Existing annotation paradigms rely on controlled vocabularies, where each data instance is classified into one term from a predefined set of controlled vocabularies. This paradigm restricts the analysis to concepts that are known and well-characterized. Here, we present the novel multilingual translation method BioTranslator to address this problem. BioTranslator takes a user-written textual description of a new concept and then translates this description to a non-text biological data instance. The key idea of BioTranslator is to develop a multilingual translation framework, where multiple modalities of biological data are all translated to text. We demonstrate how BioTranslator enables the identification of novel cell types using only a textual description and how BioTranslator can be further generalized to protein function prediction and drug target identification. Our tool frees scientists from limiting their analyses within predefined controlled vocabularies, enabling them to interact with biological data using free text.
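The abstract does not spell out how a text description is matched to a data instance. One common way to picture zero-shot classification of this kind is a shared embedding space, where each free-text description and each data instance is represented as a vector and an instance is assigned to the most similar description. The sketch below illustrates only that general idea and is not the authors' implementation: the function names, the use of cosine similarity, and the toy vectors are all assumptions made for illustration.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def zero_shot_classify(instance_embedding, description_embeddings):
    """Pick the description whose embedding is closest to the instance.

    Both arguments are assumed to live in the same embedding space;
    how they get there (the encoders) is outside this sketch.
    """
    scores = {
        label: cosine(instance_embedding, emb)
        for label, emb in description_embeddings.items()
    }
    best = max(scores, key=scores.get)
    return best, scores


# Hand-made toy vectors standing in for encoder outputs (hypothetical).
description_embeddings = {
    "cell type A description": [0.9, 0.1, 0.0],
    "cell type B description": [0.1, 0.8, 0.3],
}
instance_embedding = [0.2, 0.7, 0.4]

label, scores = zero_shot_classify(instance_embedding, description_embeddings)
print(label)
```

Because the candidate labels are just text descriptions, a new concept can be added by supplying another description vector, with no retraining against a fixed vocabulary. That property is what makes the setting zero-shot.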
Publisher
Nature Communications
Published On
Feb 10, 2023
Authors
Hanwen Xu, Addie Woicik, Hoifung Poon, Russ B. Altman, Sheng Wang
Tags
BioTranslator
biological data
multilingual translation
novel concepts
protein function
drug target identification
free-text interaction