logo
ResearchBunny Logo
Deepfake audio as a data augmentation technique for training automatic speech to text transcription models

Computer Science

Deepfake audio as a data augmentation technique for training automatic speech to text transcription models

A. R. Ferreira and C. E. C. Campelo

Discover a novel data augmentation framework using deepfake audio to enhance automatic speech-to-text models, particularly for less-popular languages. This groundbreaking research, conducted by Alexandre R Ferreira and Cláudio E C Campelo, reveals the impact of deepfake audio quality on model performance, paving the way for improved ASR systems.

00:00
00:00
Playback language: English
Citation Metrics
Citations
0
Influential Citations
0
Reference Count
0

Note: The citation metrics presented here have been sourced from Semantic Scholar and OpenAlex.

Listen, Learn & Level Up
Over 10,000 hours of research content in 25+ fields, available in 12+ languages.
No more digging through PDFs, just hit play and absorb the world's latest research in your language, on your time.
listen to research audio papers with researchbunny