
Computer Science
Deepfake audio as a data augmentation technique for training automatic speech to text transcription models
A. R. Ferreira and C. E. C. Campelo
This paper proposes a data augmentation framework that uses deepfake audio to train automatic speech-to-text transcription models, aimed particularly at less-popular languages. Alexandre R. Ferreira and Cláudio E. C. Campelo examine how the quality of the deepfake audio affects model performance, with a view to improving ASR systems.