ROBUST UZBEK ASR AND TTS FOR DIALECTAL AND NOISY SETTINGS
Keywords:
Uzbek ASR, TTS, dialect robustness, self-supervised pretraining, code-switching, noise augmentation, multi-script normalization, test-time adaptation.Abstract
In this article we present a unified recipe for robust Automatic Speech Recognition (ASR) and Text-to-Speech (TTS) for Uzbek under dialectal variation, code-switching, and real-world noise. We combine self-supervised pretraining, dialect-aware lexicons, multi-script text normalization, targeted augmentation, and test-time adaptation. On simulated and field recordings, the ASR reduces WER by large margins; the TTS maintains naturalness and intelligibility across accents and SNRs.
Downloads
Published
2025-09-23
Issue
Section
Articles
License

This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.
How to Cite
ROBUST UZBEK ASR AND TTS FOR DIALECTAL AND NOISY SETTINGS. (2025). American Journal of Research in Humanities and Social Sciences, 40, 1-4. https://americanjournal.org/index.php/ajrhss/article/view/3094