ROBUST UZBEK ASR AND TTS FOR DIALECTAL AND NOISY SETTINGS

Sukhrob Avezov Sobirovich

ROBUST UZBEK ASR AND TTS FOR DIALECTAL AND NOISY SETTINGS

Authors

Sukhrob Avezov Sobirovich PhD, Lecturer in the Department of Russian Language and Literature Bukhara State University

Keywords:

Uzbek ASR, TTS, dialect robustness, self-supervised pretraining, code-switching, noise augmentation, multi-script normalization, test-time adaptation.

Abstract

In this article we present a unified recipe for robust Automatic Speech Recognition (ASR) and Text-to-Speech (TTS) for Uzbek under dialectal variation, code-switching, and real-world noise. We combine self-supervised pretraining, dialect-aware lexicons, multi-script text normalization, targeted augmentation, and test-time adaptation. On simulated and field recordings, the ASR reduces WER by large margins; the TTS maintains naturalness and intelligibility across accents and SNRs.

Downloads

Published

2025-09-23

Issue

Vol. 40 (2025)

Section

Articles

License

This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.

How to Cite

ROBUST UZBEK ASR AND TTS FOR DIALECTAL AND NOISY SETTINGS. (2025). American Journal of Research in Humanities and Social Sciences, 40, 1-4. https://americanjournal.org/index.php/ajrhss/article/view/3094