ROBUST UZBEK ASR AND TTS FOR DIALECTAL AND NOISY SETTINGS

Authors

  • Sukhrob Avezov Sobirovich PhD, Lecturer in the Department of Russian Language and Literature Bukhara State University

Keywords:

Uzbek ASR, TTS, dialect robustness, self-supervised pretraining, code-switching, noise augmentation, multi-script normalization, test-time adaptation.

Abstract

In this article we present a unified recipe for robust Automatic Speech Recognition (ASR) and Text-to-Speech (TTS) for Uzbek under dialectal variation, code-switching, and real-world noise. We combine self-supervised pretraining, dialect-aware lexicons, multi-script text normalization, targeted augmentation, and test-time adaptation. On simulated and field recordings, the ASR reduces WER by large margins; the TTS maintains naturalness and intelligibility across accents and SNRs.

Downloads

Published

2025-09-23

Issue

Section

Articles

How to Cite

ROBUST UZBEK ASR AND TTS FOR DIALECTAL AND NOISY SETTINGS. (2025). American Journal of Research in Humanities and Social Sciences, 40, 1-4. https://americanjournal.org/index.php/ajrhss/article/view/3094