This literature survey focuses on various methodologies and frameworks that have been developed for speech recognition, accent recognition, and dialect translation. The research is aimed at building a comprehensive understanding of how existing technologies like Wav2Vec, SpeechBrain, and hybrid CTC/Attention models can be applied to create a speech converter API that translates colloquial Malayalam speech into standard Malayalam text and further into English. The goal of this API is to bridge communication gaps caused by the linguistic diversity within Malayalam dialects, thus improving accessibility in fields such as education, social media, and public services.
Keywords:
Subject:
Computer Science and Mathematics - Artificial Intelligence and Machine Learning
Language: From Hearing to Speech and Writing
Copyright: This open access article is published under a Creative Commons CC BY 4.0 license, which permit the free download, distribution, and reuse, provided that the author and preprint are cited in any reuse.
Alerts
Preprints.org is a free preprint server supported by MDPI in Basel, Switzerland.