Preprint Article Version 1 This version is not peer-reviewed

Integrated Method of Deep learning and Large Language Model in Speech Recognition

Version 1 : Received: 17 July 2024 / Approved: 18 July 2024 / Online: 19 July 2024 (06:29:02 CEST)

How to cite: Guan, B.; Cao, J.; Huang, B.; Wang, Z.; Wang, X.; Wang, Z. Integrated Method of Deep learning and Large Language Model in Speech Recognition. Preprints 2024, 2024071520. https://doi.org/10.20944/preprints202407.1520.v1 Guan, B.; Cao, J.; Huang, B.; Wang, Z.; Wang, X.; Wang, Z. Integrated Method of Deep learning and Large Language Model in Speech Recognition. Preprints 2024, 2024071520. https://doi.org/10.20944/preprints202407.1520.v1

Abstract

This research aims to explore the integration method of deep learning and large language models in speech recognition to improve the system’s recognition accuracy and ability to handle complex contexts. Deep neural network (DNN), convolutional neural network (CNN), long short-term memory network (LSTM) and Transformer-based large language model are used to build an integrated acoustic and language model framework. Experiments on TIMIT, LibriSpeech and Common Voice datasets show that the ensemble model shows significant improvements in both word error rate (WER) and real-time factor (RTF) compared to traditional models. Especially in terms of adaptability to multiple languages and accent changes, the model shows superior performance. The conclusion shows that through technology integration, the performance of the speech recognition system in complex environments can be effectively improved, providing a new direction for the future development of speech recognition technology.

Keywords

deep learning; large language model; speech recognition; ensemble method

Subject

Computer Science and Mathematics, Computer Science

Comments (0)

We encourage comments and feedback from a broad range of readers. See criteria for comments and our Diversity statement.

Leave a public comment
Send a private comment to the author(s)
* All users must log in before leaving a comment
Views 0
Downloads 0
Comments 0
Metrics 0


×
Alerts
Notify me about updates to this article or when a peer-reviewed version is published.
We use cookies on our website to ensure you get the best experience.
Read more about our cookies here.