Volume: 1 Issue: 1
Year: 2024, Page: 15-19,
Received: Feb. 22, 2024 Accepted: May 10, 2024 Published: May 22, 2024
Machine learning algorithms are being studied to develop algorithms that can recognize and segment languages in audio recordings. This technology has great potential to improve our ability to communicate and understand different language communities. The main goal of multilingual recognition is to develop models that can accurately recognize spoken language. This is especially useful in applications such as call centers and voice assistants. Speech patterns found in online podcasts, audiobooks and its variants in Speech Corpus. This corpus contains utterances and each takes an equal time of 10 seconds. The entire corpus is divided into two parts, a large object as a training data set and a small one as a test set. Thus, an acoustic model that uses the mean values of the BFCC appears to be an appropriate method for speech recognition. The system uses Convolutional K Nearest Neighbors (KNN) to solve the multiple classification problem. The aim of the project is to know Punjabi, Hindi and Gujarati.
Keywords: Multilingual Spoken Language Recognition Using Machine Learning Algorithms
Rao L. Multiclass Spoken Language Identification for Indian Languages using Deep Learning .
Das, Shekhar H, Roy P. A deep dive into Deep learning techniques for solving spoken language identification problems. (pp. 81-100) Academic press. 2019.
Sharma N, Jain V, Mishra A. An analysis of CNN for Image classification. Procedia computer science. 2018;132:377–384.
Kaz Z. Sentence Level Language Identification in Gujarati, hindi . .
Kim H, Park JS. Automatic Language Identification Using Speech Rhythm Features for Multi-Lingual Speech Recognition. Applied Sciences. 10(7).
Padi B, Mohan A, Ganapathy S. Towards Relevance and Sequence Modeling in Language Recognition. IEEE/ACM Transactions on Audio, Speech, and Language Processing. 2020;28:1223–1232.
Verma M, Buduru AB. Fine-grained Language Identification with Multilingual CapsNet Model. In: 2020 IEEE Sixth International Conference on Multimedia Big Data (BigMM). (pp. 94-102) IEEE. 2020.
Barnard E, Cole RA. Reviewing automatic languageidentification. IEEE Signal Processing Magazine. .
Waibel A, Geutner P, Tomokiyo LM, Schultz T, Woszczyna M. Multilinguality in speech and spoken language systems. Proceedings of the IEEE. 2000;88(8):1297–1313.
© 2024 Gore et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Gore PN, Tukaram Bawadane S, Sanjay Ingale S, Watve SG. (2024). Multilingual Spoken Language Recognition Using Machine Learning Algorithms. International Journal of Electronics and Computer Applications. 1(1): 15-19.