Enhanced speech recognition in natural language processing
Siu-Hong Chang, Kok-Why Ng, Su-Cheng Haw, Yih-Jian Yoong
Abstract
Speech recognition is crucial for helping individuals with physical disabilities access digital content. However, current systems have significant flaws that hinder user experience and complicate daily tasks. Environmental disturbances can cause misinterpretation, and existing automatic speech recognition (ASR) systems struggle with comprehending acoustic and linguistic nuances and handling diverse speaking styles and accents. To address these issues, a new model integrates bidirectional encoder representations from transformers (BERT) and transformer features with natural language processing (NLP) capabilities. This model aims to consolidate semantic, linguistic, and acoustic information extracted from the Kaldi speech recognition toolkit and improve accuracy by rescoring the list of N-best hypotheses. The innovative approach leverages advancements in NLP to enhance speech recognition's accuracy and robustness across various scenarios. Evaluations on the LibriSpeech dataset show that integrating BERT, transformer encoder, and generative pretrained transformer 2 for rescoring N-best hypotheses significantly improves transcription accuracy. The proposed model achieves a word error rate (WER) of 17.98%, outperforming other models. This development paves the way for advancements in speech recognition technology, offering better user experiences in real-world applications.
Keywords
Automatic speech recognition; Bidirectional encoder representations from transformers; GPT-2; Natural language processing; Transformer
DOI:
https://doi.org/10.11591/eei.v14i6.9539
Refbacks
There are currently no refbacks.
This work is licensed under a
Creative Commons Attribution-ShareAlike 4.0 International License .
<div class="statcounter"><a title="hit counter" href="http://statcounter.com/free-hit-counter/" target="_blank"><img class="statcounter" src="http://c.statcounter.com/10241695/0/5a758c6a/0/" alt="hit counter"></a></div>
Bulletin of EEI Stats
Bulletin of Electrical Engineering and Informatics (BEEI) ISSN: 2089-3191 , e-ISSN: 2302-9285 This journal is published by the Institute of Advanced Engineering and Science (IAES) in collaboration with Intelektual Pustaka Media Utama (IPMU) .