期刊论文详细信息
Kurdistan Journal of Applied Research
Performance Analysis: AI-based VIST Audio Player by Microsoft Speech API
Ribwar Bakhtyar Ibrahim1 
[1] Database Technology, College of Informatics, Sulaimani Polytechnic University, Sulaimani, Iraq;
关键词: speech recognition, microsoft speech api, subtitles, speech to text, speech-to-text recognition, artificial intelligence. a voice interactive speech to text (vist).microsoft speech api.;   
DOI  :  10.24017/science.2021.1.3
来源: DOAJ
【 摘 要 】

Speech recognition has gained much attention from researchers for almost last two decades. Isolated words, connected words, and continuous speech are the main focused areas of speech recognition. Researchers have adopted many techniques to solve speech recognition challenges under the umbrella of Artificial Intelligence (AI), Pattern Recognition and Acoustic Phonetic approaches. Variation in pronunciation of words, individual accents, unwanted ambient noise, speech context, and quality of input devices are some of these challenges in speech recognition. Many Application Programming Interface (API)s are developed to overcome the issue of accuracy in a speech-to-text conversion such as Microsoft Speech API and Google Speech API. In this paper, the performance of Microsoft Speech API is analyzed against other Speech APIs mentioned in the literature on the special dataset (without background noise) prepared. A Voice Interactive Speech to Text (VIST) audio player was developed for the analysis of Microsoft Speech API. VIST audio player creates runtime subtitles of the audio files running on it; the player is responsible for speech to text conversion in real-time. Microsoft Speech API was incorporated in the application to validate and make the performance of API measurable. The experiments proved the Microsoft Speech API more accurate with respect to other APIs in the context of the prepared dataset for the VIST audio player. The accuracy rate according to the precision-recall is 96% for Microsoft Speech API, which is better than previous ones as mentioned in the literature.

【 授权许可】

Unknown   

  文献评价指标  
  下载次数:0次 浏览次数:0次