期刊论文详细信息
Data in Brief
Development of Hausa dataset a baseline for speech recognition
Moussa Mahamat Boukar1  Umar Adam Ibrahim2  Muhammed Aliyu Suleiman2 
[1] Faculty of Natural and Applied Sciences, Computer Science Department, Nile University of Nigeria, Abuja, Nigeria;Corresponding authors.;
关键词: Corpus;    Automatic speech;    NLP;    Text-to-speech;    Hausa corpus;   
DOI  :  
来源: DOAJ
【 摘 要 】

The Hausa language read-speech dataset was created by recording native Hausa speakers. The recording took place at Nile university of Nigeria audio studio and radio broadcasting studio. The recorded dataset was segmented into unigram and bigram. The Hausa speech dataset contain 47hr of recorded audio speech. The dataset can be used for automatic speech recognition, speech synthesis, Text-to-Speech and speech-to-text application.

【 授权许可】

Unknown   

  文献评价指标  
  下载次数:0次 浏览次数:0次