会议论文详细信息
International Conference on Information Technology and Digital Applications 2018
Online corpus of spoken Ilokano language
计算机科学;无线电电子学
Apostol, F.^1 ; Malicdem, A.^2
Mariano Marcos State University, Ilocos Norte, Philippines^1
Don Mariano Marcos Memorial State University, La Union, Philippines^2
关键词: Automatic speech recognition;    NAtural language processing;    Online corpora;    Online repositories;    Philippines;    WEB application;   
Others  :  https://iopscience.iop.org/article/10.1088/1757-899X/482/1/012034/pdf
DOI  :  10.1088/1757-899X/482/1/012034
学科分类:计算机科学(综合)
来源: IOP
PDF
【 摘 要 】

There has been a great effort in the collection of different languages in the past years all over the world, and the development of online corpus outside the country brought new possibilities in the Philippines. However, there is a limited resource for the Ilokano Language. This paper introduces the Corpus of Spoken Ilokano Language, an online repository of spoken Ilokano in the Philippines specifically in region 1. The main component of this study is spoken Ilokano. It has been specifically built for natural language processing. It shows the difference of Ilokano language as spoken by Ilokanos in the region. The database consists of 160 speakers, 40 speakers in each province of the region, each speaking about 74 statements. Spoken Ilokano language was audio recorded and transcribed. A web application has been developed making the dataset available online. The corpus was validated to provide a useful resource of data that can be used for automatic speech recognition models.

【 预 览 】
附件列表
Files Size Format View
Online corpus of spoken Ilokano language 1403KB PDF download
  文献评价指标  
  下载次数:24次 浏览次数:23次