| International Conference on Information Technology and Digital Applications 2018 | |
| Online corpus of spoken Ilokano language | |
| Computer science; Radio electronics | |
| Apostol, F.^1 ; Malicdem, A.^2 | |
| Mariano Marcos State University, Ilocos Norte, Philippines^1 | |
| Don Mariano Marcos Memorial State University, La Union, Philippines^2 | |
| Keywords: Automatic speech recognition; Natural language processing; Online corpora; Online repositories; Philippines; Web application | |
| Others : https://iopscience.iop.org/article/10.1088/1757-899X/482/1/012034/pdf DOI : 10.1088/1757-899X/482/1/012034 |
| Subject classification: Computer science (general) | |
| Source: IOP | |
【 Abstract 】
Considerable effort has gone into collecting language data around the world in recent years, and the development of online corpora abroad has opened new possibilities in the Philippines. However, resources for the Ilokano language remain limited. This paper introduces the Corpus of Spoken Ilokano Language, an online repository of spoken Ilokano in the Philippines, specifically in Region 1. The corpus centers on spoken Ilokano and was built specifically for natural language processing. It captures differences in the Ilokano language as spoken by Ilokanos across the region. The database covers 160 speakers, 40 from each province of the region, each speaking about 74 statements. The speech was audio-recorded and transcribed, and a web application was developed to make the dataset available online. The corpus was validated so that it provides a useful data resource for automatic speech recognition models.
【 Preview 】
| Files | Size | Format | View |
|---|---|---|---|
| Online corpus of spoken Ilokano language | 1403KB | PDF | |