Brazilian Computer Society. Journal
Free tools and resources for Brazilian Portuguese speech recognition
Carlos Patrick1, Isabel Trancoso2, Nelson Neto2, Aldebaro Klautau4
[1] Federal University of Pará, Belém, Brazil; IST/INESC-ID, Lisbon, Portugal
Keywords: Speech recognition; Brazilian Portuguese; Grapheme-to-phone conversion; Application programming interface; Speech-based applications
DOI: 10.1007/s13173-010-0023-1
Subject classification: Agricultural sciences (general)
Source: Springer UK
【 Abstract 】
An automatic speech recognition system has modules that depend on the language and, while there are many public resources for some languages (e.g., English and Japanese), the resources for Brazilian Portuguese (BP) are still limited. This work describes the development of resources and free tools for BP speech recognition, consisting of text and audio corpora, a phonetic dictionary, a grapheme-to-phone converter, and language and acoustic models. All of them are publicly available and, together with a proposed application programming interface, have been used to develop several new applications, including a speech module for the OpenOffice suite. Performance tests are presented, comparing the developed BP system with a commercial software product. The paper also describes an application that combines speech synthesis and speech recognition with a natural language processing module dedicated to statistical machine translation. This application allows the translation of spoken conversations from BP to English and vice versa. The resources make it easier for other academic groups and industry to adopt BP speech technologies.
【 License 】
CC BY