Journal of Computer Science
Comparative Evaluation of Phone Duration Models for Greek Emotional Speech | Science Publications
Alexandros Lazaridis, Vasiliki Bourna, Nikos Fakotakis
Keywords: Phone duration modeling; statistical modeling; emotional speech; text-to-speech synthesis
DOI: 10.3844/jcssp.2010.341.349
Subject classification: Computer Science (General)
Source: Science Publications
[Abstract]
Problem statement: In this study we address the task of phone duration modeling for Greek emotional speech synthesis. Approach: Several well-established machine learning techniques are applied for this purpose to an emotional speech database. The phone duration prediction models are built on phonetic, morphosyntactic and prosodic features that can be extracted from text alone. We employ model and regression trees, linear regression, lazy learning algorithms and meta-learning algorithms that use regression trees as base learners, trained on a Modern Greek emotional database covering five archetypal emotional categories: anger, fear, joy, neutral and sadness. Results: Model trees based on the M5
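As a rough illustration of the lazy learning family named in the abstract, the following is a minimal sketch of k-nearest-neighbour regression for phone duration. It is not the authors' implementation: the three binary features (`is_vowel`, `is_stressed`, `is_phrase_final`), the function name `knn_predict`, and every duration value in `TRAIN` are hypothetical and invented purely for illustration; none of them come from the emotional speech database.

```python
import math

# Toy training set: (feature vector, phone duration in milliseconds).
# Hypothetical features: (is_vowel, is_stressed, is_phrase_final).
# All values are invented for illustration and are NOT from the paper's data.
TRAIN = [
    ((1.0, 1.0, 0.0), 110.0),
    ((1.0, 0.0, 0.0), 80.0),
    ((0.0, 0.0, 0.0), 60.0),
    ((1.0, 1.0, 1.0), 140.0),
    ((0.0, 0.0, 1.0), 75.0),
]


def knn_predict(train, query, k=3):
    """Predict a duration as the mean of the k nearest training examples.

    "Lazy" learning: no model is fitted in advance; all work happens at
    query time by comparing the query against the stored examples.
    """
    if not 1 <= k <= len(train):
        raise ValueError("k must be between 1 and the number of training examples")
    # Rank stored examples by Euclidean distance to the query vector.
    neighbours = sorted(train, key=lambda ex: math.dist(ex[0], query))[:k]
    # Average the durations of the k closest examples.
    return sum(duration for _, duration in neighbours) / k


if __name__ == "__main__":
    # A stressed, phrase-final vowel.
    print(knn_predict(TRAIN, (1.0, 1.0, 1.0), k=1))  # 140.0
    print(knn_predict(TRAIN, (1.0, 1.0, 1.0), k=2))  # 125.0
```

With `k=1` the prediction is simply the duration of the closest stored phone; larger `k` smooths the estimate by averaging over more neighbours, at the cost of blurring distinctions between phone contexts.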
[License]
Unknown