学位论文

【摘要】

The objective of the proposed research is to develop a probabilistic model of speech production that exploits the multiplicity of mapping between the vocal tract area functions (VTAF) and speech spectra. Two thrusts are developed. In the first, a latent variable model that captures uncertainty in estimating the VTAF from speech data is investigated. The latent variable model uses this uncertainty to generate many-to-one mapping between observations of the VTAF and speech spectra. The second uses the probabilistic model of speech production to improve the performance of traditional speech algorithms, such as enhancement, acoustic model adaptation, etc.In this thesis, we propose to model the process of speech production with a probability map. This proposed model treats speech production as a probabilistic process with many-to-one mapping between VTAF and speech spectra. The thesis not only outlines a statistical framework to generate and train these probabilistic models from speech, but also demonstrates its power and flexibility with such applications as enhancing speech from both perceptual and recognition perspectives.

【预览】

附件列表
Files	Size	Format	View
Probabilistic space maps for speech with applications	3149KB	PDF	download


Probabilistic space maps for speech with applications
Automatic bandwidth expansion;Probabilistic space maps;Statistical models;Acoustic model adaptation;Speech enhancement
Kalgaonkar, Kaustubh ; Electrical and Computer Engineering
University:Georgia Institute of Technology
Department:Electrical and Computer Engineering
关键词: Automatic bandwidth expansion; Probabilistic space maps; Statistical models; Acoustic model adaptation; Speech enhancement;
Others : https://smartech.gatech.edu/bitstream/1853/42739/1/kalgaonkar_kaustubh_p_201112_phd.pdf
美国\|英语
来源: SMARTech Repository
PDF


	文献评价指标
	下载次数：18次	浏览次数：26次

【 摘 要 】

【 预 览 】

【摘要】

【预览】