Dissertation

【Abstract】

With the rapid growth of deep neural network models, a growing number of tasks have seen performance gains after moving to deep models. Speech processing applications, e.g., speech enhancement, speech bandwidth expansion, and dereverberation, have also benefited. Most deep models focus on improving the estimate of the spectral magnitude. However, there is evidence that the phase spectrum is informative as well. This dissertation therefore investigates practical approaches to recovering the spectral phase by resolving two inconsistency issues, frame-length inconsistency and frame-overlap inconsistency, using convex programming and alternating projection, respectively. Frameworks that integrate the two methods are also explored. The proposed approaches and frameworks exploit certain characteristics of speech signals and make very few assumptions, so they can be applied to a variety of speech processing tasks.

【Preview】

Attachments
Files	Size	Format	View
Some new applications of phase information to speech processing	11725KB	PDF	download


Some new applications of phase information to speech processing
Li, Kehuang; Lee, Chin-Hui; Juang, Biing Hwang; Clements, Mark; Li, Geoffery; Xie, Yao
University:Georgia Institute of Technology
Department:Electrical and Computer Engineering
Keywords: Speech processing; Phase; Deep neural network
Others : https://smartech.gatech.edu/bitstream/1853/62636/1/LI-DISSERTATION-2019.pdf
United States | English
Source: SMARTech Repository


	Document metrics
	Downloads: 19	Views: 14
