学位论文

【摘要】

Audio source separation is a well-known problem in the speech community. Many methods have been proposed to isolate speech signals from a multichannel mixture. In this thesis, we will explore a number of techniques involving interchannel phase difference (IPD) features within a tensor factorization framework. IPD features can be extracted on a time-frequency (TF) grid and are a function of the phase characteristics of the mixing process. Thus, the ultimate goal is to form a clustering of these features and produce TF masks that can be used to perform the separation. We discuss various non-tensor-based methods that are capable of modeling linear and nonlinear IPD trends. Then, we discuss generalizations to both nonnegative and complex tensor factorizations (NTF, CTF). We show that each method performs best in certain circumstances and we conclude by saying that more work is needed to devise a generally superior approach.

【预览】

附件列表
Files	Size	Format	View
Phase difference and tensor factorization models for audio source separation	12169KB	PDF	download


Phase difference and tensor factorization models for audio source separation
Nonnegative matrix factorization;Nonnegative tensor factorization;Interchannel phase differences;Audio Source Separation
Traa, Johannes
关键词: Nonnegative matrix factorization; Nonnegative tensor factorization; Interchannel phase differences; Audio Source Separation;
Others : https://www.ideals.illinois.edu/bitstream/handle/2142/95277/TRAA-DISSERTATION-2016.pdf?sequence=1&isAllowed=y
美国\|英语
来源: The Illinois Digital Environment for Access to Learning and Scholarship
PDF


	文献评价指标
	下载次数：7次	浏览次数：17次

【 摘 要 】

【 预 览 】

【摘要】

【预览】