学位论文详细信息
LOW RESOURCE HIGH ACCURACY KEYWORD SPOTTING
Keyword Spotting;Low Resource;Computer Science
Chen, GuoguoKhudanpur, Sanjeev P. ;
Johns Hopkins University
关键词: Keyword Spotting;    Low Resource;    Computer Science;   
Others  :  https://jscholarship.library.jhu.edu/bitstream/handle/1774.2/40272/CHEN-DISSERTATION-2016.pdf?sequence=1&isAllowed=y
瑞士|英语
来源: JOHNS HOPKINS DSpace Repository
PDF
【 摘 要 】

Keyword spotting (KWS) is a task to automatically detect keywords of interest in continuous speech, which has been an active research topic for over 40 years. Recently there is a rising demand for KWS techniques in resource constrained conditions. For example, as for the year of 2016, USC Shoah Foundation covers audio-visual testimonies from survivors and other witnesses of the Holocaust in 63 countries and 39 languages, and providing search capability for those testimonies requires substantial KWS technologies in low language resource conditions, as for most languages, resources for developing KWS systems are not as rich as that for English.Despite the fact that KWS has been in the literature for a long time, KWS techniques in resource constrained conditions have not been researched extensively. In this dissertation, we improve KWS performance in two low resource conditions: low language resource condition where language specific data is inadequate, and low computation resource condition where KWS runs on computation constrained devices.For low language resource KWS, we focus on applications for speech data mining, where large vocabulary continuous speech recognition (LVCSR)-based KWS techniques are widely used. Keyword spotting for those applications are also known as keyword search (KWS) or spoken term detection (STD). A key issue for this type of KWS technique is the out-of-vocabulary (OOV) keyword problem. LVCSR-based KWS can only search for words that are defined in the LVCSR;;s lexicon, which is typically very small in a low language resource condition. To alleviate the OOV keyword problem, we propose a technique named ;;proxy keyword search;; that enables us to search for OOV keywords with regular LVCSR-based KWS systems. We also develop a technique that expands LVCSR;;s lexicon automatically by adding hallucinated words, which increases keyword coverage and therefore improves KWS performance. Finally we explore the possibility of building LVCSR-based KWS systems with limited lexicon, or even without an expert pronunciation lexicon.For low computation resource KWS, we focus on wake-word applications, which usually run on computation constrained devices such as mobile phones or tablets. We first develop a deep neural network (DNN)-based keyword spotter, which is lightweight and accurate enough that we are able to run it on devices continuously. This keyword spotter typically requires a pre-defined keyword, such as ;;Okay Google;;. We then propose a long short-term memory (LSTM)-based feature extractor for query-by-example KWS, which enables the users to define their own keywords.

【 预 览 】
附件列表
Files Size Format View
LOW RESOURCE HIGH ACCURACY KEYWORD SPOTTING 2049KB PDF download
  文献评价指标  
  下载次数:39次 浏览次数:18次