Thesis details
Knowing a thing is "a thing": The use of acoustic features in multiword expression extraction
Jacobs, Cassandra L.; Fleck, Margaret
Keywords: Collocations; Phrases; Speech processing; Language models
Others  :  https://www.ideals.illinois.edu/bitstream/handle/2142/92965/JACOBS-THESIS-2016.pdf?sequence=1&isAllowed=y
United States | English
Source: The Illinois Digital Environment for Access to Learning and Scholarship
【 Abstract 】

Speakers of a language need to have complex linguistic representations for speaking, often on the level of non-literal, idiomatic expressions like "black sheep". Typically, datasets of these so-called multiword expressions come from hand-crafted ontologies or lexicons, because identifying expressions like these in an unsupervised manner is still an unsolved problem in natural language processing. In this thesis I demonstrate that prosodic features, which are helpful in parsing syntax and interpreting meaning, can also be used to identify multiword expressions. To do this, I extracted noun phrases from the Buckeye corpus, which contains spontaneous spoken language, and matched these noun phrases to page titles in Wikipedia, a massive, freely available encyclopedic ontology of entities and phenomena. Incorporating prosodic features into a model that distinguishes between multiword expressions that are found in Wikipedia titles and those that are not increases classifier performance. This suggests that prosodic cues can help with the automatic extraction of multiword expressions from spontaneous speech, helping models and potentially listeners decide whether something is "a thing" or not.
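
The labeling step described in the abstract, matching noun phrases against Wikipedia page titles, can be illustrated with a minimal sketch. It is not the thesis's implementation: the function names `normalize` and `label_noun_phrases`, the normalization rules (lowercasing, underscores to spaces, whitespace collapsing), and the toy inputs are all assumptions made for illustration.

```python
import re


def normalize(phrase: str) -> str:
    """Lowercase, turn underscores into spaces, and collapse whitespace.

    These normalization rules are an assumption, not taken from the thesis.
    """
    return re.sub(r"\s+", " ", phrase.replace("_", " ")).strip().lower()


def label_noun_phrases(noun_phrases, wikipedia_titles):
    """Pair each noun phrase with True if it exactly matches a title after normalization."""
    title_set = {normalize(title) for title in wikipedia_titles}
    return [(phrase, normalize(phrase) in title_set) for phrase in noun_phrases]


# Toy, made-up inputs for illustration only (not data from the thesis).
titles = ["Black_sheep", "Ice cream"]
phrases = ["black sheep", "ice  cream", "red car"]
labels = label_noun_phrases(phrases, titles)
```

Labels of this kind could then serve as the target for a classifier whose inputs include prosodic features; which features and which model the thesis uses are not stated in the abstract, so they are left out here.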

【 Preview 】
Attachments
Files Size Format View
Knowing a thing is "a thing": The use of acoustic features in multiword expression extraction 744KB PDF download