学位论文详细信息
Improving neural language models on low-resource creole languages
Natural Language Processing;Neural Networks;Deep Learning;Linguistics;Creole Languages;Creolistics
Schieferstein, Sarah ; Hockenmaier ; Julia
关键词: Natural Language Processing;    Neural Networks;    Deep Learning;    Linguistics;    Creole Languages;    Creolistics;   
Others  :  https://www.ideals.illinois.edu/bitstream/handle/2142/102512/SCHIEFERSTEIN-THESIS-2018.pdf?sequence=1&isAllowed=y
美国|英语
来源: The Illinois Digital Environment for Access to Learning and Scholarship
PDF
【 摘 要 】

When using neural models for NLP tasks, like language modelling, it is difficult to utilize a language with little data, also known as a low-resource language. Creole languages are frequently low-resource and as such it is difficult to train neural language models for them well. Creole languages are a special type of language that is widely thought of as having multiple parents and thus receiving a mix of evolutionary traits from all of them. One of a creole language’s parents is known as the lexifier, which gives the creole its lexicon, and the other parents are known as substrates, which possibly are thought to give the creole language its morphology and syntax. Creole languages are most lexically similar to their lexifier and most syntactically similar to otherwise unrelated creole languages. High lexical similarity to the lexifier is unsurprising because by definition lexifiers provide a creole’s lexicon, but high syntactic similarity to the other unrelated creole languages is not obvious and is explored in detail. We can use this information about creole languages’ unique genesis and typology to decrease the perplexity of neural language models on low-resource creole languages. We discovered that syntactically similar languages (especially other creole languages) can successfully transfer learned features during pretraining from a high-resource language to a low-resource creole language through a method called neural stacking. A method that normalized the vocabulary of a creole language to its lexifier also lowered perplexities of creole-language neural models.

【 预 览 】
附件列表
Files Size Format View
Improving neural language models on low-resource creole languages 367KB PDF download
  文献评价指标  
  下载次数:26次 浏览次数:9次