学位论文详细信息
Keywords at Work: Investigating Keyword Extraction in Social Media Applications
Keyword Extraction;Emails;usage expressions;Piazza forum;student performance prediction;Computer Science;Engineering;Computer Science & Engineering
Lahiri, ShibamouliTarau, Paul ;
University of Michigan
关键词: Keyword Extraction;    Emails;    usage expressions;    Piazza forum;    student performance prediction;    Computer Science;    Engineering;    Computer Science & Engineering;   
Others  :  https://deepblue.lib.umich.edu/bitstream/handle/2027.42/145929/lahiri_1.pdf?sequence=1&isAllowed=y
瑞士|英语
来源: The Illinois Digital Environment for Access to Learning and Scholarship
PDF
【 摘 要 】

This dissertation examines a long-standing problem in Natural Language Processing (NLP) -- keyword extraction -- from a new angle. We investigate how keyword extraction can be formulated on social media data, such as emails, product reviews, student discussions, and student statements of purpose. We design novel graph-based features for supervised and unsupervised keyword extraction from emails, and use the resulting system with success to uncover patterns in a new dataset -- student statements of purpose. Furthermore, the system is used with new features on the problem of usage expression extraction from product reviews, where we obtain interesting insights. The system while used on student discussions, uncover new and exciting patterns.While each of the above problems is conceptually distinct, they share two key common elements -- keywords and social data. Social data can be messy, hard-to-interpret, and not easily amenable to existing NLP resources. We show that our system is robust enough in the face of such challenges to discover useful and important patterns. We also show that the problem definition of keyword extraction itself can be expanded to accommodate new and challenging research questions and datasets.

【 预 览 】
附件列表
Files Size Format View
Keywords at Work: Investigating Keyword Extraction in Social Media Applications 512KB PDF download
  文献评价指标  
  下载次数:6次 浏览次数:2次