科技报告详细信息
Identifying Themes in Social Media and Detecting Sentiments
Pal, Jayanta Kumar ; Saha, Abhisek
HP Development Company
关键词: Social Media;    Text mining;   
RP-ID  :  HPL-2010-50
学科分类:计算机科学(综合)
美国|英语
来源: HP Labs
PDF
【 摘 要 】

Recently, a huge wave of social media has generated significant impact in people's perceptions about technological domains. They are captured in several blogs/forums, where the themes relate to products of several companies. One of the companies can be interested to track them as resources for customer perceptions and detect user sentiments. The keyword- based approaches for identifying such themes fail to give satisfactory level of accuracy. Here, we address the above problems using statistical text-mining of blog entries. The crux of the analysis lies in mining quantitative information from textual entries. Once the relevant blog entries for the company/its competitors are filtered out, the theme identification is performed using a highly accurate novel technique termed as 'Best Separators Algorithm'. Logistic regression coupled with dimension reduction technique (singular value decomposition) is used to identify the tonality of those blogs. The final analysis shows significant improvement in terms of accuracy over popular approaches.

【 预 览 】
附件列表
Files Size Format View
RO201804100002767LZ 302KB PDF download
  文献评价指标  
  下载次数:22次 浏览次数:76次