Conference paper details
14th International Conference on Science, Engineering and Technology
Real Time Text Analysis
Natural sciences; Industrial technology
Senthilkumar, K.^1 ; Ruchika Mehra Vijayan, E.^1
VIT University, Vellore, Tamilnadu 632014, India^1
Keywords: Apache Hadoop; Distributed computations; Large scale data; On the flies; Princeton University; Real time analysis; SentiWordNet; Text analysis
Others  :  https://iopscience.iop.org/article/10.1088/1757-899X/263/4/042005/pdf
DOI  :  10.1088/1757-899X/263/4/042005
Source: IOP
【 Abstract 】

This paper illustrates real-time analysis of large-scale data. As a practical implementation, we perform sentiment analysis on live Twitter feeds, scoring each individual tweet. To analyze sentiment, we train our data model on SentiWordNet, a polarity-assigned sample of WordNet, the lexical database from Princeton University. Our main objective is to efficiently analyze large-scale data on the fly using distributed computation. The Apache Spark and Apache Hadoop ecosystem is used as the distributed computation platform, with Java as the development language.
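
The per-tweet scoring step described in the abstract can be sketched in plain Java. This is only an illustrative sketch, not the authors' implementation: the class `TweetSentiment`, its methods, and the four-word lexicon are hypothetical, and the polarity values are made up for the example rather than taken from SentiWordNet. The sketch sums a signed polarity for every known word in a tweet and maps the total to a label.

```java
import java.util.HashMap;
import java.util.Locale;
import java.util.Map;

public class TweetSentiment {

    // Hypothetical toy lexicon: word -> (positive score minus negative score).
    // The values are illustrative only and are NOT taken from SentiWordNet.
    static Map<String, Double> toyLexicon() {
        Map<String, Double> lexicon = new HashMap<>();
        lexicon.put("good", 0.75);
        lexicon.put("great", 0.875);
        lexicon.put("bad", -0.625);
        lexicon.put("terrible", -0.875);
        return lexicon;
    }

    // Sum the polarity of every known word in the tweet; unknown words add 0.
    static double score(String tweet, Map<String, Double> lexicon) {
        double total = 0.0;
        // Lower-case the text and split on anything that is not a letter a-z.
        for (String token : tweet.toLowerCase(Locale.ROOT).split("[^a-z]+")) {
            total += lexicon.getOrDefault(token, 0.0);
        }
        return total;
    }

    // Map a numeric score to a coarse sentiment label.
    static String label(double score) {
        if (score > 0) return "positive";
        if (score < 0) return "negative";
        return "neutral";
    }

    public static void main(String[] args) {
        Map<String, Double> lexicon = toyLexicon();
        String[] tweets = {"Great service, good food", "Terrible delay", "Flight at noon"};
        for (String tweet : tweets) {
            System.out.println(label(score(tweet, lexicon)) + "\t" + tweet);
        }
    }
}
```

Because `score` depends only on the tweet text and a read-only lexicon, it could in principle be applied independently to each record of a distributed stream, which is the property that makes this kind of analysis suitable for a platform such as Apache Spark.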
