Jurnal RESTI: Rekayasa Sistem dan Teknologi Informasi | |
Word2Vec on Sentiment Analysis with Synthetic Minority Oversampling Technique and Boosting Algorithm | |
article | |
Rayhan Rahmanda1  Erwin Budi Setiawan1  | |
[1] Telkom University | |
关键词: sentiment analysis; logistic regression; word2vec; twitter; | |
DOI : 10.29207/resti.v6i4.4186 | |
来源: Ikatan Ahli Indormatika Indonesia | |
【 摘 要 】
Customer opinion is an important aspect in determining the success of a company or service provider. By determining the sentiment of the existing opinion, the company can use it as an evaluation material to improve the quality of the service or product provided. Sentiment analysis can be used as a measure of opinion sentiment with input data in the form of a corpus which will be classified into positive or negative classes to obtain the level of customer satisfaction with a product or service. Aspect-based sentiment analysis can be used by companies to analyze more specifically and find out what aspects need to be improved. In this research, an aspect-based sentiment analysis was conducted on Telkomsel users on Twitter. The data used is 16,992 tweets from users who discuss several aspects such as Telkomsel's services and signals in Twitter. In this research Word2Vec was used for feature expansion to minimize vocabulary mismatch caused by limited words in tweets. The results showed that Word2Vec, Synthetic Minority Oversampling Technique (SMOTE), and Boosting algorithm combination with Logistic Regression classifier achieve highest accuracy of 95.10% for signal aspect and using hyperparameters makes the service aspect get the highest accuracy of 93.34%.
【 授权许可】
Unknown
【 预 览 】
Files | Size | Format | View |
---|---|---|---|
RO202307110004198ZK.pdf | 377KB | download |