期刊论文详细信息
Jurnal RESTI: Rekayasa Sistem dan Teknologi Informasi
Naïve Bayes-Support Vector Machine Combined BERT to Classified Big Five Personality on Twitter
article
Billy Anthony Christian Martani1  Erwin Budi Setiawan1 
[1] Telkom University
关键词: BERT;    Big Five Personality;    LIWC;    Naïve Bayes-Support Vector Machine;   
DOI  :  10.29207/resti.v6i6.4378
来源: Ikatan Ahli Indormatika Indonesia
PDF
【 摘 要 】

Twitter is one of the most popular social media used to interact online. Through Twitter, a person's personality can be determined based on that person's thoughts, feelings, and behavior patterns. A person has five main personalities likes Openness, Conscientiousness, Extraversion, Agreeableness, and Neuroticism. This study will make five personality predictions using the Naïve Bayes method – Support Vector Machine, Synthetic Minority Over Sampling Technique (SMOTE), Linguistic Inquiry Word Count (LIWC), and Bidirectional Encoder from Transformers Representations (BERT). A questionnaire was distributed to people who used Twitter to collect and become a dataset in this research. The dataset obtained will be processed into SMOTE to balance the data. Linguistic Inquiry Word Count is used as a linguistic feature and BERT will be used as a semantic approach. The Naïve Bayes method is used to perform the weighting and the Support Vector Machine is used to classify Big Five Personalities. To help improve accuracy, the Optuna Hyperparameter Tuning method will be added to the Naïve Bayes Support Vector Machine model. This study has an accuracy of 87.82% from the results of combining SMOTE, BERT, LIWC, and Tuning where the accuracy increases from the baseline.

【 授权许可】

Unknown   

【 预 览 】
附件列表
Files Size Format View
RO202307110004257ZK.pdf 404KB PDF download
  文献评价指标  
  下载次数:2次 浏览次数:0次