期刊论文详细信息
Applied Sciences
MULDASA: Multifactor Lexical Sentiment Analysis of Social-Media Content in Nonstandard Arabic Social Media
Saad Alanazi1  Ghadah Alwakid1  Mamoona Humayun2  Najm Us Sama3  Mahmoud El Haj4  Taha Osman5 
[1] Department of Computer Science, College of Computer and Information Sciences, Jouf University, Sakaka 72311, Saudi Arabia;Department of Information Systems, College of Computer and Information Sciences, Jouf University, Sakaka 72311, Saudi Arabia;Faculty of Computer Science and Information Technology, Universiti Malaysia Sarawak, Kota Samarahan 94300, Sarawak, Malaysia;School of Computing and Communications, Lancaster University, Clifton Lane, Lancaster LA1 4YW, UK;School of Science and Technology, Nottingham Trent University, Clifton Lane, Nottingham NG11 8NS, UK;
关键词: sentiment analysis;    Arabic NLP;    lexical;    Saudi dialects;    Arabic social media;    Twitter;   
DOI  :  10.3390/app12083806
来源: DOAJ
【 摘 要 】

The semantically complicated Arabic natural vocabulary, and the shortage of available techniques and skills to capture Arabic emotions from text hinder Arabic sentiment analysis (ASA). Evaluating Arabic idioms that do not follow a conventional linguistic framework, such as contemporary standard Arabic (MSA), complicates an incredibly difficult procedure. Here, we define a novel lexical sentiment analysis approach for studying Arabic language tweets (TTs) from specialized digital media platforms. Many elements comprising emoji, intensifiers, negations, and other nonstandard expressions such as supplications, proverbs, and interjections are incorporated into the MULDASA algorithm to enhance the precision of opinion classifications. Root words in multidialectal sentiment LX are associated with emotions found in the content under study via a simple stemming procedure. Furthermore, a feature–sentiment correlation procedure is incorporated into the proposed technique to exclude viewpoints expressed that seem to be irrelevant to the area of concern. As part of our research into Saudi Arabian employability, we compiled a large sample of TTs in 6 different Arabic dialects. This research shows that this sentiment categorization method is useful, and that using all of the characteristics listed earlier improves the ability to accurately classify people’s feelings. The classification accuracy of the proposed algorithm improved from 83.84% to 89.80%. Our approach also outperformed two existing research projects that employed a lexical approach for the sentiment analysis of Saudi dialects.

【 授权许可】

Unknown   

  文献评价指标  
  下载次数:0次 浏览次数:0次