Skip header navigation
×

Conference Proceeding

A Semi-supervised Corpus Annotation for Saudi Sentiment Analysis Using Twitter

Citation
Alqarafi A, Adeel A, Hawalah A, Swingler K & Hussain A (2018) A Semi-supervised Corpus Annotation for Saudi Sentiment Analysis Using Twitter. In: Hussain A, Zhao H, Ren J, Zheng J, Liu C, Luo B & Zhao X (eds.) Advances in Brain Inspired Cognitive Systems. Lecture Notes in Computer Science, 10989. BICS 2018: 9th International Conference on Brain Inspired Cognitive Systems, Xi'an, China, 07.07.2018-08.07.2018. Cham, Switzerland: Springer International Publishing, pp. 589-596. https://doi.org/10.1007/978-3-030-00563-4_57

Abstract
In the literature, limited work has been conducted to develop sentiment resources for Saudi dialect. The lack of resources such as dialectical lexicons and corpora are some of the major bottlenecks to the successful development of Arabic sentiment analysis models. In this paper, a semi-supervised approach is presented to construct an annotated sentiment corpus for Saudi dialect using Twitter. The presented approach is primarily based on a list of lexicons built by using word embedding techniques such as word2vec. A huge corpus extracted from twitter is annotated and manually reviewed to exclude incorrect annotated tweets which is publicly available. For corpus validation, state-of-the-art classification algorithms (such as Logistic Regression, Support Vector Machine, and Naive Bayes) are applied and evaluated. Simulation results demonstrate that the Naive Bayes algorithm outperformed all other approaches and achieved accuracy up to 91%.

Keywords
Sentiment analysis; Saudi dialect; Word embedding

StatusPublished
Author(s)Alqarafi, Abdulrahman; Adeel, Ahsan; Hawalah, Ahmed; Swingler, Kevin; Hussain, Amir
Title of seriesLecture Notes in Computer Science
Number in series10989
Publication date31/12/2018
Publication date online06/10/2018
URLhttp://hdl.handle.net/1893/29408
PublisherSpringer International Publishing
Place of publicationCham, Switzerland
eISSN1611-3349
ISSN of series0302-9743
ISBN9783030005627
eISBN9783030005634
ConferenceBICS 2018: 9th International Conference on Brain Inspired Cognitive Systems
Conference locationXi'an, China
Dates

Research programmes

Research centres/groups

Scroll back to the top