Automating Twitter Data Annotation Process for Sentiment Analysis

Hasanein Alharbi

doi:10.29196/jubpas.v33i4.6146

PDF

Published: 02-01-2026

DOI: https://doi.org/10.29196/jubpas.v33i4.6146

Keywords:

Sentiment Analysis, Machine Learning, Semantic Similarity, NRC Lexicon Thesaurus

Hasanein Alharbi

College of Information Technology, University of Babylon, Hilla , Iraq

Abstract

Background:

Sentiment analysis algorithms require high-quality annotated data during the training phase. However, this requirement has led to complex, time-consuming and costly manual data annotation process. To address these challenges, this research proposes an automatic data annotation process for sentiment analysis.

Materials and Methods:

Three semantic orientation measures (Pointwise Mutual Information, latent Semantic Analysis, and Word2Vec), five classification algorithms (K-Nearest Neighbors, Logistic Regression, naïve Bayes, Random Forest, Support Vector Machine) and NRC lexicon thesaurus are used to automate the process of tweet annotation for sentiment analysis.

Results:

Tweets were annotated using five classifiers and three semantic measures, forming fifteen combinations. The Inter-Annotator Agreement (IAA) among these combinations was evaluated using Cohen’s Kappa statistic. The obtained results show that (Pointwise Mutual Information + Logistic Regression) and (Pointwise Mutual Information + Naïve Bayes) achieved the highest agreement score of 0.7008.

Conclusion:

These results have shown that the corpus-based semantic orientation measures have provided substantive results. However, it can still be enhanced through the use of a broader vocabulary, the application of contextual information and the implementation of the newest deep learning algorithms.

Issue

Vol. 33 No. 4 (2025): Vol.33 Issue 4 ( 2025)

Section

Articles

This work is licensed under a Creative Commons Attribution 4.0 International License.

How to Cite

[1]

“Automating Twitter Data Annotation Process for Sentiment Analysis”, JUBPAS, vol. 33, no. 4, pp. 114–136, Jan. 2026, doi: 10.29196/jubpas.v33i4.6146.

Similar Articles

Rusul Hakim Ali , Aiad Ali Hussien Al-Zaidy , Lithofacies Analysis and Depositional Development of Zubair Formation in West Qurna Oil Field, Southern Iraq , JOURNAL OF UNIVERSITY OF BABYLON for Pure and Applied Sciences: Vol. 28 No. 3 (2020)
Israa Luay AL-jaryan, Ali Hmood AL-saady, Fadil Rasul Al-Khafagy, Polymorphism of Microsatellite markers and ‎Their Association with Egg Production Traits in ‎Iraqi Chickens , JOURNAL OF UNIVERSITY OF BABYLON for Pure and Applied Sciences: Vol. 26 No. 2 (2018)
Qasim Shakir Kadhim , S.A.A. AL Saati , Study of Modeling of Large-Scale Atmospheric Circulation Using Mathematics , JOURNAL OF UNIVERSITY OF BABYLON for Pure and Applied Sciences: Vol.31 No 4 ( 2023)
Zaid Ibrahim Rasool, Using Support Vector Machine to Detect Data Hiding in Color Images , JOURNAL OF UNIVERSITY OF BABYLON for Pure and Applied Sciences: Vol. 32 No. 2 (2024): Vol.32 No 2 ( 2024)
Nisreen Saad Hadi, Predictive Intelligence Against Fake News Through Intent-Based Language Analysis , JOURNAL OF UNIVERSITY OF BABYLON for Pure and Applied Sciences: Vol. 33 No. 4 (2025): Vol.33 Issue 4 ( 2025)
Ali Al-Jawdah, Novel Trend in Development of QCM System Based on Analysis of Adsorption Kinetics , JOURNAL OF UNIVERSITY OF BABYLON for Pure and Applied Sciences: Vol. 30 No. 2 (2022)
Faraidun K. Hamasalh, Seaman S. Hamasalh, Spline Fractional Polynomial for Computing Fractional Differential Equations , JOURNAL OF UNIVERSITY OF BABYLON for Pure and Applied Sciences: Vol. 30 No. 2 (2022)
Huda. A. Daham, Ajel. S.Y. AL-Hadadi, Evaluation of Dispersion for Soil Properties from Faw City- Southern Iraq , JOURNAL OF UNIVERSITY OF BABYLON for Pure and Applied Sciences: Vol 31 No 1 (2023)
Ameera Kamal Khaleel, Haider Mehdi Hamid, Mortada Mahmoud Nouri, Mahdi Emad Mahdi, Rasool Hammed Abbas, Youssef Mohamed Talib, Prevalence of Dental Anomalies in an Adult Dentate Najaf /Iraqi Population by Using Digital Panoramic Radiographs , JOURNAL OF UNIVERSITY OF BABYLON for Pure and Applied Sciences: Vol 30 No 3 (2022)
Isrra Adnan Auda Khadhim, Wurood Alwan Kadhim, Manar Mohammad Hasan Al-Murshidi, A Comparative Anatomical and Histological Analysis of some Iraqi Birds' Liver: Review , JOURNAL OF UNIVERSITY OF BABYLON for Pure and Applied Sciences: Vol. 33 No. 4 (2025): Vol.33 Issue 4 ( 2025)

You may also start an advanced similarity search for this article.

Article Sidebar

Main Article Content

Abstract

Article Details

Issue

Section

How to Cite

Similar Articles