Annotated dataset for sentiment analysis and sarcasm detection: bilingual code-mixed english-malay social media data in the public security domain

Mohd Suhairi Md Suhaimin and Mohd Hanafi Ahmad Hijazi and Ervin Gubin Moung (2024) Annotated dataset for sentiment analysis and sarcasm detection: bilingual code-mixed english-malay social media data in the public security domain. Sciencedirect, 55. pp. 1-9. ISSN 2352-3409

[img] Text
FULL TEXT.pdf
Restricted to Registered users only

Download (636kB) | Request a copy

Abstract

Sentiment analysis in the public security domain involves analysing public sentiment, emotions, opinions, and attitudes toward events, phenomena, and crises. However, the complexity of sarcasm, which tends to alter the intended meaning, combined with the use of bilingual code-mixed content, hampers sentiment analysis systems. Currently, limited datasets are available that focus on these issues. This paper introduces a comprehensive dataset constructed through a systematic data acquisition and annotation process. The acquisition process includes collecting data from social media platforms, starting with keyword searching, querying, and scraping, resulting in an acquired dataset. The subsequent annotation process involves refining and labelling, starting with data merging, selection, and annotation, ending in an annotated dataset. Three expert annotators from different fields were appointed for the labelling tasks, which produced

Item Type: Article
Keyword: Sarcastic sentiment Multilingual Southeast asia Public security threat Multitask learning
Subjects: Q Science > Q Science (General) > Q1-390 Science (General) > Q300-390 Cybernetics
T Technology > TA Engineering (General). Civil engineering (General) > TA1-2040 Engineering (General). Civil engineering (General) > TA1501-1820 Applied optics. Photonics
Department: FACULTY > Faculty of Computing and Informatics
Depositing User: ABDULLAH BIN SABUDIN -
Date Deposited: 15 Apr 2025 16:10
Last Modified: 15 Apr 2025 16:10
URI: https://eprints.ums.edu.my/id/eprint/43511

Actions (login required)

View Item View Item