HausaNLP at SemEval-2023 Task 12: Leveraging African Low Resource TweetData for Sentiment Analysis

Salahudeen, S.A. and Lawan, F.I. and Wali, A.M. and Imam, A.A. and Shuaibu, A.R. and Yusuf, A. and Rabiu, N.B. and Bello, M. and Adamu, S.U. and Aliyu, S.M. and Gadanya, M.S. and Muaz, S.A. and Ahmad, M.S. and Abdullahi, A. and Jamoh, A.Y. (2023) HausaNLP at SemEval-2023 Task 12: Leveraging African Low Resource TweetData for Sentiment Analysis. In: UNSPECIFIED.

Full text not available from this repository.
Official URL: https://www.scopus.com/inward/record.uri?eid=2-s2....

Abstract

We present the findings of SemEval-2023 Task 12, a shared task on sentiment analysis for low-resource African languages using Twitter dataset. The task featured three subtasks; subtask A is monolingual sentiment classification with 12 tracks which are all monolingual languages, subtask B is multilingual sentiment classification using the tracks in subtask A and subtask C is a zero-shot sentiment classification. We present the results and findings of subtask A, subtask B and subtask C. We also release the code on github. Our goal is to leverage low-resource tweet data using pretrained Afro-xlmr-large, AfriBERTa-Large, Bert-base-arabic-camelbert-da-sentiment (Arabic-camelbert), Multilingual-BERT (mBERT) and BERT models for sentiment analysis of 14 African languages. The datasets for these subtasks consists of a gold standard multi-class labeled Twitter datasets from these languages. Our results demonstrate that Afroxlmr-large model performed better compared to the other models in most of the languages datasets. Similarly, Nigerian languages: Hausa, Igbo, and Yoruba achieved better performance compared to other languages and this can be attributed to the higher volume of data present in the languages. © 2023 Association for Computational Linguistics.

Item Type: Conference or Workshop Item (UNSPECIFIED)
Additional Information: cited By 1; Conference of 17th International Workshop on Semantic Evaluation, SemEval 2023, co-located with the 61st Annual Meeting of the Association for Computational Linguistics, ACL 2023 ; Conference Date: 13 July 2023 Through 14 July 2023; Conference Code:192857
Uncontrolled Keywords: C (programming language); Classification (of information); Large dataset; Semantics; Social networking (online); Zero-shot learning, African languages; Gold standards; High volumes; Large models; Nigerians; Performance; Sentiment analysis; Sentiment classification; Subtask, Sentiment analysis
Depositing User: Mr Ahmad Suhairi UTP
Date Deposited: 04 Jun 2024 14:11
Last Modified: 04 Jun 2024 14:11
URI: https://khub.utp.edu.my/scholars/id/eprint/19282

Actions (login required)

View Item
View Item