Multitask deep learning for sentiment analysis with sarcasm detection in bilingual code-mixed social media content

Mohd Suhairi Md Suhaimin, Adi Wibowo, Ervin Gubin Moung, Patricia Anthony, Mohd Hanafi Ahmad Hijazi

Abstract


Sentiment analysis in social media often hindered by sarcasm, which can reverse text meaning, and bilingual code-mixing, which adds complexity in non-English primary context. Existing approaches extract separate features for each language and translate them into a single language, resulting in the loss of contextual meaning and omission of crucial features. This paper proposes a multitask learning model for sentiment analysis with sarcasm detection tailored to bilingual code-mixed social media content. A hybrid feature engineering technique is integrated into a multitask deep learning architecture designed to capture the nuances of sentiment and sarcasm while addressing the complexities of processing bilingual code-mixed content. The hybrid technique combines domain-knowledge-based natural language processing (NLP) with a deep learning-based embedding approach. It includes rule-based preprocessing, normalization, spellchecking, feature extraction and selection, and feature representation. The engineered features are integrated into a multitask deep learning network using bidirectional long short-term memory (Bi-LSTM) combined with gated recurrent units (GRU). Using a public dataset that contains bilingual code-mixed social media content related to public security, our proposed model achieved a higher F1score compared to two baseline models that employ single task and multitask approaches.

Keywords


Bilingual code-mixed; Deep learning; Hybrid feature engineering; Language model; Multitasking

Full Text:

PDF


DOI: https://doi.org/10.11591/eei.v15i2.10935

Refbacks

  • There are currently no refbacks.


Creative Commons License
This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.

Bulletin of EEI Stats

Bulletin of Electrical Engineering and Informatics (BEEI)
ISSN: 2089-3191, e-ISSN: 2302-9285
This journal is published by the Institute of Advanced Engineering and Science (IAES) in collaboration with Intelektual Pustaka Media Utama (IPMU).