Bi-Lingual (English, Punjabi) Sarcastic Sentiment Analysis by using Classification Methods
AIshana Attri1, Maitreyee Dutta2
1Ishana Attri, percussing M.E in computer science and engineering From Nitttr, Chandigarh also completed her B.tech from Himachal Pradesh Technical University, Hamirpur, (Himachal Pradesh) India.
2Dr. Maitreyee Dutta. Department of Computer Science & Engineering, National Institute of Technical Teachers Training and Research, Sector 26, Chandigarh – 160019, INDIA
Manuscript received on 30 June 2019 | Revised Manuscript received on 05 July 2019 | Manuscript published on 30 July 2019 | PP: 1374-1379 | Volume-8 Issue-9, July 2019 | Retrieval Number: I8053078919/19©BEIESP | DOI: 10.35940/ijitee.I8053.078919
Open Access | Ethics and Policies | Cite | Mendeley | Indexing and Abstracting
© The Authors. Blue Eyes Intelligence Engineering and Sciences Publication (BEIESP). This is an open access article under the CC-BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/)
Abstract: Sentiment analysis is one of the heated topic in the field of text mining. As the social media data is increased day by day the main need of the data scientists is to classify the data so that it can be further used for decision making or knowledge discovery. Now –a-days everything and everyone available online so to check the latest trends in business or in daily life one must consider the online data. The main focus of sentiment analysis is to focus on positive or negative comments so that a well define picture is created that what is trending or not but the sarcasm manipulates the data as in sarcastic comment negative comment consider as positive because of the presence of positive words in the comment or data so it is necessary to detect the sarcasm in online data . The data on social media is available in various languages so sentiment analysis in regional languages is also a main step . In the proposed work we focus on two languages i.e Punjabi and English. Here we use deep learning based neural networks for the sarcasm detection in English as well as Punjabi language. In the proposed work we consider three datasets i.e. balanced English dataset, Balanced Punjabi Dataset and unbalanced Punjabi dataset. We used six different models to check the accuracy of the classified data the models we used are LSTM with word embedding layer, BiLSTM with , LSTM+LSTM, BiLSTM+BiLSTM, LSTM+BiLSTM, CNN respectively. LSTM provide better accuracy for balanced Punjabi and English dataset i.e. 95.63% and 94.17% respectively. The accuracy for unbalanced Punjabi dataset is provided by BiLSTM i.e.96.31%.
Keywords: Sarcasm detection, LSTM, BiLSTM, sentiment analysis, multilingual
Scope of the Article: Predictive Analysis