Machine Learning Based Sentiment Analysis for Text Messages  
  Authors : Abhishek Bhagat; Akash Sharma; Sarat Kr. Chettri

 

People use online platforms such as Facebook, Twitter, etc. for social networking and share their opinions, feelings or beliefs with others. Sharing is done by posts on these platforms. Sentiment analysis or opinion mining of these posts using machine learning techniques is of great significance. The analysis is generally carried out with sentiment, subjectivity analysis or polarity calculations. In this study, we performed a sentiment analysis of text messages using supervised machine learning techniques. They are mainly online product reviews, general tweets in Tweeter and movie reviews. Messages are pre-processed and then used three different machine learning techniques, namely Naïve Bayes, Decision Tree and Support Vector Machine (SVM) for sentiment analysis.

 

Published In : IJCAT Journal Volume 7, Issue 6

Date of Publication : June 2020

Pages : 103-109

Figures :04

Tables :04

 

 

 

Abhishek Bhagat : is currently pursuing his BTech (CSE) from the Department of CSE, School of Technology, Assam Don Bosco University. Currently, he is in his final year. His areas of interest are Machine Learning, Big data analytics and Natural Language Processing.

Akash Sharma : is currently pursuing his BTech (CSE) from the Department of CSE, School of Technology, Assam Don Bosco University. He is a final year student at present. His areas of interest are Machine Learning, Internet of Things (IoT) and Natural Language Processing (NLP).

Dr. Sarat Kr. Chettri : is an Assistant Professor in the Department of Computer Applications, School of Technology, Assam Don Bosco University. He has made several publications in international journals and conferences. His research area includes data science, machine learning and Internet of Things (IoT).

 

 

 

 

 

 

 

Machine Learning, Natural Language Processing, Sentiment Analysis, Twitter

 

 

 

 

 

 

 

 

 

 

 

 

 

 

In this paper, text data from product reviews, general tweets and movie reviews are taken into account to assess the polarity (positive or negative) of messages or tweets. We used the classification algorithms namely SVM, Naïve Bayes and decision tree. We evaluated our models on the basis of metrics; classification accuracy, precision, recall, F1-score, and ROC curve. After evaluating the developed classifiers, we find that the results obtained from the Decision Tree and SVM have a lower mean square error or a higher accuracy with most of the datasets and are considered to be good classifiers. We find our work to be unique, as we have attempted in our study to provide an overview of the various methods used in the sentiment analysis of text data. We have also built and compared three different classifiers using machine learning techniques to five different datasets of varying sizes and domains.

 

 

 

 

 

 

 

 

 

[1] P. Nivaashini, M., Soundariya, R. S, Kodiieswari, A. & Thangaraj, "SMS Spam Detection Using Neural Network Classifier," Int. J. Pure Appl. Math., vol. 119, no. 18, pp. 2425-2436, 2018. [2] A. Mizuki, T. Matsumoto, T. Uemura, and S. Kichimi, "Improving SMS Processing Power for the Increasing Smartphone Demand," NTT DOCOMO Tech. J., vol. 14, no. 4, pp. 60-62, 2013. [3] V. K. Katankar, "Short Message Service using SMS Gateway," Int. J. Comput. Sci. Eng., vol. 2, no. 5, pp. 1487-1491, 2010. [4] N. Choudhary and A. K. Jain, "Towards filtering of SMS Spam Messages using Machine Learning-Based Technique," in Communications in Computer and Information Science, 2017, vol. 712, pp. 18-30. [5] W. N. Gansterer, A. G. K. Janecek, and R. Neumayer, "Spam filtering based on latent semantic indexing," in Survey of Text Mining II: Clustering, Classification, and Retrieval, 2008, pp. 165-183. [6] M. Gupta, A. Bakliwal, S. Agarwal, and P. Mehndiratta, "A Comparative Study of Spam SMS Detection Using Machine Learning Classifiers," in 2018 11th International Conference on Contemporary Computing, IC3 2018, 2018, pp. 1-7. [7] M. Abdulhamid, M. Shafie, A. Latiff, H. Chiroma, and O. Osho, "A Review on Mobile SMS Spam Filtering Techniques A Review on Mobile SMS Spam Filtering Techniques," no. February, 2017. [8] T. M. & Mahmoud and A. M. Mahfouz, "SMS Spam Filtering Technique Based on Artificial Immune System," Int. J. Comput. Sci. Issues, vol. 9, no. 2, pp. 589-597, 2012. [9] N. Chaudhari, P. Jayvala, and P. Vinitashah, "Survey on Spam SMS filtering using Data mining Techniques," Ijarcce, vol. 5, no. 11, pp. 193-195, 2016. [10] A. K. Uysal, S. Gunal, S. Ergin, and E. S. Gunal, "The Impact of Feature Extraction and Selection on SMS Spam Filtering," Elektron. IR ELEKTROTECHNIKA, vol. 19, no. 5, pp. 67-72, 2014. [11] T. Subramaniam, H. A. Jalab, and A. Y. Taqa, "Overview of textual anti-spam filtering techniques," Int. J. Phys. Sci., vol. 5, no. 12, pp. 1869-1882, 2010. [12] A. K. Uysal, S. Gunal, S. Ergin, and E. S. Gunal, "The impact of feature extraction and selection on SMS spam filtering," Elektron. ir Elektrotechnika, vol. 19, no. 5, pp. 67-72, 2013. [13] R. Kaur and R. Rajput, "Face recognition and its various techniques : a review," Int. J. Sci. Eng. Technol. Res., vol. 2, no. 3, pp. 670-675, 2013. [14] S. Telgaonkar, A. H & Deshmukh, "Dimensionality Reduction and Classification through PCA and LDA," Int. J. Comput. Appl., vol. 122, no. 17, pp. 4-8, 2015. [15] D. Suleiman and G. Al-Naymat, "SMS Spam Detection using H2O Framework SMS Spam Detection using H2O Framework," in Procedia Computer Science, 2017, vol. 113, pp. 154-161. [16] M. Ramabai, "Spam Detection using NLP Techniques," Int. J. Recent Technol. Eng., vol. 8, no. 2, pp. 2423-2426, 2019. [17] H. H. Mansoor and S. H. Shaker, "Using classification techniques to SMS spam filter," Int. J. Innov. Technol. Explor. Eng., vol. 8, no. 12, pp. 1734-1739, 2019. [18] A. S. Rajput, J. S. Sohal, and V. Athavale, "Email Header Feature Extraction using Adaptive and Collaborative approach for Email Classification," Int. J. Innov. Technol. Explor. Eng., vol. 8, no. 7, pp. 158-164, 2019. [19] R. K. Kaliyar, P. Narang, and A. Goswami, "SMS Spam Filtering on Multiple Background Datasets Using Machine Learning Techniques: A Novel Approach," in Proceedings of the 8th International Advance Computing Conference, IACC 2018, 2018, pp. 59-65. [20] O. F. Kaya, Y., & Ertugrul, "A Novel Feature Extraction Approach in SMS Spam Filtering for Mobile Communication: One-Dimensional Ternary Patterns," Secure. Commun. Networks, vol. 9, pp. 4680-4690, 2016. [21] O. O. Abayomi-alli, S. A. Onashoga, and A. S. Sodiya, "A Critical Analysis of Existing SMS SPAM Filtering Approaches," in 1st International Conference on Applied Information Technology, 2015, pp. 211-220.