Toxic Comment Classification using Deep Learning

Main Article Content

B. Ramesh Naidu
Naresh Tangudu
Chandra Sekhar
K. Kavitha
B. V. Ramana
P.Venkateswarlu Reddy
Jayavardhanarao Sahukaru
Raj Ganesh Lopinti

Abstract

Online Conversation media serves as a means for individuals to engage, cooperate, and exchange ideas; however, it is also considered a platform that facilitates the spread of hateful and offensive comments, which could significantly impact one's emotional and mental health. The rapid growth of online communication makes it impractical to manually identify and filter out hateful tweets. Consequently, there is a pressing need for a method or strategy to eliminate toxic and abusive comments and ensure the safety and cleanliness of social media platforms. Utilizing LSTM, Character-level CNN, Word-level CNN, and Hybrid model (LSTM + CNN) in this toxicity analysis is to classify comments and identify the different types of toxic classes by means of a comparative analysis of various models. The neural network models utilized for this analysis take in comments extracted from online platforms, including both toxic and non-toxic comments. The results of this study can contribute towards the development of a web interface that enables the identification of toxic and hateful comments within a given sentence or phrase, and categorizes them into their respective toxicity classes.

Article Details

How to Cite
Naidu, B. R. ., Tangudu, N. ., Sekhar, C. ., Kavitha, K. ., Ramana, B. V., Reddy, P. . ., Sahukaru, J. ., & Lopinti, R. G. . (2023). Toxic Comment Classification using Deep Learning. International Journal on Recent and Innovation Trends in Computing and Communication, 11(7), 93–104. https://doi.org/10.17762/ijritcc.v11i7.7834
Section
Articles

References

Ayush kumar, Pratik Kumar (2021). “Investigating Bias in Automatic Toxic Comment Detection: An Empirical Study”. arXiv:2108.06487 cs.CL]

Georgios Patoulidid, Jonas Bokstaller (2021). “MAygual Zagidullinaodel Bias in NLP Application to Hate Speech classification using transfer learning techniques”. arXiv:2109.09725v4 cs.CL]

Zhixue Zhao, Ziqi Zhang, Frank Hopfgartner (2021). “A comparative Study of using Pre-trained Language Models for Toxic Comment Classification”. IW3C2.

Ashwin Geet, Irina Illin, Dominique Fohr (2021). “Classification of Hate Speech Using Deep Neural Networks”. CERIST 25(01). Hal-03101938

Mihir Gada, Kaustubh Damania, Smita Sankhe (2021). “Cyberbullying Detection Using LSTM-CNN architecture and its applications”. IEEE 978-7281-5875-4.

Kunfu Wang, Pengyi Zhang and Jian Su (2020). “A Text Classification Method Based on the Merge-LSTM-CNN Model”. ICNISC journal of Physics: conference Series.

Muhammad Abubakar, Aminu Tukur, Usman Bukar usman (2020). “An Improved Multi-labeled LSTM Toxic Comment Classification”. Journal of Applied Sciences, Information, and computing Vo1ume 1, Number 2.

An Intelligent Metaheuristic Optimization with Deep Convolutional Recurrent Neural Network Enabled Sarcasm Detection and Classification Model, International Journal of Advanced Computer Science and Applications(IJACSA), Vol. 13, No. 2, 2022 ..

Gunjal, M. B. ., & Sonawane, V. R. . (2023). Multi Authority Access Control Mechanism for Role Based Access Control for Data Security in the Cloud Environment. International Journal of Intelligent Systems and Applications in Engineering, 11(2s), 250 –. Retrieved from https://ijisae.org/index.php/IJISAE/article/view/2623

Vidyullatha, Satya Narayan Padhy, Javvaji Geetha Priya (2020). “Identification and Classification of Toxic Comment Using Machine Learning Methods”. Turkish Journal of Computer and Mathematics Education. Volume 12, Number 9.

Sara Zaheri, Jeff Leath, David Stroud (2020). “Toxic Comment Classification”. SMU Data science Review. Volume 3, Number 1.

Sergey Morzhov (2020) “Avoiding Unintended Bias in Toxicity Classification with neural networks”. 26th Conference of fruct association.

Krishna Dubey, Rahul Nair, Mohd. Usman Khan (2020). “Toxic Comment Detection using LSTM”. Third International conference on ICAECC.

Hao Li, Weiquan Mao, Hanyuan Liu (2019). “Toxic Comment Detection and Classification”. LREC.

Nayan Banik, Md. Hasan Hanfizur Rahman (2019). “Toxicity Detection on Bengali Social Media Comments using Supervised Models.”. ICIET. Volume 23, Number 24.

Mladen Karan and Jan Snadjer (2019). “Preemptive Toxic Language Detection in Wikipedia Comments Using Thread-Level Context”. Association for Computational Linguistics.

B. Vidgen, A. Harris, D. Nguyen, R. Tromble, S. Hale, and H. Margetts (2019). “Challenges and frontiers in abusive content detection”. Association for Computational Linguistics.

S. V. Georgakopoulos, S. K. Tasoulis, A. G. Vrahatis, and V. P. Plagianakos (2019), “Convolutional neural networks for twitter text toxicity analysis”.

Alejandro Garcia, Machine Learning for Customer Segmentation and Targeted Marketing , Machine Learning Applications Conference Proceedings, Vol 3 2023.

Mai Ibrahim, Marwan Torki and Nagwa El-Makky (2018). “Imbalanced Toxic Comments Classification using Data Augmentation and Deep Learning”. 17th IEEE International Conference on Machine Learning and Applications. In INNS Big Data and Deep Learning conference. Springer, 2019, pp. 370–379.

Spiros V. Georgakopoulos, Sotiris K. Tasoulis, Aristidis G. Vrahatis, Vassilis P. Plagianakos (2018). “Convolutional Neural Network for Toxic Comment Classification”. Association for Computing Machinery. SETN.

Kehn Wang, Jiaxi, Hongjun Wu (2018). “A survey of Toxic Comment Classification Methods”.

Jiarui Zhang, Yingxiang Li, Juan Tian, Tongyan Li (2018). “LSTM-CNN Hybrid Model for Text Classification”. Third Conference, IAEAC.

Z. Z. David Robinson and J. Tepper (2018). “Hate speech detection on twitter: Feature engineering vs feature selection”. The Semantic Web: ESWC 2018 Satellite Events, pages 46–49.

van Aken, B., Risch, J., Krestel, R., & Löser, A. (2018). “Challenges for toxic comment classification: An in-depth error analysis”. arXiv preprint arXiv:1809.07572.

Mujahed A. Saif, Alexander N. Medvedev, Maxim A. Medvedev, TodorkaAtanasova (2018). “Classification of Online Toxic Comments Using the Logistic Regression and Neural Networks Models”.

Mr. B. Naga Rajesh. (2019). Effective Morphological Transformation and Sub-pixel Classification of Clustered Images. International Journal of New Practices in Management and Engineering, 8(01), 08 - 14. https://doi.org/10.17762/ijnpme.v8i01.74

Zhihao Guo, Shangquan Sun, Yicheng Zhou (2017). “Classification of Comment Toxicity”. 17th Conference, Association for Computing Machinery.

Moore, B., Clark, R., Martinez, J., Rodriguez, A., & Rodriguez, L. Anomaly Detection in Internet of Things (IoT) Data Streams. Kuwait Journal of Machine Learning, 1(4). Retrieved from http://kuwaitjournals.com/index.php/kjml/article/view/151

M. G. V. V. Pinkesh Badjatiya, Shashank Gupta (2017). “Deep learning for hate speech detection in tweets”. WWW’17’, pages 759– 760.

K..Kavitha, Ch.Suneetha, Efficient Sentimental Analysis Using Hybrid Deep Transfer Learning Neural Network, International Journal of Trends and Technology(IJETT),ISSN: 2231-5381, Vol. 70 .Issue 10, 155-165, October-2022

Waseem, Z., Davidson, T., Warmsley, D. and Weber (2017). “Understanding Abuse: A Typology of Abusive Language Detection Subtasks”. In Proceedings of the First Workshop on Abusive Language Online (pp. 78-84).

Nobata, C., Tetreault, J., Thomas, A., Mehdad, Y. and Chang, Y. (2016). Abusive language detection in online user content. In Proceedings of the 25th international conference on world wide web (pp. 145-153).

Tomas Mikolov, Ilya Sutskever, Kai Chen, Greg S Corrado, and Jeff Dean. (2013). Distributed Representations of Words and Phrases and their Compositionality. In Advances in Neural Information Processing Systems 26, C. J. C. Burges, L. Bottou, M. Welling, Z. Ghahramani, and K. Q. Weinberger (Eds.). Curran Associates, Inc., 3111–3119.

M Ramakrishna Murty, JVR Murthy, and Prasad Reddy PVGD. (2011). Text Document Classification based-on Least Square Support Vector Machines with Singular Value Decomposition. International Journal of Computer Applications