KLASIFIKASI CYBERBULLYING TWEET MENGGUNAKAN ALGORITMA SVM DAN XGBOOST

Main Article Content

Felix Fernando

Abstract

This study aims to classify cyberbullying tweets using two machine learning approaches, namely the SVM and XGBoost algorithms. The input data used for analysis consists of randomly collected tweets that have been labeled, feature extraction, and model training and testing using both algorithms. The results of the experiment show that XGBoost performs slightly better in classifying cyberbullying tweets than SVM. These findings contribute to the development of classification methods for detecting cyberbullying on social media, which can help mitigate its negative impact. This study also shows the potential use of the XGBoost algorithm in the context of cyberbullying detection on various other social media platforms such as TikTok and Instagram.

Article Details

Section

Articles

References

[1] Zhu, C., Huang, S., Evans, R. and Zhang, W., 2021. Cyberbullying among adolescents and children: a comprehensive review of the global situation, risk factors, and preventive measures. Frontiers in public health, 9, p.634909.

[2] Barlett, C.P., Simmers, M.M., Roth, B. and Gentile, D., 2021. Comparing cyberbullying prevalence and process before and during the COVID-19 pandemic. The journal of social psychology, 161(4), pp.408-418.

[3] Syah, R. and Hermawati, I., 2018. Upaya pencegahan kasus cyberbullying bagi remaja pengguna media sosial di Indonesia. Jurnal Penelitian Kesejahteraan Sosial, 17(2), pp.131-146.

[4] Febriana, T. and Budiarto, A., 2019, August. Twitter dataset for hate speech and cyberbullying detection in Indonesian language. In 2019 International Conference on Information Management and Technology (ICIMTech) (Vol. 1, pp. 379-382). IEEE.

[5] Jubaidi, M. and Fadilla, N., 2020. Pengaruh Fenomena Cyberbullying Sebagai Cyber-Crime di Instagram dan Dampak Negatifnya. Shaut Al-Maktabah: Jurnal Perpustakaan, Arsip dan Dokumentasi, 12(2), pp.117-134.

[6] J. Wang, K. Fu, C.T. Lu, “SOSNet: A Graph Convolutional Network Approach to Fine-Grained Cyberbullying Detection,” Proceedings of the 2020 IEEE International Conference on Big Data (IEEE BigData 2020), December 10-13, 2020.

[7] Rahman, O.H., Abdillah, G. and Komarudin, A., 2021. Klasifikasi Ujaran Kebencian pada Media Sosial Twitter Menggunakan Support Vector Machine. Jurnal RESTI (Rekayasa Sistem dan Teknologi Informasi), 5(1), pp.17-23.

[8] Mascio, A., Kraljevic, Z., Bean, D., Dobson, R., Stewart, R., Bendayan, R. and Roberts, A., 2020. Comparative analysis of text classification approaches in electronic health records. arXiv preprint arXiv:2005.06624.

[9] Geofany, N. and Liza, R., 2021, October. Klasifikasi Sentimen Tweet Pada Twitter Terhadap Pembelajaran E-Learning Menggunakan Metode k-Nearest Neighbor. In Seminar Nasional Teknologi Informasi & Komunikasi (Vol. 1, No. 1, pp. 380-385).

[9] Khaira, U., Johanda, R., Utomo, P.E.P. and Suratno, T., 2020. Sentiment analysis of cyberbullying on twitter using SentiStrength. Indones. J. Artif. Intell. Data Min, 3(1), p.21.

[11] Irmanda, H.N. and Astriratma, R., 2020. Klasifikasi Jenis Pantun Dengan Metode Support Vector Machines (SVM). Jurnal RESTI (Rekayasa Sistem dan Teknologi Informasi), 4(5), pp.915-922.

[12] Rosid, M.A., Fitrani, A.S., Astutik, I.R.I., Mulloh, N.I. and Gozali, H.A., 2020, June. Improving text preprocessing for student complaint document classification using sastrawi. In IOP Conference Series: Materials Science and Engineering (Vol. 874, No. 1, p. 012017). IOP Publishing.

[13] Fikriani, A., Asror, I. and Murti, Y.R., 2019. Klasifikasi Kepribadian Berdasarkan Data Twitter dengan Menggunakan Metode Support Vector Machine. eProceedings of Engineering, 6(3). Application of Computer Sciences, 1(1), pp.27-32.