PERBANDINGAN NAIVE BAYES, RANDOM FOREST, XGBOOST UNTUK KLASIFIKASI MUTU AIR
Main Article Content
Abstract
Penelitian ini bertujuan untuk membandingkan kinerja algoritma Naive Bayes, Random Forest, dan XGBoost dalam mengklasifikasikan kualitas air budidaya perikanan. Kualitas air merupakan faktor penting dalam menjaga kesehatan organisme akuatik serta mengoptimalkan produktivitas akuakultur. Penelitian ini menggunakan dataset yang mencakup beberapa parameter, seperti pH, suhu, oksigen terlarut, salinitas, dan kekeruhan, yang digunakan sebagai fitur masukan untuk memprediksi kelas kualitas air secara keseluruhan. Data dibagi menjadi dua bagian, yaitu data pelatihan dan data pengujian. Hasil eksperimen menunjukkan bahwa algoritma Random Forest mencapai akurasi klasifikasi tertinggi sekaligus waktu pemrosesan tercepat dibandingkan dengan XGBoost dan Naive Bayes. Algoritma XGBoost menghasilkan performa yang kompetitif dengan sensitivitas yang sedikit lebih tinggi terhadap variasi data, sedangkan Naive Bayes menunjukkan efisiensi komputasi yang tinggi namun dengan akurasi yang lebih rendah. Secara keseluruhan, Random Forest memberikan kinerja yang paling konsisten dan akurat dalam mengklasifikasikan kualitas air budidaya, sehingga dinilai paling cocok untuk diterapkan pada sistem pemantauan kualitas air secara real-time.
Downloads
Article Details
Section

This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.
This work is licensed under a Jurnal Komunikasi Creative Commons Attribution-ShareAlike 4.0 International License.
How to Cite
References
[1] T. Li, J. Lu, J. Wu, Z. Zhang, and L. Chen, “Predicting Aquaculture Water Quality Using Machine Learning Approaches,” Water (Switzerland), vol. 14, no. 18, Sep. 2022, doi: 10.3390/w14182836.
[2] M. Miller, A. Kisiel, D. Cembrowska-Lech, I. Durlik, and T. Miller, “IoT in Water Quality Monitoring—Are We Really Here?,” Sensors, vol. 23, no. 2, Jan. 2023, doi: 10.3390/s23020960.
[3] S. Mamatha, P. Aakash, and N. Sravya, “Water Quality Analysis and Prediction Using Random Forest and Naive Bayes.”
[4] P. Jayaraman, K. K. Nagarajan, P. Partheeban, and V. Krishnamurthy, “Critical review on water quality analysis using IoT and machine learning models,” International Journal of Information Management Data Insights, vol. 4, no. 1, Apr. 2024, doi: 10.1016/j.jjimei.2023.100210.
[5] M. Y. Shams, A. M. Elshewey, E. S. M. El-kenawy, A. Ibrahim, F. M. Talaat, and Z. Tarek, “Water quality prediction using machine learning models based on grid search method,” Multimed Tools Appl, vol. 83, no. 12, pp. 35307–35334, Apr. 2024, doi: 10.1007/s11042-023-16737-4.
[6] W. Chen, D. Xu, B. Pan, Y. Zhao, and Y. Song, “Machine Learning-Based Water Quality Classification Assessment,” Water (Switzerland), vol. 16, no. 20, Oct. 2024, doi: 10.3390/w16202951.
[7] N. Radhakrishnan and A. S. Pillai, “Comparison of Water Quality Classification Models using Machine Learning,” Institute of Electrical and Electronics Engineers (IEEE), Jul. 2020, pp. 1183–1188. doi: 10.1109/icces48766.2020.9137903.
[8] M. S. Islam Khan, N. Islam, J. Uddin, S. Islam, and M. K. Nasir, “Water quality prediction and classification based on principal component regression and gradient boosting classifier approach,” Journal of King Saud University - Computer and Information Sciences, vol. 34, no. 8, pp. 4773–4781, Sep. 2022, doi: 10.1016/j.jksuci.2021.06.003.
[9] M. H. Al-Adhaileh and F. W. Alsaade, “Modelling and prediction of water quality by using artificial intelligence,” Sustainability (Switzerland), vol. 13, no. 8, Apr. 2021, doi: 10.3390/su13084259.
[10] D. Danuri and M. Mohd Pozi, “Machine Learning Approaches for Fish Pond Water Quality Classification: Random Forest, Gaussian Naive Bayes, and Decision Tree Comparison,” European Alliance for Innovation n.o., Feb. 2024. doi: 10.4108/eai.21-9-2023.2342964.
[11] Zhang, X., Li, Y., Chen, Z., & Wu, H. (2024). Monitoring water quality parameters of freshwater aquaculture ponds using UAV-based multispectral images. Ecological Indicators, 162, 111234. https://doi.org/10.1016/j.ecolind.2024.111234
[12] X. Liu et al., “Monitoring water quality parameters of freshwater aquaculture ponds using UAV-based multispectral images,” Ecol Indic, vol. 167, Oct. 2024, doi: 10.1016/j.ecolind.2024.112644.
[13] A. Elmotawakkil, N. Enneya, S. K. Bhagat, M. M. Ouda, and V. Kumar, “Advanced machine learning models for robust prediction of water quality index and classification,” Journal of Hydroinformatics, vol. 27, no. 2, pp. 299–319, Feb. 2025, doi: 10.2166/hydro.2025.290.
[14] R. R S and E. Antony, “Water Quality Analysis of Different Water Sources in Kerala, India,” International Journal of Agriculture & Environmental Science, vol. 9, no. 4, pp. 6–11, Aug. 2022, doi: 10.14445/23942568/ijaes-v9i4p102.
[15] Dewi, D. A., Wei, A. S., Lin, L. C., & Heng, C. D. (2024). Water quality prediction using Random Forest algorithm and optimization. Journal of Data Science and Environmental Informatics.
[16] “Aquaculture - Water Quality Dataset.” Accessed: Oct. 08, 2025. [Online]. Available: https://data.mendeley.com/datasets/y78ty2g293/1
[17] X. Yan, T. Zhang, W. Du, Q. Meng, X. Xu, and X. Zhao, “A Comprehensive Review of Machine Learning for Water Quality Prediction over the Past Five Years,” Jan. 01, 2024, Multidisciplinary Digital Publishing Institute (MDPI). doi: 10.3390/jmse12010159.
[18] P. Patel, “Machine Learning Applications in Agriculture: A Software Engineering Perspective,” Jul. 13, 2025. doi: 10.31224/4824.
[19] D. B. Olawade, O. Z. Wada, A. O. Ige, B. I. Egbewole, A. Olojo, and B. I. Oladapo, “Artificial intelligence in environmental monitoring: Advancements, challenges, and future directions,” Dec. 01, 2024, Elsevier B.V. doi: 10.1016/j.heha.2024.100114.
[20] M. Jemeļjanova, A. Kmoch, and E. Uuemaa, “Adapting machine learning for environmental spatial data - A review,” Jul. 01, 2024, Elsevier B.V. doi: 10.1016/j.ecoinf.2024.102634.
[21] Z. Zhou, C. Qiu, and Y. Zhang, “A comparative analysis of linear regression, neural networks and random forest regression for predicting air ozone employing soft sensor models,” Sci Rep, vol. 13, no. 1, Dec. 2023, doi: 10.1038/s41598-023-49899-0.
[22] M. A. Pienaar and K. Naidoo, “Classification and predictive models using supervised machine learning: A conceptual review,” Southern African Journal of Critical Care, p. e2937, May 2025, doi: 10.7196/sajcc.2025.v411.2937.
[23] Chen, W., Xu, D., Pan, B., Zhao, Y., & Song, Y. (2024). Machine learning-based water quality classification assessment. Environmental Informatics Letters, 5(1), 23–32.*
[24] D. Berrar, “Bayes’ theorem and naive bayes classifier,” in Encyclopedia of Bioinformatics and Computational Biology: ABC of Bioinformatics, vol. 1–3, Elsevier, 2018, pp. 403–412. doi: 10.1016/B978-0-12-809633-8.20473-1.
[25] A. A. Khan, O. Chaudhari, and R. Chandra, “A review of ensemble learning and data augmentation models for class imbalanced problems: Combination, implementation and evaluation,” Jun. 15, 2024, Elsevier Ltd. doi: 10.1016/j.eswa.2023.122778.
[26] M. Chen, Q. Liu, S. Chen, Y. Liu, C. H. Zhang, and R. Liu, “XGBoost-Based Algorithm Interpretation and Application on Post-Fault Transient Stability Status Prediction of Power System,” IEEE Access, vol. 7, pp. 13149–13158, 2019, doi: 10.1109/ACCESS.2019.2893448.
[27] I. Domor Mienye and Y. Sun, “Date of publication xxxx 00, 0000, date of current version xxxx 00, 0000. A Survey of Ensemble Learning: Concepts, Algorithms, Applications, and Prospects”, doi: 10.1109/ACCESS.2017.DOI.
[28] M. A. Ganaie, M. Hu, A. K. Malik, M. Tanveer, and P. N. Suganthan, “Ensemble deep learning: A review,” Oct. 01, 2022, Elsevier Ltd. doi: 10.1016/j.engappai.2022.105151.