IMPLEMENTASI METODE AGGLOMERATIVE HIERARCHICAL CLUSTERING UNTUK SISTEM REKOMENDASI FILM

Main Article Content

Vanesa Nellie
Viny Christanti Mawardi
Novario Jaya Perdana

Abstract

People can now watch movies on their cellphones or other devices using applications, in addition to watching them on television or in theaters. The user's entered keywords are used as the basis for a system that suggests movies from among the many that have appeared over time. Later, similarity between these keywords and text data, such as movie titles and descriptions, will be assessed. This recommendation system will include preprocessing, and the TF-IDF method will be used to determine the weight value. After the weight values have been determined, the grouping calculations will be performed using agglomerative hierarchical clustering. Previously, the Manhattan Distance method will be used to calculate the distance. After that, the distance that is closest can be determined. The data will be clustered according to the shortest distance once the distance calculation is complete. Following that, the system will display the grouping as a dendrogram. The data used was updated as of the date of scraping, which is November 25, 2022, and contains a total of 2467 data. The Agglomerative Hierarchical Clustering method yielded the best silhouette coefficient value, 0.5025559374455285, forming 20 clusters.

Article Details

Section
Articles

References

indy Winata; Viny Christanti Mawardi; dan Janson Hendryli, Movilizer Application with Genre and Rating Classification Using NW-KNN Method, 3rd Asia Pacific Information Technology Conference, h. 33-38, Januari 2021.

Devin Abipraya; Viny Christanti Mawardi; dan Novario Jaya Perdana, Implementasi Opinion Mining Untuk Provider Internet Menggunakan Metode Naïve Bayes, Jurnal Ilmu Komputer dan Sistem Informasi, Vol.9, No.2, 2021.

Online Edition 2009 Cambridge, Scoring, Term Weighting, and the vector Space Model, doi:10.1017/CBO9780511809071.007, ISBN 978-0-511-80907-1, h. 128.

Jiawei Han; Micheline Kamber; dan Jian Pei, Data Mining Concepts and Technique, Waltham: Morgan Kaufmann Publications, h. 457-459, 2012.

M. Nishom, “Perbandingan Akurasi Euclidean Distance, Minkowski Distance, dan Manhattan Distance pada Algoritma K-Means Clustering berbasis Chi-Square”, Jurnal Informatika: Jurnal Pengembangan IT (JPIT), Vol. 4, Nomor 1, h. 20-24, Januari 2019.

Edy Susanto; Viny Christanti Mawardi; dan Manatap Dolok Lauro, Aplikasi Clustering Berita dengan Metode K-Means dan Peringkas Berita dengan Metode Maximum Marginal Relevance, Jurnal Ilmu Komputer dan Sistem Informasi, Vol.9, No.1, h.62-68, 2021.