PENDETEKSIAN KESALAHAN KETIK DENGAN DAMERAU-LEVENSHTEIN DISTANCE DAN TRIE
Main Article Content
Abstract
Typographical errors are commonly found in text. Many applications implement a spell checking feature to detect and correct typographical errors. Spell checking requires an algorithm to calculate the similarity of two strings. This study compares Damerau-Levenshtein Distance and Trie in checking and correcting typographical errors in the names of function calls in source code based on the processing time and accuracy of the spelling correction. Accuracy is calculated by classifying the results of the spelling correction in a Confusion Matrix. This study shows that Trie is faster than Damerau-Levenshtein Distance, in which Trie’s processing time took 10.07% of Damerau-Levenshtein Distance’s. However, Damerau-Levenshtein Distance can correct more types of typographical errors than Trie, yielding an accuracy of 89.7% compared to 45.71%.
Article Details

This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.
This work is licensed under a Jurnal Komunikasi Creative Commons Attribution-ShareAlike 4.0 International License.
References
[1] D. Chicco and G. Jurman, ‘The advantages of the Matthews correlation coefficient (MCC) over F1 score and accuracy in binary classification evaluation’, BMC genomics, vol. 21, pp. 1–13, 2020.
[2] Kvzaytsev, ‘Title’, GitHub, https://kvzaytsev.github.io/trie-visualizer/.
[3] L. Boytsov, ‘Indexing methods for approximate dictionary searching: Comparative analysis’, Journal of Experimental Algorithmics, vol. 16, p. 1, 2011.