Implementation of various string similarity and distance algorithms: Levenshtein, Jaro-winkler, n-Gram, Q-Gram, Jaccard index, Longest Common Subsequence edit distance, cosine similarity ...
-
Updated
Jun 1, 2022 - Java
Implementation of various string similarity and distance algorithms: Levenshtein, Jaro-winkler, n-Gram, Q-Gram, Jaccard index, Longest Common Subsequence edit distance, cosine similarity ...
Lucene token filter that removes trailing stopwords from shingles.
Math 140 project, I only wrote the grammify method. My professor provided the rest of the code.
Add a description, image, and links to the shingles topic page so that developers can more easily learn about it.
To associate your repository with the shingles topic, visit your repo's landing page and select "manage topics."