Selected Machine Learning algorithms for natural language processing and semantic analysis in Golang
-
Updated
May 11, 2021 - Go
Selected Machine Learning algorithms for natural language processing and semantic analysis in Golang
Interesting (non-cryptographic) hashes implemented in pure Python.
A simple implementation of simhash algorithm by java.
Removes most frequent words (stop words) from a text content. Based on a Curated list of language statistics.
Dynatrace hash library for Java
Locality Sensitive Hashing
Simhash implementation in Javascript
A fast python implementation of the SimHash algorithm.
semantic-sh is a SimHash implementation to detect and group similar texts by taking power of word vectors and transformer-based language models (BERT).
SuperMinHash: A New Minwise Hashing Algorithm for Jaccard Similarity Estimation, Simhash and SimhashIndex
基于springboot和Google开源simhash算法实现的作业查重/抄袭检测/文本相似度分析可视化系统,,集成jplag、MOSS、singleCloud工具套件进行多方位查重 Ref: https://github.com/ALuShu/checksystem
Remove duplicate documents/videos/images via popular algorithms such as SimHash, SpotSig, Shingling, etc.
A library for cosine similarity & simhash calculation
Add a description, image, and links to the simhash topic page so that developers can more easily learn about it.
To associate your repository with the simhash topic, visit your repo's landing page and select "manage topics."