Text-based information retrieval system using Locality Sensitive Hashing.

shrishtrip/Locality-Sensitive-Hashing

Locality-Sensitive-Hashing

Text-based information retrieval system using Locality Sensitive Hashing.

  • LSH is implemented here on the musiXmatch dataset: https://millionsongdataset.com/musixmatch/
  • Three distance metrics are used:
    • Euclidean distance
    • Hamming distance
    • Angular distance
  • For each distance metric, a locality-sensitive family of hash functions is used, so that similar documents are hashed to the same buckets with high probability.
  • The union of all the buckets to which the query is hashed gives the candidates for similarity calculation.
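
The bucketing and candidate-union steps above can be sketched as follows. This is a minimal illustration, not the code in this repository: it assumes the random-hyperplane family, a common choice for angular distance, and the class name `HyperplaneLSH`, its parameters (`num_tables`, `bits_per_table`, `seed`), and the toy vectors are all hypothetical. For the other two metrics, commonly used families are bit sampling (Hamming distance) and quantized random projections (Euclidean distance); the table and union logic would stay the same.

```python
import random
from collections import defaultdict


class HyperplaneLSH:
    """Random-hyperplane LSH for angular distance (illustrative sketch only)."""

    def __init__(self, dim, num_tables=8, bits_per_table=6, seed=0):
        rng = random.Random(seed)
        # One independent set of random Gaussian hyperplanes per hash table.
        self.planes = [
            [[rng.gauss(0.0, 1.0) for _ in range(dim)]
             for _ in range(bits_per_table)]
            for _ in range(num_tables)
        ]
        # Each table maps a bucket key to the set of document ids stored there.
        self.tables = [defaultdict(set) for _ in range(num_tables)]

    def _key(self, table_idx, vec):
        # Each bit records which side of one hyperplane the vector falls on.
        return tuple(
            sum(p * v for p, v in zip(plane, vec)) >= 0.0
            for plane in self.planes[table_idx]
        )

    def add(self, doc_id, vec):
        # Store the document in one bucket per table.
        for t, table in enumerate(self.tables):
            table[self._key(t, vec)].add(doc_id)

    def candidates(self, query):
        # Union of every bucket the query hashes to, across all tables.
        found = set()
        for t, table in enumerate(self.tables):
            found |= table.get(self._key(t, query), set())
        return found


# Toy word-count vectors (made up for illustration).
index = HyperplaneLSH(dim=4)
index.add("song_a", [3, 0, 1, 0])
index.add("song_b", [0, 2, 0, 5])

# The query points in the same direction as song_a, so it lands in the
# same bucket in every table and song_a is always among the candidates.
result = index.candidates([6, 0, 2, 0])
print(result)
```

Only the returned candidates need an exact distance computation against the query, which is what makes the lookup cheaper than scanning every document. Increasing `num_tables` tends to raise recall, while increasing `bits_per_table` tends to make buckets smaller and more selective.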
