MinHash text analyzer developed for the Algorithmics course.

mariofv/DocSim

DocSim

DocSim is a text analyzer based on MinHash. It estimates the Jaccard similarity between two given texts. It was developed for the Algorithmics course of the Bachelor's Degree in Computer Engineering at the Barcelona School of Informatics of the Polytechnic University of Catalonia.
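To illustrate the idea, here is a minimal Python sketch of MinHash-based Jaccard estimation. It is not the repository's code: the function names (shingles, jaccard, minhash_signature, estimate_similarity), the word-shingle size k, the number of hash functions, and the use of seeded BLAKE2b hashes are all assumptions made for this example. It relies on the standard MinHash property that, for a random hash function, the probability that two sets share the same minimum hash value equals their Jaccard similarity.

```python
import hashlib
import re


def shingles(text, k=3):
    """Return the set of k-word shingles of a text (hypothetical helper)."""
    words = re.findall(r"\w+", text.lower())
    if len(words) < k:
        # Texts shorter than k words yield a single shingle (or none).
        return {" ".join(words)} if words else set()
    return {" ".join(words[i:i + k]) for i in range(len(words) - k + 1)}


def jaccard(a, b):
    """Exact Jaccard similarity |A ∩ B| / |A ∪ B| of two sets."""
    if not a and not b:
        return 1.0
    return len(a & b) / len(a | b)


def _seeded_hash(item, seed):
    """64-bit hash of a string; each seed simulates a different hash function."""
    data = f"{seed}:{item}".encode("utf-8")
    return int.from_bytes(hashlib.blake2b(data, digest_size=8).digest(), "big")


def minhash_signature(items, num_hashes=128):
    """Signature = minimum hash value of the set under each hash function."""
    if not items:
        raise ValueError("cannot compute a MinHash signature of an empty set")
    return [min(_seeded_hash(x, seed) for x in items)
            for seed in range(num_hashes)]


def estimate_similarity(sig_a, sig_b):
    """Fraction of signature positions that agree; estimates the Jaccard similarity."""
    if len(sig_a) != len(sig_b):
        raise ValueError("signatures must have the same length")
    matches = sum(1 for x, y in zip(sig_a, sig_b) if x == y)
    return matches / len(sig_a)
```

Under these assumptions, two texts are compared by building their shingle sets, computing one signature per text with the same num_hashes, and passing both signatures to estimate_similarity. Increasing num_hashes reduces the variance of the estimate at the cost of more hashing work.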

Structure

  • Source code of the analyzer is located in the folder src.
  • The texts used in the application testing are located in the folder data.
  • The results of different experiments are located in the folder results.

How to use it

Run the main classes located in the src folder. Some texts are analyzed by default; you are free to modify the paths in order to analyze other texts.

Documentation

The project documentation is located in DOCUMENTACIÓN DOCSIM.pdf.
