Skip to content

weshouman/tut-py-irtx

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

72 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

tut-py-irtx

Implementation of Information Retrieval and Text Mining algorithms including:

  • Indexers:
    • Inverted
    • KGram
  • Boolean retrieval
  • WildCard retrieval
  • Distance calculation
  • Ranking based retrieval (cosine-similarity and tf-idf)
  • Perceptron classification
  • Multiple confusion matrix stats
  • KMeans Clustering, with RSS based optimization

Contribution Style

  • The tests are run using xmlrunner (following the unittest style).

  • The documentation style is NumPy/SciPy Docstrings.

  • Extensive Debugging logging.debug() calls are commented.

About

Information Retrieval and Text Mining Tutorials

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published