weshouman / tut-py-irtx Public


Information Retrieval and Text Mining Tutorials

tut-py-irtx

Implementations of Information Retrieval and Text Mining algorithms, including:

- Indexers:
  - Inverted
  - KGram
- Boolean retrieval
- Wildcard retrieval
- Distance calculation
- Ranking-based retrieval (cosine similarity and tf-idf)
- Perceptron classification
- Multiple confusion matrix stats
- KMeans clustering, with RSS-based optimization
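
As a rough illustration of the first items in the list, an inverted index with Boolean AND retrieval might be sketched as follows. The function names, the tokenization (lowercasing and whitespace splitting), and the toy corpus are assumptions made for this example; they are not taken from the package's actual API.

```python
from collections import defaultdict


def build_inverted_index(docs):
    """Map each lowercased token to the sorted list of document IDs containing it."""
    postings = defaultdict(set)
    for doc_id, text in docs.items():
        for token in text.lower().split():
            postings[token].add(doc_id)
    return {term: sorted(ids) for term, ids in postings.items()}


def boolean_and(index, *terms):
    """Return the sorted IDs of documents that contain every query term."""
    if not terms:
        return []
    # Start from the postings of the first term, then intersect with the rest.
    result = set(index.get(terms[0].lower(), []))
    for term in terms[1:]:
        result &= set(index.get(term.lower(), []))
    return sorted(result)


# Hypothetical toy corpus, not taken from the repository.
docs = {
    1: "information retrieval with inverted indexes",
    2: "text mining and clustering",
    3: "retrieval and ranking of text",
}
index = build_inverted_index(docs)
print(boolean_and(index, "retrieval", "text"))  # [3]
```

Storing postings as sorted lists keeps the index easy to inspect; a term that is absent from the index simply yields an empty result.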

Contribution Style

The tests are run using xmlrunner (following the unittest style).
The documentation style is NumPy/SciPy Docstrings.
Extensive debugging calls to logging.debug() are left in the code but commented out.
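
A contribution following these conventions could look roughly like the sketch below. The `jaccard_distance` function, the test class, and the `test-reports` output directory are hypothetical and chosen only for illustration; the `xmlrunner` module is assumed to come from the unittest-xml-reporting package, with a fallback to the standard text runner when it is not installed.

```python
import logging  # needed only if the commented-out debug call below is re-enabled
import unittest


def jaccard_distance(a, b):
    """Compute the Jaccard distance between two token collections.

    Parameters
    ----------
    a : iterable of str
        Tokens of the first document.
    b : iterable of str
        Tokens of the second document.

    Returns
    -------
    float
        One minus the ratio of shared tokens to all distinct tokens,
        or 0.0 when both inputs are empty.
    """
    set_a, set_b = set(a), set(b)
    # logging.debug("comparing %s with %s", set_a, set_b)
    union = set_a | set_b
    if not union:
        return 0.0
    return 1.0 - len(set_a & set_b) / len(union)


class JaccardDistanceTest(unittest.TestCase):
    def test_identical_sets(self):
        self.assertEqual(jaccard_distance(["a", "b"], ["b", "a"]), 0.0)

    def test_disjoint_sets(self):
        self.assertEqual(jaccard_distance(["a"], ["b"]), 1.0)


def run_tests():
    """Run the test case, writing XML reports when xmlrunner is available."""
    suite = unittest.defaultTestLoader.loadTestsFromTestCase(JaccardDistanceTest)
    try:
        import xmlrunner  # assumption: unittest-xml-reporting is installed
        runner = xmlrunner.XMLTestRunner(output="test-reports")
    except ImportError:
        runner = unittest.TextTestRunner()
    return runner.run(suite)


if __name__ == "__main__":
    run_tests()
```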
