Skip to content

JaishreeJanu/document-retrieval-web-app

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

14 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

A Django application for information retrieval from large document collection

  • This application lets us search text documents for given text query by leveraging benefits of building inverted index prior to information access.
  • For each term, there is a list of doc IDs where that term occurs.
  • This list is called postings list. We take all postings list of query terms (user query) and merge(intersection for conjunctive query and union for disjunctive query) them to find documents containing query terms.
  • You can read the blog here to know more about inverted index and how to create one(step by step).

Development setup

  • Initialize virtual environment
virtualenv --no-site-packages virtual_env
$ source env/scripts/activate

About

Django app for searching documents

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published