- This application lets us search text documents for given text query by leveraging benefits of building inverted index prior to information access.
- For each term, there is a list of doc IDs where that term occurs.
- This list is called postings list. We take all postings list of query terms (user query) and merge(intersection for conjunctive query and union for disjunctive query) them to find documents containing query terms.
- You can read the blog here to know more about inverted index and how to create one(step by step).
- Initialize virtual environment
virtualenv --no-site-packages virtual_env
$ source env/scripts/activate
- Run
python3 manage.py runserver
- Go to https://localhost:8000/search in the browser
- Add documents and search for any terms