Simple rag project

Description

A brief description of your project goes here.

Setup & Retrieval

To retrieve data use --retrieve flag and choose the correct name of the database with --db_name. The nokia_rag database is already setup for vector search and ready for querying. Please use this database name only for execution of the below command as the code runs with admin permissions. python main.py --retrieve --setup database_medium --db_name nokia_rag

Reproduction

Data preparation To prepare the data for the document database python main.py --setup data --load_csv_source_filename "example.csv" --save_json_source_filename "example_org.json" --save_json_embeddings_filename "example_chunked_embeddings.json"

The "example.csv" file is small just to show that the code runs. To run the code on the medium articles dataset set "medium.csv" as the source filename.

Note: The the right filenams need to be used when launching the database.

To launch the database for the first time (done once per database). Inserting the data into the database. python main.py --launch --setup database_medium --db_name <unsearchable_demo_database_name> --medium_json_filename "example_org.json" --embeddings_json_filename "example_chunked_embeddings.json"

Note: Unfortunately for the free cluster tier of the atlas databse I chose it is not possible to set search index in the database from code, so the database setup cannot be fulle reproduced with just this repo. Some interaction with UI is necessary. For this reason for retrieval use already set up nokia_rag database.

Running tests

Use pytest command, but the test are likely broken as for now...

Name		Name	Last commit message	Last commit date
Latest commit History 14 Commits
data		data
src		src
tests		tests
.env		.env
.gitingore		.gitingore
LICENSE		LICENSE
README.md		README.md
__init__.py		__init__.py
main.py		main.py
pytest.toml		pytest.toml
rag_report.pdf		rag_report.pdf
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Simple rag project

Description

Setup & Retrieval

Reproduction

Running tests

About

Releases

Packages

Languages

License

pszmk/rag

Folders and files

Latest commit

History

Repository files navigation

Simple rag project

Description

Setup & Retrieval

Reproduction

Running tests

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages