(Local) RAG Experiment

A Python script experimenting with using local files to augment queries to an LLM (or SLM, in this case). Uses ollama and the phi3:mini model. It should be able to parse HTML, PDF, and plain-text files, but I've only tried it with HTML so far.
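For orientation, the core flow looks roughly like the sketch below. This is not the actual dingus.py: the file handling, the lack of chunking, and the nomic-embed-text embedding model are all illustrative assumptions; only the phi3:mini generation model comes from this repo.

```python
# Illustrative sketch of the RAG flow; NOT the actual dingus.py.
# Assumes the `ollama` Python client and BeautifulSoup are installed,
# and that an embedding model (here nomic-embed-text) has been pulled.
import os
from pathlib import Path

import numpy as np
import ollama
from bs4 import BeautifulSoup

DOCS_LOCATION = os.environ.get("DOCS_LOCATION", "./docs")

def load_html_texts(root):
    """Extract plain text from every .html file under root."""
    for path in Path(root).rglob("*.html"):
        html = path.read_text(errors="ignore")
        yield path.name, BeautifulSoup(html, "html.parser").get_text(" ", strip=True)

def embed(text):
    # Hypothetical embedding model choice; swap in whatever the script uses.
    return np.array(ollama.embeddings(model="nomic-embed-text", prompt=text)["embedding"])

# Index: one embedding per document (a real script would chunk long files).
docs = list(load_html_texts(DOCS_LOCATION))
vectors = [embed(text) for _, text in docs]

def answer(question, top_k=3):
    # Rank documents by cosine similarity to the question, then stuff the
    # best matches into the prompt as context for the local model.
    q = embed(question)
    scores = [float(q @ v / (np.linalg.norm(q) * np.linalg.norm(v))) for v in vectors]
    best = sorted(range(len(docs)), key=lambda i: scores[i], reverse=True)[:top_k]
    context = "\n\n".join(docs[i][1] for i in best)
    prompt = f"Answer using only this context:\n{context}\n\nQuestion: {question}"
    return ollama.generate(model="phi3:mini", prompt=prompt)["response"]

print(answer("What do my notes say about the Raspberry Pi?"))
```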

Runs on a Raspberry Pi 5, but is painfully slow at the moment:

  • takes a couple of minutes to tokenise the input files (a few hundred HTML files in my case)
  • can take several minutes to return an answer, depending on the query

I'd like to improve this performance one day. The idea of having an "at-home" chatbot able to pull info from my personal files sounds appealing.

The script outputs the full input prompt and the raw API response for debugging purposes.
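That debug output can be as simple as printing the assembled prompt and the raw JSON returned by ollama. A minimal illustration using ollama's documented /api/generate REST endpoint (the wrapper function itself is hypothetical, not from this repo):

```python
# Minimal sketch of dumping the full prompt and raw API response for debugging.
import json
import requests

def debug_query(prompt):
    print("----- full prompt sent to the model -----")
    print(prompt)
    r = requests.post(
        "http://localhost:11434/api/generate",
        json={"model": "phi3:mini", "prompt": prompt, "stream": False},
        timeout=600,  # phi3:mini on a Pi 5 can take minutes per answer
    )
    print("----- raw API response -----")
    print(json.dumps(r.json(), indent=2))
    return r.json().get("response", "")
```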

Built using ChatGPT (4o) and GitHub Copilot, as I've previously only written ~20 lines of basic Python code, so there's probably plenty of scope for optimisation.

To get started (the full command sequence is sketched after these steps):

  1. Install ollama
  2. Pull the phi3:mini model
  3. Add a .env file. Set DOCS_LOCATION=<path to your source files>
  4. Install the script dependencies: pip install -r requirements.txt
  5. Run python dingus.py
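Put together, and assuming ollama's documented install script, the sequence looks something like this (the .env path is a placeholder):

```bash
# Assumed setup sequence; the first line is ollama's documented installer.
curl -fsSL https://ollama.com/install.sh | sh
ollama pull phi3:mini
echo "DOCS_LOCATION=/path/to/your/source/files" > .env
pip install -r requirements.txt
python dingus.py
```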
