Chat with your documents offline using AI. No data leaves your system. Internet connection is only required to install the tool and download the AI models. It is based on PrivateGPT but has more features.
- Supports GGML models via C Transformers
- Supports 🤗 Transformers models
- Supports GPTQ models
- Web UI
- GPU support
- Highly configurable via
chatdocs.yml
Show supported document types
Extension | Format |
---|---|
.csv |
CSV |
.docx , .doc |
Word Document |
.enex |
EverNote |
.eml |
|
.epub |
EPub |
.html |
HTML |
.md |
Markdown |
.msg |
Outlook Message |
.odt |
Open Document Text |
.pdf |
Portable Document Format (PDF) |
.pptx , .ppt |
PowerPoint Document |
.txt |
Text file (UTF-8) |
Install the tool using:
pip install chatdocs
Download the AI models using:
chatdocs download
Now it can be run offline without internet connection.
Add a directory containing documents to chat with using:
chatdocs add /path/to/documents
The processed documents will be stored in
db
directory by default.
Chat with your documents using:
chatdocs ui
Open https://localhost:5000 in your browser to access the web UI.
It also has a nice command-line interface:
chatdocs chat
All the configuration options can be changed using the chatdocs.yml
config file. Create a chatdocs.yml
file in some directory and run all commands from that directory. For reference, see the default chatdocs.yml
file.
You don't have to copy the entire file, just add the config options you want to change as it will be merged with the default config. For example, see tests/fixtures/chatdocs.yml
which changes only some of the config options.
To change the embeddings model, add and change the following in your chatdocs.yml
:
embeddings:
model: hkunlp/instructor-large
Note: When you change the embeddings model, delete the
db
directory and add documents again.
To change the C Transformers GGML model, add and change the following in your chatdocs.yml
:
ctransformers:
model: TheBloke/Wizard-Vicuna-7B-Uncensored-GGML
model_file: Wizard-Vicuna-7B-Uncensored.ggmlv3.q4_0.bin
model_type: llama
Note: When you add a new model for the first time, run
chatdocs download
to download the model before using it.
You can also use an existing local model file:
ctransformers:
model: /path/to/ggml-model.bin
model_type: llama
To use 🤗 Transformers models, add the following to your chatdocs.yml
:
llm: huggingface
To change the 🤗 Transformers model, add and change the following in your chatdocs.yml
:
huggingface:
model: TheBloke/Wizard-Vicuna-7B-Uncensored-HF
Note: When you add a new model for the first time, run
chatdocs download
to download the model before using it.
To use GPTQ models, install the auto-gptq
package using:
pip install chatdocs[gptq]
and add the following to your chatdocs.yml
:
llm: gptq
To change the GPTQ model, add and change the following in your chatdocs.yml
:
gptq:
model: TheBloke/Wizard-Vicuna-7B-Uncensored-GPTQ
model_file: Wizard-Vicuna-7B-Uncensored-GPTQ-4bit-128g.no-act-order.safetensors
Note: When you add a new model for the first time, run
chatdocs download
to download the model before using it.
To enable GPU (CUDA) support for the embeddings model, add the following to your chatdocs.yml
:
embeddings:
model_kwargs:
device: cuda
You may have to reinstall PyTorch with CUDA enabled by following the instructions here.
Note: Currently only LLaMA GGML models have GPU support.
To enable GPU (CUDA) support for the C Transformers GGML model, add the following to your chatdocs.yml
:
ctransformers:
config:
gpu_layers: 50
You should also reinstall the ctransformers
package with CUDA enabled:
pip uninstall ctransformers --yes
CT_CUBLAS=1 pip install ctransformers --no-binary ctransformers
Show commands for Windows
On Windows PowerShell run:
$env:CT_CUBLAS=1
pip uninstall ctransformers --yes
pip install ctransformers --no-binary ctransformers
On Windows Command Prompt run:
set CT_CUBLAS=1
pip uninstall ctransformers --yes
pip install ctransformers --no-binary ctransformers
To enable GPU (CUDA) support for the 🤗 Transformers model, add the following to your chatdocs.yml
:
huggingface:
device: 0
You may have to reinstall PyTorch with CUDA enabled by following the instructions here.
To enable GPU (CUDA) support for the GPTQ model, add the following to your chatdocs.yml
:
gptq:
device: 0
You may have to reinstall PyTorch with CUDA enabled by following the instructions here.
After installing PyTorch with CUDA enabled, you should also reinstall the auto-gptq
package:
pip uninstall auto-gptq --yes
pip install chatdocs[gptq]