Name		Name	Last commit message	Last commit date
parent directory ..
README.md		README.md
__init__.py		__init__.py
wikipedia.py		wikipedia.py

README.md

Retrieval-Enhanced Chatbot

This is a demonstration of how to enhance a chatbot using Wikipedia. We'll be using ChristophSchuhmann/wikipedia-3sentence-level-retrieval-index. for this demo. Thank Christoph for providing this resource!

In this demo, we'll be extending the approach of comparing and adding the adjacent w sentences to the matched sentence if their cosine similarity is larger than w_th. By doing so, we can provide the chatbot with a longer context, which may improve its performance.

This demo combines both the above index and the chat model into one system

Start the combined server

To get started, we need to install some dependencies and download the Wikipedia index:

Install dependencies

Install the necessary dependencies, including torch, transformers, flask, faiss, and fastparquet.

Open up wiki-server.py and set model_name_or_path to point to the path that contains the chat model
Start the retrieval server

python wiki-server.py

The server will listen on port 7003. It will download the data sets from ChristophSchuhman. This may take a few minutes.

Test the full retrieval enhanced chatbot

We now demonstrate both the wiki index and the GPT-NeoX-fine-tuned model.

curl -X POST -H 'Content-Type: application/json' https://127.0.0.1:7003/inference -d '{ "prompt" : "where is zurich located?" }'

Internally we first query the wiki index and generate a response using the provided model. To do this, We concatenate the retrieved information and the users' query into a prompt, encode it with a tokenizer, and generate a response using the chatbot model.

The response should indicate the location of Zurich city.

To test just the retrieval functionality of the system you can can do the following. Curl works as well.

import requests

endpoint = 'https://127.0.0.1:7003/search'
res = requests.post(endpoint, json={
    'query': 'Where is Zurich?',
    'k': 1,
    'w': 5,
    'w_th': 0.7,
})
print(res.json())

This should print the most relevant sentences about Zurich from Wikipedia. By increasing w and decreasing w_th, we can retrieve a longer context.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

retrieval

retrieval

README.md

Retrieval-Enhanced Chatbot

Start the combined server

Files

retrieval

Directory actions

More options

Directory actions

More options

Latest commit

History

retrieval

Folders and files

parent directory

README.md

Retrieval-Enhanced Chatbot

Start the combined server