How to retrieve fewer question-answer pairs from chromaDB #218

philippschw · 2024-01-30T15:57:48Z

I am running into token limits. Is it possible to limit the number of question-answer pairs that are retrieved when calling get_similar_question_sql?

I see that chromadb has the argument n_results=10, but I am struggling to make use of it in the vanna.ai framework.

Thanks!

Nuclear6 · 2024-02-27T02:34:34Z

I ran into this problem too. It also seems that irrelevant items can be retrieved. Is there a way to improve this?

andreped · 2024-02-27T09:21:17Z

@philippschw Hmm, I tried looking for this, and I also struggle to see where in the code this is actually set. Maybe setting n_results is a deprecated feature, or just really hidden in the VannaBase class? Any comments, @zainhoda?

I know this does not answer the original question, but a PR to add support for setting max_tokens was just merged yesterday.
See bb0b4cc. Previously this was hardcoded to 500 for the OpenAI client, which would explain why you are running out of tokens.

To test if the latest changes resolve your issue, install Vanna from source:

pip install git+https://github.com/vanna-ai/vanna.git

and then you can set max_tokens through the config argument when initializing the client.

zainhoda · 2024-02-27T14:29:15Z

Chroma has an option for n_results with the default being 10
https://docs.trychroma.com/reference/Collection#query

Since we don't specify the n_results, it uses the default:
https://github.com/vanna-ai/vanna/blob/main/src/vanna/chromadb/chromadb_vector.py#L230-L232

Ideally we should have some config params to set n_ddl, n_documentation, and n_question_sql to retrieve a specific number of items.

In the meantime, as an end user you can do this without forking the package by simply overriding the get_similar_question_sql function when you set up the class

from vanna.openai.openai_chat import OpenAI_Chat
from vanna.chromadb.chromadb_vector import ChromaDB_VectorStore

class MyVanna(ChromaDB_VectorStore, OpenAI_Chat):
    def __init__(self, config=None):
        ChromaDB_VectorStore.__init__(self, config=config)
        OpenAI_Chat.__init__(self, config=config)

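    def get_similar_question_sql(self, question: str, **kwargs) -> list:
        # Query the question-SQL collection with an explicit n_results
        # instead of relying on Chroma's default of 10.
        return ChromaDB_VectorStore._extract_documents(
            self.sql_collection.query(
                query_texts=[question],
                n_results=5,  # Or whatever number you want
            )
        )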

vn = MyVanna(config={'api_key': 'sk-...', 'model': 'gpt-3.5-turbo'})
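
To check the effect of such a limit without an API key or a real Chroma database, here is a minimal sketch. StubCollection and similar_question_sql are hypothetical stand-ins, not part of Vanna or Chroma. The stub only copies the shape of a Chroma query result (a dict whose "documents" entry holds one list per query text), and the sketch assumes each stored pair is a JSON string.

```python
import json


class StubCollection:
    """Hypothetical stand-in for a Chroma collection; mimics only query()'s result shape."""

    def __init__(self, documents):
        self._documents = documents

    def query(self, query_texts, n_results=10):
        # A real collection ranks documents by embedding distance.
        # This stub simply returns the first n_results documents per query.
        return {"documents": [self._documents[:n_results] for _ in query_texts]}


def similar_question_sql(collection, question, n_results=5):
    """Return at most n_results stored pairs, decoded from JSON strings (assumed format)."""
    results = collection.query(query_texts=[question], n_results=n_results)
    return [json.loads(doc) for doc in results["documents"][0]]


# Ten fake question-SQL pairs, stored as JSON strings.
stored = [json.dumps({"question": f"q{i}", "sql": f"SELECT {i}"}) for i in range(10)]
pairs = similar_question_sql(StubCollection(stored), "how many users?", n_results=3)
print(len(pairs))
```

Lowering n_results shrinks the list that ends up in the prompt, which is what reduces token usage.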

andreped · 2024-02-27T14:33:12Z

> Ideally we should have some config params to set n_ddl, n_documentation, and n_question_sql to retrieve a specific number of items.

@zainhoda I can make a PR to add support for this to set the n_results through the config argument, if you want? :]

zainhoda · 2024-02-27T14:35:55Z

@andreped you are the best contributor an open-source project could ever hope for 🚀

if you have time and inclination, feel free -- I'll batch all of your PRs into the next release

andreped · 2024-02-27T15:00:03Z

> Ideally we should have some config params to set n_ddl, n_documentation, and n_question_sql to retrieve a specific number of items.

@zainhoda I just saw this. Do we want to set n_results only for self.sql_collection.query(), or also for the get_related_ddl() and get_related_documentation() methods?

If we want to set n_results for all three, should it be three different n_results values that can be set, or should it just be the same for all?
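
One way to support both options is a shared value with optional per-collection overrides. The sketch below is only an illustration of that idea, not what the PR implements. The key name n_results as a shared fallback and the helper resolve_n_results are assumptions; n_ddl, n_documentation, and n_question_sql are the names suggested earlier in this thread.

```python
from typing import Optional

DEFAULT_N_RESULTS = 10  # Chroma's default for n_results, as noted above


def resolve_n_results(config: Optional[dict]) -> dict:
    """Return the retrieval limit for each collection.

    Hypothetical config keys: n_results is a shared fallback, while
    n_ddl, n_documentation, and n_question_sql override it per collection.
    """
    config = config or {}
    shared = config.get("n_results", DEFAULT_N_RESULTS)
    return {
        "ddl": config.get("n_ddl", shared),
        "documentation": config.get("n_documentation", shared),
        "question_sql": config.get("n_question_sql", shared),
    }


# One shared limit of 5, with a tighter limit for question-SQL pairs only.
limits = resolve_n_results({"n_results": 5, "n_question_sql": 3})
print(limits)
```

With this layout, users who want one number set a single key, and users who need finer control override only the collection that causes trouble.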

andreped · 2024-02-27T15:06:13Z

I made a PR, feel free to review directly there, @zainhoda :]

andreped mentioned this issue Feb 27, 2024

Add support for setting number of retrieved similar sql queries in chroma #268

Merged

zainhoda closed this as completed in #268 Mar 2, 2024
