[Question]: How to create a query pipeline to chat with text docs and SQL tables #13775

Open
JonOnEarth opened this issue May 28, 2024 · 5 comments
Labels: question (Further information is requested)

@JonOnEarth

Question Validation

  • I have searched both the documentation and Discord for an answer.

Question

I have followed these two advanced examples [1, 2] and can successfully chat with SQL tables and with text docs, separately.
How can I stitch them together, so I can chat with multiple SQL tables and text docs at the same time?
The process should be to choose the correct doc or table based on the query and then chat with it. If a table is chosen, the text-to-SQL component is needed as well.
Also, if I ask a query that is irrelevant to these documents, how can I just use the LLM's own reply instead of always searching the documents?

JonOnEarth added the question label on May 28, 2024
@logan-markewich (Collaborator)

Basically, add some kind of router to route between the paths. Query pipelines support conditional links.

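The routing shape being suggested here looks roughly like the sketch below: each candidate path is its own query pipeline, and a router component picks one of them per query. This is only a sketch under assumptions, not tested code: sql_pipeline and rag_pipeline are hypothetical stand-ins for the pipelines from the two linked examples, and the import paths are assumed from the components used later in this thread.

from llama_index.core.query_pipeline import QueryPipeline, RouterComponent
from llama_index.core.selectors import LLMSingleSelector

# sql_pipeline / rag_pipeline: already-built QueryPipelines from the two
# linked examples (hypothetical names for this sketch).
router = RouterComponent(
    selector=LLMSingleSelector.from_defaults(),
    choices=[
        "Useful for questions over the SQL tables",
        "Useful for questions over the text documents",
    ],
    components=[sql_pipeline, rag_pipeline],  # each must expose exactly one required input key
    verbose=True,
)
top_pipeline = QueryPipeline(chain=[router], verbose=True)
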
@JonOnEarth (Author)

Thanks @logan-markewich. I am new to LlamaIndex and still have a lot of confusion; I am just trying to learn from the examples and real problems.
Here is what I did, and the error I get at the end.

I combined the two examples into basic query pipelines like this:
1st example, for the SQL tables:

qp = QP(
    modules={
        "input": InputComponent(),
        "table_retriever": obj_retriever,
        "table_output_parser": table_parser_component,
        "text2sql_prompt": text2sql_prompt,
        "text2sql_llm": llm,
        "sql_output_parser": sql_parser_component,
        "sql_retriever": sql_retriever,
        "response_synthesis_prompt": response_synthesis_prompt,
        "response_synthesis_llm": llm,
    },
    verbose=True,
)
qp.add_chain(["input", "table_retriever", "table_output_parser"])
qp.add_link("input", "text2sql_prompt", dest_key="query_str")
qp.add_link("table_output_parser", "text2sql_prompt", dest_key="schema")
qp.add_chain(
    ["text2sql_prompt", "text2sql_llm", "sql_output_parser", "sql_retriever"]
)
qp.add_link(
    "sql_output_parser", "response_synthesis_prompt", dest_key="sql_query"
)
qp.add_link(
    "sql_retriever", "response_synthesis_prompt", dest_key="context_str"
)
qp.add_link("input", "response_synthesis_prompt", dest_key="query_str")
qp.add_link("response_synthesis_prompt", "response_synthesis_llm")

2nd example for text RAG:

p = QueryPipeline(verbose=True)
p.add_modules(
    {
        "input": InputComponent(),
        "retriever": retriever,
        "summarizer": summarizer,
    }
)
p.add_link("input", "retriever")
p.add_link("input", "summarizer", dest_key="query_str")
p.add_link("retriever", "summarizer", dest_key="nodes")

I added a 3rd query pipeline that just uses the LLM:

qp_llm = QueryPipeline(
    modules={
        "llm": llm2,
    },
    verbose=True,
)

Then I added the router:

# define selector
selector = LLMSingleSelector.from_defaults()
choices = [
    "This tool answers questions related to wiki tables data",
    "This tool contains the knowledge about Paul Graham",
    "This tool only uses LLM itself to answer the questions non-related to Paul Graham and wiki tables data. "
]
router_c = RouterComponent(
    selector=selector,
    choices=choices,
    components=[p, qp_llm], #qp
    verbose=True,
)
# top-level pipeline
qp_t = QueryPipeline(chain=[router_c], verbose=True)

It returns the error:

/usr/local/lib/python3.10/dist-packages/llama_index/core/query_pipeline/components/router.py in __init__(self, selector, choices, components, verbose)
    109             # validate component has one input key
    110             if len(new_component.free_req_input_keys) != 1:
--> 111                 raise ValueError("Expected one required input key")
    112             query_keys.append(next(iter(new_component.free_req_input_keys)))
    113             new_components.append(new_component)

ValueError: Expected one required input key

@lazyFrogLOL

(quoting @JonOnEarth's comment and traceback above)

Is there any solution for this issue? I am running into it as well. How do I define a free_req_input_key for QueryPipeline modules?

@lazyFrogLOL

@JonOnEarth
I solved the issue by replacing InputComponent() with a specific PromptTemplate object.

prompt_str = "{query}"
prompt_tmpl = PromptTemplate(prompt_str)
p = QueryPipeline(verbose=True)
p.add_modules(
    {
        "input": prompt_tmpl,
        "retriever": retriever,
        "summarizer": summarizer,
    }
)
p.add_link("input", "retriever")
p.add_link("input", "summarizer", dest_key="query_str")
p.add_link("retriever", "summarizer", dest_key="nodes")

You can try using this code.
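
Extending that fix back to the original three-way setup, a rough sketch could look like the following. Assumptions: qp, p, and llm2 are built as in the comments above, the SQL pipeline's "input" module is likewise swapped to a single-variable PromptTemplate, and import paths may differ by llama-index version.

from llama_index.core import PromptTemplate
from llama_index.core.query_pipeline import QueryPipeline, RouterComponent
from llama_index.core.selectors import LLMSingleSelector

# Plain-LLM fallback pipeline, prefixed with a single-variable template so it
# also exposes exactly one free required input key.
qp_llm = QueryPipeline(chain=[PromptTemplate("{query}"), llm2], verbose=True)

# qp: the text-to-SQL pipeline above, with its "input" module replaced by
# PromptTemplate("{query}") in the same way as in p.
router_c = RouterComponent(
    selector=LLMSingleSelector.from_defaults(),
    choices=[
        "Answers questions about the wiki tables data",
        "Answers questions about Paul Graham",
        "Answers questions unrelated to the wiki tables or Paul Graham",
    ],
    components=[qp, p, qp_llm],
    verbose=True,
)
qp_t = QueryPipeline(chain=[router_c], verbose=True)

The key point, going by the traceback above, is that every component handed to RouterComponent must have exactly one free required input key, which is what the single-variable PromptTemplate provides.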

@JonOnEarth (Author)

(quoting @lazyFrogLOL's solution above)

@lazyFrogLOL Thanks for the solution.
