
Results of intermediate PromptNodes in Multi-PromptNode pipelines are buried #3878

Closed
1 task done
tstadel opened this issue Jan 17, 2023 · 3 comments · Fixed by #3892
@tstadel
Member

tstadel commented Jan 17, 2023

Describe the bug
Currently it's not possible to easily access the output of multiple PromptNodes that are chained together.
E.g. the code from the docs using a "question-generation" and a "question-answering" PromptNode in sequence only returns the answers as results, while the questions (which are an intermediate result) are buried inside PromptNode's invocation_context:

from getpass import getpass

from haystack import Document, Pipeline
from haystack.nodes.prompt import PromptTemplate, PromptNode, PromptModel

# This is to set up the OpenAI model:
api_key = getpass("Enter OpenAI API key:")

# Specify the model you want to use:
prompt_open_ai = PromptModel(model_name_or_path="text-davinci-003", api_key=api_key)

# This sets up the default model:
prompt_model = PromptModel()

# Now let's make one PromptNode use the default model and the other one the OpenAI model:
node_default_model = PromptNode(prompt_model, default_prompt_template="question-generation", output_variable="questions")
node_openai = PromptNode(prompt_open_ai, default_prompt_template="question-answering")

pipeline = Pipeline()
pipeline.add_node(component=node_default_model, name="prompt_node1", inputs=["Query"])
pipeline.add_node(component=node_openai, name="prompt_node_2", inputs=["prompt_node1"])
output = pipeline.run(query="not relevant", documents=[Document("Berlin is the capital of Germany")])
output["results"]

would produce something like this:

["Berlin"]

while the question that led to the answer is buried under output["meta"]["invocation_context"]["questions"]
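The buried access path can be illustrated with plain dicts, no Haystack required. The dict shape below simply mirrors the output described in this report and is illustrative only:

```python
# Minimal sketch (plain dicts, not Haystack API) of how the intermediate
# "questions" must be dug out of the nested pipeline output described above.
output = {
    "results": ["Berlin"],
    "meta": {
        "invocation_context": {"questions": ["What is the capital of Germany?"]}
    },
}

# Today: the answers sit at the root, but the questions are two levels deep.
answers = output["results"]
questions = output["meta"]["invocation_context"]["questions"]

print(answers)    # ['Berlin']
print(questions)  # ['What is the capital of Germany?']
```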

Expected behavior
Questions and answers can easily be accessed together, e.g. by exposing PromptNode's output_variable at the root level of node_output.
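The requested behavior could be sketched with a hypothetical helper that promotes the output_variable to the root of the node output. The function name and dict shape are illustrative assumptions, not Haystack API:

```python
# Hypothetical helper illustrating the feature request: copy a PromptNode's
# output_variable out of the nested invocation_context to the root of the
# node output, so questions and answers sit side by side.
def promote_output_variable(node_output: dict, output_variable: str) -> dict:
    ctx = node_output.get("meta", {}).get("invocation_context", {})
    if output_variable in ctx:
        node_output[output_variable] = ctx[output_variable]
    return node_output

output = {
    "results": ["Berlin"],
    "meta": {"invocation_context": {"questions": ["What is the capital of Germany?"]}},
}
output = promote_output_variable(output, "questions")

print(output["questions"])  # ['What is the capital of Germany?']
print(output["results"])    # ['Berlin']
```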

FAQ Check

@vblagoje
Member

We'll do this, @tstadel. Thanks for bringing this request to our attention.

@masci
Contributor

masci commented Jan 25, 2023

Since this issue was opened we made some changes to the PromptNode API, and I would argue that the intermediate output is not buried anymore. This is the current output from the pipeline run:

{
  'results': ['Berlin.'],
  'invocation_context': {'questions': ['What is the capital of Germany?']},
  'questions': ['What is the capital of Germany?'],
  'root_node': 'Query',
  'params': {},
  'query': 'not relevant',
  'documents': [<Document: {'content': 'Berlin is the capital of Germany', 'content_type': 'text', 'score': None, 'meta': {}, 'embedding': None, 'id': '51b1f05adecc6e656d68af93cc40bd9c'}>],
  'node_id': 'prompt_node_2'
}

The PR linked would produce this instead:

{
  'results': ['Berlin'],
  'invocation_context': {'questions': ['What is the capital of Germany?']},
  'questions': ['What is the capital of Germany?'],
  'root_node': 'Query',
  'params': {},
  'query': 'not relevant',
  'documents': [<Document: {'content': 'Berlin is the capital of Germany', 'content_type': 'text', 'score': None, 'meta': {}, 'embedding': None, 'id': '51b1f05adecc6e656d68af93cc40bd9c'}>],
  'node_id': 'prompt_node_2'
}

In my opinion, having output["questions"] instead of output["invocation_context"]["questions"] is not worth the price of the key duplication. If anything, the current version is clearer to me: it tells me that the questions were an intermediate product of the pipeline run, not the final result.

@tstadel if you still think we should move the key up to the root, @vblagoje's PR looks good to me; otherwise I would leave the code as is.

@tstadel
Member Author

tstadel commented Jan 25, 2023

I would disagree that the current version is clearer. Especially after reading the docstring of PromptNode's output_variable, I would expect it to be at the root level. Additionally, all other nodes accumulate their results at the root level as well (e.g. retrievers in a QA pipeline also store documents there, which is an intermediate result too). Of course, those override the results of previously executed nodes of the same type, but PromptNode, being a multi-purpose node, is different here.
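The precedent tstadel refers to can be sketched with plain dicts (not Haystack API, purely illustrative): in a typical retriever-reader QA pipeline, the retriever's intermediate result already lands at the root of the output next to the final answers.

```python
# Illustrative sketch: in a retriever-reader QA pipeline, the intermediate
# "documents" from the retriever sit at the root of the output alongside the
# reader's final "answers" - the accumulation behavior described above.
qa_output = {
    "query": "What is the capital of Germany?",
    "documents": ["Berlin is the capital of Germany"],  # intermediate (retriever)
    "answers": ["Berlin"],                              # final (reader)
}

# Both intermediate and final results are directly accessible at the root:
print(qa_output["documents"])  # ['Berlin is the capital of Germany']
print(qa_output["answers"])    # ['Berlin']
```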
