feat: add `top_k` to `PromptNode` #4159

Conversation
```diff
@@ -979,4 +988,4 @@ def run_batch(
     def _prepare_model_kwargs(self):
         # these are the parameters from PromptNode level
         # that are passed to the prompt model invocation layer
-        return {"stop_words": self.stop_words}
+        return {"stop_words": self.stop_words, "top_k": self.top_k}
```
Will be pushed down to `PromptModel.invoke` and `InvocationLayer.invoke`.
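For orientation, a minimal sketch of that flow; the class and method names mirror Haystack's, but the bodies are simplified assumptions rather than the actual source:

```python
# Simplified sketch: how PromptNode-level kwargs reach the invocation layer.
class InvocationLayer:
    def invoke(self, prompt: str, **kwargs):
        # A real layer would translate kwargs (e.g. top_k) and call the model.
        print(f"invoke(prompt={prompt!r}, kwargs={kwargs})")
        return [prompt]


class PromptModel:
    def __init__(self):
        self.model_invocation_layer = InvocationLayer()

    def invoke(self, prompt: str, **kwargs):
        # PromptModel.invoke forwards everything to the invocation layer.
        return self.model_invocation_layer.invoke(prompt, **kwargs)


class PromptNode:
    def __init__(self, stop_words=None, top_k: int = 1):
        self.stop_words = stop_words
        self.top_k = top_k
        self.prompt_model = PromptModel()

    def _prepare_model_kwargs(self):
        return {"stop_words": self.stop_words, "top_k": self.top_k}

    def prompt(self, prompt: str):
        return self.prompt_model.invoke(prompt, **self._prepare_model_kwargs())


PromptNode(top_k=3).prompt("What is the capital of Germany?")
```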
haystack/nodes/prompt/prompt_node.py (outdated diff)
if "top_k" in kwargs: | ||
kwargs["n"] = kwargs.pop("top_k") |
Handling in `OpenAIInvocationLayer`.
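A hedged sketch of why the rename is needed: the OpenAI completion API calls the number of returned candidates `n`, so the layer renames the key before building the request. The payload construction below is an illustrative assumption, not the exact Haystack code:

```python
def build_openai_payload(prompt: str, **kwargs) -> dict:
    # OpenAI has no `top_k` parameter; its equivalent is `n`
    # (number of completions to generate per prompt).
    if "top_k" in kwargs:
        kwargs["n"] = kwargs.pop("top_k")
    return {"prompt": prompt, **kwargs}


payload = build_openai_payload("Say hello", top_k=3)
assert payload == {"prompt": "Say hello", "n": 3}
```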
```python
if top_k:
    model_input_kwargs["num_return_sequences"] = top_k
    model_input_kwargs["num_beams"] = top_k
```
Handling in `HFInvocationLayer`.
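A hedged sketch of the same idea with plain transformers, outside Haystack; the model choice is an arbitrary example. Requesting the k best generations means running beam search with at least k beams:

```python
from transformers import pipeline

generator = pipeline("text2text-generation", model="google/flan-t5-small")

top_k = 3
outputs = generator(
    "Q: What is the capital of Germany? A:",
    num_beams=top_k,             # beam search needs num_beams >= num_return_sequences
    num_return_sequences=top_k,  # return the top_k highest-scoring sequences
)
print([o["generated_text"] for o in outputs])
```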
In `AnswerGenerator` we warn if `num_beams` < `num_return_sequences`. If the user wants to control them separately, she can do so via `model_kwargs`, but I can't even imagine why one would do so.
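For reference, a guard of the kind described could look like this; the exact `AnswerGenerator` message is not quoted in this thread, so the wording below is an assumption:

```python
import logging

logger = logging.getLogger(__name__)

num_beams, num_return_sequences = 2, 4  # example values
if num_beams < num_return_sequences:
    # Beam search cannot return more sequences than it has beams.
    logger.warning(
        "num_beams (%d) should be >= num_return_sequences (%d).",
        num_beams,
        num_return_sequences,
    )
```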
Yeah, this approach is correct @tstadel: handle `top_k` on a conceptual level, and let each implementation layer handle the specifics of the `top_k` concept appropriately.
#4151 will be merged into master first. I will resolve conflicts if they arise.
OK, if @sjrl doesn't object, let's go with `top_k`.
Sounds good to me!
```python
pipe = Pipeline()
pipe.add_node(component=node, name="prompt_node", inputs=["Query"])
result = pipe.run(query="not relevant", documents=[Document("Berlin is the capital of Germany")])
```
Thanks for adding the test! I wanted to ask if it would make more sense to set `query=None` instead of `"not relevant"` for the general use case of a node that ignores the query? Doesn't matter for tests though, just wanted to ask.
@sjrl I think there is still something that requires the query to be set. Ideally, we shouldn't have to pass `query` at all here. As long as `query` is required, it doesn't make a difference, I'd say.
Looks good! Thanks for adding this feature!
@tstadel Please update your branch with main and rerun CI, then it should be good to go!
Related Issues

- feat: add `top_k` to `PromptNode` #4158

Proposed Changes:

- Add a `top_k` parameter to `PromptNode`
- Pass `top_k` to the invocation layers and map it to each layer's invocation interface (see the usage sketch below)
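For illustration, a hedged usage sketch of the new parameter; the model name and prompt are examples, not taken from this PR:

```python
from haystack.nodes import PromptNode

# top_k asks the underlying model for the k best generations; each invocation
# layer translates it (OpenAI: `n`, HF: num_beams/num_return_sequences).
node = PromptNode("google/flan-t5-small", top_k=3)
results = node("What is the capital of Germany?")
print(results)  # expected: up to 3 generated candidates
```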
How did you test it?

Notes for the reviewer

test/nodes/test_audio.py::TestTextToSpeech::test_text_to_speech_compress_audio fails. This PR didn't touch any of TestTextToSpeech. Other PRs' CIs are failing because of this, too. Maybe a dependency update?

Checklist
- The PR title starts with one of the conventional commit prefixes: `fix:`, `feat:`, `build:`, `chore:`, `ci:`, `docs:`, `style:`, `refactor:`, `perf:`, `test:`.