
Fix: Add rate limit to OpenAIAnswerGenerator, change token computation #3078

Closed · wants to merge 1 commit

Conversation

@Timoeller (Contributor) commented on Aug 19, 2022

Related Issues

Proposed Changes:

  1. Added a rate limit, since without one the API eventually starts rejecting requests.
  2. Added a try/except around the API call, since OpenAI's API sometimes returns errors.
  3. Changed the prompt length computation, since the expected output length has to be taken into account as well (see the sketch after this list).
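
For illustration, a minimal sketch of the adjusted length check from point 3; the names MAX_TOKENS_LIMIT, count_tokens, and fits_in_prompt are hypothetical, not the PR's actual code:

# Hypothetical sketch: the model's context window must hold the prompt
# *and* the generated answer, so the budget available for the input
# documents is the window size minus the requested output length.
MAX_TOKENS_LIMIT = 2048  # example context window size

def fits_in_prompt(prompt: str, max_tokens: int, count_tokens) -> bool:
    """True if the prompt plus the requested answer length fit the window."""
    return count_tokens(prompt) + max_tokens <= MAX_TOKENS_LIMIT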

How did you test it?

Manual verification.

Checklist

@Timoeller Timoeller requested a review from a team as a code owner August 19, 2022 18:49
@Timoeller Timoeller requested review from bogdankostic and removed request for a team August 19, 2022 18:49
    presence_penalty: float = -2.0,
    frequency_penalty: float = -2.0,
    max_calls_per_minute: int = 60,
@tholor (Member) commented:

Shouldn't we make the default here 3000? I'd prefer having the "production" setting for a paid OpenAI account here. For people on the free trial, we can make the error message that you added further down more descriptive.

try:
    response = requests.request("POST", url, headers=headers, data=json.dumps(payload))
    time.sleep(60 / self.max_calls_per_minute)
except Exception as e:
@tholor (Member) commented:

Can we catch the exception type / API response code for rate limits? If yes, we can make the error message here more actionable, like: "Seems like the OpenAI API rate-limited your requests. See the limits of the plans here (link) and consider setting max_calls_per_minute to a lower value (currently: XX) when initializing the OpenAIAnswerGenerator node in Haystack."
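
A minimal sketch of what this suggestion could look like; the function name and error text are illustrative, only the status-code check on the requests response is standard:

import json
import requests

def post_with_rate_limit_check(url, headers, payload, max_calls_per_minute):
    # Turn an HTTP 429 (Too Many Requests) into an actionable error message.
    response = requests.request("POST", url, headers=headers, data=json.dumps(payload))
    if response.status_code == 429:
        raise RuntimeError(
            "Seems like the OpenAI API rate-limited your requests. Consider setting "
            f"max_calls_per_minute to a lower value (currently: {max_calls_per_minute}) "
            "when initializing the OpenAIAnswerGenerator node."
        )
    return response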

@Timoeller (Contributor, author) replied:

Sorry, this try/except was not meant for catching the rate limit. But I see the value of catching the rate limit as you suggest.

I think there is also a need to catch more general problems: when I was running evaluation after adjusting the rate limit, the API still ran into errors from time to time. That's why there is a very general try/except block here.
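
As an illustration of such a general safety net (not the PR's code), a simple retry loop with backoff, assuming transient failures surface as requests exceptions:

import time
import requests

def request_with_retries(url, headers, data, n_retries=3):
    # Retry transient API failures with a simple exponential backoff.
    for attempt in range(n_retries):
        try:
            return requests.request("POST", url, headers=headers, data=data)
        except requests.RequestException:
            if attempt == n_retries - 1:
                raise  # give up after the last attempt
            time.sleep(2 ** attempt)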

@bogdankostic (Contributor) left a review comment:

I agree with @tholor's comments. Also, we need to fix the import order as requested by pylint and generate new schemas.

@@ -191,7 +209,7 @@ def _build_prompt(self, query: str, documents: List[Document]) -> Tuple[str, List[Document]]:
         )

         # Top ranked documents should go at the end
-        context = " ".join(reversed(input_docs_content))
+        context = "\n".join(reversed(input_docs_content))
@bogdankostic (Contributor) commented:

We use " " here because that is what's used in the transition guide for the transition from the answers endpoint to the completions endpoint (see here for the transition guide and here for the place where " " is used).

What was your motivation for changing this to "\n"? Anyway, I assume this change won't affect the way the answers are produced, as tokenization should treat both " " and "\n" the same way.
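
One way to sanity-check this assumption is with the GPT-2 byte-pair encoding that the GPT-3 models use, here via the transformers tokenizer; both separators encode to a single token:

from transformers import GPT2TokenizerFast

tokenizer = GPT2TokenizerFast.from_pretrained("gpt2")
print(tokenizer.encode(" "))   # a single token for the space
print(tokenizer.encode("\n"))  # a single token for the newline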

@Timoeller (Contributor, author) replied:

My motivation was to make it more readable. With the newline you can quickly see the different articles in the context.
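
A toy example (with made-up document snippets) of the difference:

docs = ["First article ...", "Second article ...", "Third article ..."]
print(" ".join(reversed(docs)))   # articles run together on one line
print("\n".join(reversed(docs)))  # each article starts on its own line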

@Timoeller (Contributor, author) commented:

Sorry, I realize I won't be able to work on this next week. I can either look into it afterward, or you could take over?

@Timoeller (Contributor, author) commented:

Closing this in favor of #3398

@Timoeller Timoeller closed this Oct 21, 2022
@masci masci deleted the openai_fixes branch September 13, 2023 08:55
Development

Successfully merging this pull request may close these issues.

running into rate limit with OpenAI's GPT-3 API when used during evaluation