Fix token usage with jump forward #174

comaniac · 2024-02-09T18:59:50Z

close #173

This PR fixes the incorrect token usage reported when jump forward is enabled. Specifically, we introduce a new field, orig_prompt_tokens, which is set when the first jump forward happens so that we know the original number of prompt tokens. When returning a response (a chunk in streaming or a complete response), we use the following equations to correct the token usage:

completion_tokens = curr_prompt_token - orig_prompt_tokens + completion_tokens
prompt_tokens = orig_prompt_tokens
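
As a rough illustration of the two equations above, here is a minimal standalone sketch. It is not the code in this PR: the `Usage` dataclass and the `corrected_usage` helper are hypothetical names used only for this example, and it assumes that tokens added by jump forward show up in the current prompt token count and should be reported as completion tokens.

```python
from dataclasses import dataclass


@dataclass
class Usage:
    """Hypothetical container for the reported token usage."""
    prompt_tokens: int
    completion_tokens: int


def corrected_usage(
    orig_prompt_tokens: int,
    curr_prompt_tokens: int,
    completion_tokens: int,
) -> Usage:
    """Apply the correction from the PR description (illustrative only)."""
    # Assumption: the growth of the prompt since the request was created
    # comes from jump forward, so it is counted as completion tokens.
    jumped_tokens = curr_prompt_tokens - orig_prompt_tokens
    return Usage(
        prompt_tokens=orig_prompt_tokens,
        completion_tokens=jumped_tokens + completion_tokens,
    )


if __name__ == "__main__":
    # A 10-token prompt that grew to 14 tokens, plus 5 decoded tokens.
    print(corrected_usage(10, 14, 5))
```

When no jump forward has happened, the current and original prompt token counts are equal, so the correction leaves the usage unchanged.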

hnyls2002 · 2024-02-10T01:33:34Z

@comaniac Hi, I suggest initializing orig_prompt_tokens when constructing the Req so that we can simplify the code.

comaniac · 2024-02-10T03:04:18Z

Thanks, that makes sense. Meanwhile, can you add some comments back to explain why we need to calculate the token usage in this way?

Fix token usage with jump forward

aed223f

comaniac requested review from merrymercy and hnyls2002 and removed request for merrymercy February 9, 2024 18:59

init orig_prompt_tokens when init Req

c813cbe

Add comments

720ce3f

comaniac merged commit 4d303c4 into main Feb 10, 2024

comaniac deleted the cody/usage branch February 10, 2024 04:06
