Skip to content

Commit

Permalink
Revert change in tokenization.py
Browse files Browse the repository at this point in the history
  • Loading branch information
jon-barker committed Jun 28, 2024
1 parent 7fb826e commit aa1c33b
Showing 1 changed file with 1 addition and 1 deletion.
2 changes: 1 addition & 1 deletion megatron/inference/text_generation/tokenization.py
Original file line number Diff line number Diff line change
Expand Up @@ -111,7 +111,7 @@ def _tokenize_prompts_and_batch(prompts, tokens_to_generate, add_BOS):
prompts_tokens = [[eod_token] + tokenizer.tokenize(prompt)
for prompt in prompts]
else:
prompts_tokens = [tokenizer.instruct_tokenize(prompt) for prompt in prompts]
prompts_tokens = [tokenizer.tokenize(prompt) for prompt in prompts]

# Now we have a list of list of tokens which each list has a different
# size. We want to extend this list to:
Expand Down

0 comments on commit aa1c33b

Please sign in to comment.