replace skip_embed with input_embeds #222

Merged · 1 commit into sgl-project:main · Mar 11, 2024

Conversation

@TideDra (Contributor) commented on Feb 23, 2024

This PR replaces the argument skip_embed: bool with input_embeds: torch.Tensor in the forward functions of all the language models.

This fixes a bug raised by LogitsProcessor when using llava in Extend mode while returning log probabilities, for example when using gen(choices=[...]) with llava. LogitsProcessor needs to receive input_ids and uses them to look up the token logprobs; however, the input_ids that llava passes to LogitsProcessor are actually the input embeddings, which causes a tensor dimension mismatch error.
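
For context, here is a minimal sketch of the signature change described above. The ToyLanguageModel class, its method names, and shapes are illustrative assumptions, not the actual sglang model code or the real diff:

```python
import torch
import torch.nn as nn
from typing import Optional


class ToyLanguageModel(nn.Module):
    """Illustrative sketch only; not the actual sglang model code."""

    def __init__(self, vocab_size: int = 32, hidden_size: int = 8):
        super().__init__()
        self.embed_tokens = nn.Embedding(vocab_size, hidden_size)

    # Old style: a boolean flag meant input_ids could already hold embeddings,
    # so a downstream LogitsProcessor expecting token ids would instead receive
    # a [num_tokens, hidden_size] tensor and fail with a dimension mismatch.
    def forward_old(self, input_ids: torch.Tensor, skip_embed: bool = False) -> torch.Tensor:
        return input_ids if skip_embed else self.embed_tokens(input_ids)

    # New style: embeddings travel in their own argument, and input_ids always
    # remains a plain tensor of token ids usable for logprob lookup.
    def forward_new(
        self,
        input_ids: torch.Tensor,
        input_embeds: Optional[torch.Tensor] = None,
    ) -> torch.Tensor:
        if input_embeds is not None:
            return input_embeds
        return self.embed_tokens(input_ids)
```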

@TideDra (Contributor, Author) commented on Feb 23, 2024

With this fix, we can use gen(choices=[...]) with llava in text-only mode, but we still cannot pass images. For example:

```python
from sglang import assistant, function, gen, image, user

@function
def test(s, img_path, question):
    s += user(image(img_path) + question + " Please answer yes or no.")
    s += assistant(gen('answer', choices=['yes', 'no']))
```
This will raise a RuntimeError, which seems to be caused by the cache system not working well with image input.
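
For reference, an invocation that hits this path might look roughly like the sketch below, assuming the sglang frontend's run API; the image path and question are placeholders:

```python
# Hypothetical call; 'cat.jpg' and the question are placeholders.
state = test.run(img_path="cat.jpg", question="Is there a cat in this image?")
print(state["answer"])  # expected 'yes' or 'no', but image input currently raises a RuntimeError
```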

@aliencaocao

Same observation here: a runtime error for image inputs.

@merrymercy merged commit 64fe311 into sgl-project:main on Mar 11, 2024