How to get more variation in the null image #27

kchodorow · 2022-09-06T22:05:37Z

I've been generating images using this model, which is delightfully fast, but I've noticed that it produces images that are all alike. I tried generating the "null" image by doing:

H = perceptor.encode_text(toks.to(device)).float()
z = net(0 * H)

This resulted in:

And indeed, everything I generated kind of matched that: you can see the fleshly protrusion on the left in "gold coin":

The object and matching mini-object in "tent":

And it always seems to try to caption the image with nonsense lettering ("lion"):

So I'm wondering if there's a way to "prime" the model and suggest it use a different zero image for each run. Is there a variable I can set, or is this deeply ingrained in training data?

Any advice would be appreciated, thank you!

(Apologies if this is the same as #8, but it sounded like #8 was solved by using priors which doesn't seem to help with this.)

The text was updated successfully, but these errors were encountered:

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

How to get more variation in the null image #27

How to get more variation in the null image #27

kchodorow commented Sep 6, 2022

How to get more variation in the null image #27

How to get more variation in the null image #27

Comments

kchodorow commented Sep 6, 2022