Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

How to get more variation in the null image #27

Open
kchodorow opened this issue Sep 6, 2022 · 0 comments
Open

How to get more variation in the null image #27

kchodorow opened this issue Sep 6, 2022 · 0 comments

Comments

@kchodorow
Copy link

I've been generating images using this model, which is delightfully fast, but I've noticed that it produces images that are all alike. I tried generating the "null" image by doing:

H = perceptor.encode_text(toks.to(device)).float()
z = net(0 * H)

This resulted in:

base image

And indeed, everything I generated kind of matched that: you can see the fleshly protrusion on the left in "gold coin":

gold-coin--0 0

The object and matching mini-object in "tent":

tent-0 5

And it always seems to try to caption the image with nonsense lettering ("lion"):

lion--0 0

So I'm wondering if there's a way to "prime" the model and suggest it use a different zero image for each run. Is there a variable I can set, or is this deeply ingrained in training data?

Any advice would be appreciated, thank you!

(Apologies if this is the same as #8, but it sounded like #8 was solved by using priors which doesn't seem to help with this.)

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant