Image GPT Support #282

appvoid · 2023-06-24T14:06:20Z

https://github.com/openai/image-gpt

i don't know how hard would be to implement this model. It seems to have the same architecture as gpt-2.

What is fantastic though it's the possibilities around this model. Just imagine having a stable diffusion like model in your raspberry pi. And even though the 32x32 size is pretty limited, there is already things like realesrgan which amplifies the resolution upto 4 times!

128x128 high-quality images without any dependencies, please considered at least!

appvoid · 2023-06-24T16:33:25Z

I'm pretty bad at these things (I'm just a python guy) but here's a high level overview if anyone wants to try:

This involves reshaping the images into a 1D sequence and applying the transformer decoder to predict the pixels.
They define a "mask" that randomly hides certain elements (pixels) in an image using BERT. I think we have it here: bert.cpp
Parameter counts are s:76M, m:455M, l:1362M (Don't know yet if they published xl)

To give an idea on how good this could be:

Test image (64x64)

Native 4x upscaling (256x256)

I used an ncnn implementation of realesrgan to upscale it. Meaning that this could be a good oportunity of doing a pretty cool spinoff with the ncnn community hence proving that open source projects yet similar, can cohexist and work together somehow.

appvoid · 2023-06-27T01:00:30Z

Closing this as it seems like there is not any interest in it in general. It appears to be just an image completion tool and since projects like this already exist, it doesn't worth it trying at least for now.

@nullhook

* working but ugly * add arg flag, not working on embedding mode * typo * Working! Thanks to @nullhook * make params argument instead of hardcoded boolean. remove useless time check * start doing the instructions but not finished. This probably doesnt compile * Embeddings extraction support --------- Co-authored-by: Georgi Gerganov <[email protected]>

appvoid closed this as completed Jun 27, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Image GPT Support #282

Image GPT Support #282

appvoid commented Jun 24, 2023

appvoid commented Jun 24, 2023 •

edited

Loading

appvoid commented Jun 27, 2023

Image GPT Support #282

Image GPT Support #282

Comments

appvoid commented Jun 24, 2023

appvoid commented Jun 24, 2023 • edited Loading

appvoid commented Jun 27, 2023

appvoid commented Jun 24, 2023 •

edited

Loading