
AutoEncoder using the EfficientNet #257

Open · wants to merge 10 commits into master
Conversation

xingyaoww
The AutoEncoder is implemented by reversing the forward EfficientNet to act as a decoder. The current implementation only uses dynamic padding for TransposedConv2d, which works fine for me for now.
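The general idea of "reversing" a convolutional encoder can be illustrated with a minimal sketch (this is not the PR's actual code; layer sizes and the plain Conv2d/ConvTranspose2d layers are assumptions for illustration):

```python
import torch
import torch.nn as nn

# Hypothetical sketch: a decoder built by mirroring an encoder's
# stride-2 Conv2d stack with ConvTranspose2d layers in reverse order.
encoder = nn.Sequential(
    nn.Conv2d(3, 16, 3, stride=2, padding=1), nn.ReLU(),
    nn.Conv2d(16, 32, 3, stride=2, padding=1), nn.ReLU(),
)
decoder = nn.Sequential(
    nn.ConvTranspose2d(32, 16, 3, stride=2, padding=1, output_padding=1), nn.ReLU(),
    nn.ConvTranspose2d(16, 3, 3, stride=2, padding=1, output_padding=1),
)

x = torch.randn(1, 3, 64, 64)       # 64 -> 32 -> 16 in the encoder
recon = decoder(encoder(x))          # 16 -> 32 -> 64 in the decoder
assert recon.shape == x.shape
```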

@lukemelas
Owner

Thanks for this PR! Very interesting. I'll have to think about whether this should be integrated into the main repo or whether it should be a standalone repo. Either way, we'll make sure the community can benefit from this good work!

I might be a bit slow to respond over the next week or two due to the holidays, so do not fret if that is the case.

Commits: "…image size issue; add latent feature by down/upsampling between encoder and decoder"
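The "latent feature by down/upsampling" idea from the commit message can be sketched as follows (a hypothetical illustration, not the PR's code; the feature-map and latent sizes are assumptions):

```python
import torch
import torch.nn.functional as F

# Squeeze the encoder's feature map down to a fixed-size latent,
# then upsample it back to the spatial size the decoder expects.
feat = torch.randn(1, 320, 16, 16)            # assumed encoder output
latent = F.adaptive_avg_pool2d(feat, 4)       # downsample to a 4x4 latent
restored = F.interpolate(latent, size=feat.shape[-2:], mode="nearest")
assert restored.shape == feat.shape
```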
@xingyaoww
Author

Thank you for your reply!

I just updated my implementation of the AE with TransposedConv2dStaticSamePadding, since the original version didn't take odd image sizes into consideration. For example, when Conv2d reduces the image size from (29,29) to (15,15), the reverse TransposedConv2d operation should convert (15,15) back to (29,29), not (30,30).

The old implementation using TransposedConv2dDynamicSamePadding converts the image size to (30,30), causing an output-shape mismatch. DynamicSamePadding only seems to work for EfficientNet models whose intermediate image sizes stay even (it works for efficientnet-b0 but not efficientnet-b5), so I am also removing TransposedConv2dDynamicSamePadding in recent commits.
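The root of the issue is that stride-2 "same" downsampling is many-to-one: both (29,29) and (30,30) map to (15,15), so the transposed convolution cannot recover the original size on its own. A small sketch with plain PyTorch layers (not the PR's padding classes) demonstrates the ambiguity and one way to resolve it via `output_size`:

```python
import torch
import torch.nn as nn

# Both 29 and 30 map to 15 under a stride-2 "same" convolution,
# so the inverse op is ambiguous without extra information.
conv = nn.Conv2d(1, 1, kernel_size=3, stride=2, padding=1)
for s in (29, 30):
    out = conv(torch.randn(1, 1, s, s))
    assert out.shape[-1] == 15  # both input sizes collapse to 15

# Passing the desired output_size to ConvTranspose2d resolves the
# ambiguity (PyTorch adjusts the output padding internally).
deconv = nn.ConvTranspose2d(1, 1, kernel_size=3, stride=2, padding=1)
x = torch.randn(1, 1, 15, 15)
assert deconv(x, output_size=(29, 29)).shape[-1] == 29
assert deconv(x, output_size=(30, 30)).shape[-1] == 30
```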

@AFAgarap

Hello. Will this be merged?

@leejonggun

leejonggun commented Dec 30, 2021

Great pull request! I am trying EfficientNetAutoEncoder.from_pretrained() and wondering whether the shapes below are correct. As I understand it, an autoencoder is an unsupervised model, so the input and output shapes should be the same. Yet the autoencoder output differs across efficientnet-b0 through b7 as shown below. Could you tell me whether this is expected or a bug?
b0: input/(512,512) -> ae_output/(512,512)
b1: input/(512,512) -> ae_output/(496,496)
b2: input/(512,512) -> ae_output/(484,484)
b3: input/(512,512) -> ae_output/(492,492)
b4: input/(512,512) -> ae_output/(508,508)
b5: input/(512,512) -> ae_output/(488,488)
b6: input/(512,512) -> ae_output/(496,496)
b7: input/(512,512) -> ae_output/(504,504)
(I'm looking into the code, but it's difficult ;) Thanks in advance if you can help me.)

@cwerner

cwerner commented Jun 4, 2022

Also looking forward to this PR being merged 👍
