Converging of the network for different data sets? #12

shamanez · 2017-12-26T05:29:46Z

I tried to train this network to identify 13 classes with 5 RGB images for each class.
One image is like this.

I modified the network to work with RGB. But even after 100000 iterations cannot see any kind of convergence.

Do you think this network is not capable of remembering information in above-mentioned images? Because in character data set the information is not complex as much as in above type images.

snowkylin · 2018-01-19T02:33:04Z

I am not very sure about this, but generally NTM is harder to converge compared with other DL models, which is a main drawback of this model.

albertchristianto · 2018-07-25T01:05:56Z

Hi @snowkylin, I also faced some problem with RGB images. Have you tried MANN implementation for RGB images.
I have tried your code and this MANN code (https://github.com/hmishra2250/NTM-One-Shot-TF) for RGB images. hmishra2250's code converged but it takes a long time.
On the other hand, your code even can't converged.
I am just curious what is the different between your code and hmishra2250's code?
I have tried to compare your code and hmishra2250's code. But, because I am still new on this topic, I haven't (maybe couldn't) found any useful information. Thanks in advanced.
Best regards,
Albert

snowkylin · 2018-07-25T06:36:42Z

@albertchristianto No, I haven't tried RGB images. Maybe you can try to convert images into gray scale and see if there is any change on the performance of the model. Sorry that I am busy these days and cannot afford much time to analyse other codes.

shamanez · 2018-07-25T13:24:28Z

@albertchristianto what about the accuracy? It is working or just converging?

albertchristianto · 2018-07-25T13:35:55Z

@snowkylin I extracted the feature using VGG16 then use it as my MANN input. it's okay. Thank you for your quick response.
@shamanez for the hmishra2250's code converged (also the accuracy) but it takes a long time, also hmishra2250's code didn't have very clear explanation about what happen in the training process, therefore i am kind of doubting the result. I have a hard time to modify the parameter from hmishra2250's code.

shamanez · 2018-07-30T00:28:42Z

@albertchristianto check whether he used something like weight clipping or gradient norm. Because for me this cord is error free algorithmic wise.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Converging of the network for different data sets? #12

Converging of the network for different data sets? #12

shamanez commented Dec 26, 2017 •

edited

Loading

snowkylin commented Jan 19, 2018

albertchristianto commented Jul 25, 2018

snowkylin commented Jul 25, 2018

shamanez commented Jul 25, 2018

albertchristianto commented Jul 25, 2018 •

edited

Loading

shamanez commented Jul 30, 2018

Converging of the network for different data sets? #12

Converging of the network for different data sets? #12

Comments

shamanez commented Dec 26, 2017 • edited Loading

snowkylin commented Jan 19, 2018

albertchristianto commented Jul 25, 2018

snowkylin commented Jul 25, 2018

shamanez commented Jul 25, 2018

albertchristianto commented Jul 25, 2018 • edited Loading

shamanez commented Jul 30, 2018

shamanez commented Dec 26, 2017 •

edited

Loading

albertchristianto commented Jul 25, 2018 •

edited

Loading