GitHub - dsshim0125/gaussian-ram at 620b9d770dc1e9c503937d2a0fc0e134abd62439

Branches Tags

Name		Name	Last commit message	Last commit date
Latest commit History 64 Commits
checkpoints		checkpoints
LICENSE		LICENSE
README.md		README.md
dataloader.py		dataloader.py
fig.png		fig.png
inference.py		inference.py
mnist_generation.py		mnist_generation.py
model.py		model.py
modules.py		modules.py
train.py		train.py
utils.py		utils.py

Repository files navigation

Gaussian RAM: Lightweight Image Classification via Stochastic Retina Inspired Glimpse and Reinforcement Learning

ICROS ICCAS 2020 Student Best Paper Finalist

Official PyTorch implementation of Gaussian-RAM

Abstract

Previous studies on image classification have been mainly focused on the performance of the networks, not on real-time operation or model compression. We propose a Gaussian Deep Recurrent visual Attention Model (GDRAM)- a reinforcement learning based lightweight deep neural network for large scale image classification that outperformsthe conventional CNN (Convolutional Neural Network) which uses the entire image as input. Highly inspired by the biological visual recognition process, our model mimics the stochastic location of the retina with Gaussian distribution. We evaluate the model on Large cluttered MNIST, Large CIFAR-10 and Large CIFAR-100 datasets which are resized to 128 in both width and height.

Dataset

Cluttered MNIST(download), CIFAR10 and CIFAR100 are used to train and evaluate. All the images are resized to 128 in both height and weight for generating high scale image.

Requirements

Python3
PyTorch (> 1.0)
torchvision (> 0.2)
PIL
NumPy

Training

python train.py --data_path --dataset --batch_size --lr --epochs --random_seed --log_interval --resume --checkpoint

Inference

python inference.py --data_path --dataset --random_seed --fast

Acknowledgement

This work was supported by Institute of Information & Communications Technology Planning & Evaluation(IITP) grant funded by the Korea government (MSIT) (No. 2019-0-01367, Infant-Mimic Neurocognitive Developmental Machine Learning from Interaction Experience with Real World (BabyMind))

References

[1] Dijkstra, E. W. (1968). Go to statement considered harmful. Communications of the ACM, 11(3), 147-148. [2] Dijkstra, E. W. (1968). Go to statement considered harmful. Communications of the ACM, 11(3), 147-148.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Gaussian RAM: Lightweight Image Classification via Stochastic Retina Inspired Glimpse and Reinforcement Learning

ICROS ICCAS 2020 Student Best Paper Finalist

Abstract

Dataset

Requirements

Training

Inference

Acknowledgement

References

About

Releases

Packages

Languages

License

dsshim0125/gaussian-ram

Folders and files

Latest commit

History

Repository files navigation

Gaussian RAM: Lightweight Image Classification via Stochastic Retina Inspired Glimpse and Reinforcement Learning

ICROS ICCAS 2020 Student Best Paper Finalist

Abstract

Dataset

Requirements

Training

Inference

Acknowledgement

References

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages