TensorMONK

A collection of deep learning architectures (a PyTorch implementation).

Dependencies

python 3.6
PyTorch > 0.4.1
torchvision
visdom

How to train ImageNet?

If you have multiple NVIDIA GPUs and CPU cores available, adjust the batch size (BSZ), number of GPUs (gpus), and number of threads (cpus) in ./ImageNet.sh. Next, select one of the available architectures and set your train and validation folder locations (trainDataPath and testDataPath). Finally, run ./ImageNet.sh.

How to train CapsuleNet?

To replicate the capsule network paper (Sabour, Frosst & Hinton, "Dynamic Routing Between Capsules") on MNIST, run the following:

python Capsule.py -A capsule -B 256 -E 500 --optimizer adam --gpus 2 --cpus 6 --trainDataPath ./data --testDataPath ./data --replicate_paper

Omit the --replicate_paper argument to create a deeper architecture (with a few residual blocks before the primary capsule). You can add essentially any block available in NeuralLayers to build a deeper network, which is then followed by a primary capsule and a secondary capsule. Keep two things in mind: 1. if you do reconstruction, update the reconstruction network to match tensor_size, 2. capsule nets require a good amount of GPU RAM.
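To make the first point concrete: in the paper's MNIST setup the decoder is three fully connected layers (512, 1024, then one output per pixel), fed by the 10 x 16 class-capsule outputs. The sketch below derives those layer widths from tensor_size so that the last layer follows the input shape. `reconstruction_layer_sizes` is a hypothetical helper, not part of TensorMONK, and it only assumes that the first entry of tensor_size is the batch dimension.

```python
from functools import reduce
from operator import mul


def reconstruction_layer_sizes(tensor_size, n_classes=10, capsule_dim=16,
                               hidden=(512, 1024)):
    """Return (in_features, out_features) pairs for a fully connected decoder.

    Hypothetical helper: the name and defaults are illustrative, with the
    defaults taken from the capsule paper's MNIST decoder.
    """
    # Everything after the batch dimension must be reconstructed.
    n_values = reduce(mul, tensor_size[1:], 1)
    widths = [n_classes * capsule_dim] + list(hidden) + [n_values]
    # Each pair would map to one linear layer, e.g. torch.nn.Linear(i, o).
    return list(zip(widths[:-1], widths[1:]))


# MNIST-sized input: the last layer emits 28 * 28 = 784 values.
print(reconstruction_layer_sizes((1, 1, 28, 28)))
# CIFAR10-sized input: only the final width changes, to 3 * 32 * 32 = 3072.
print(reconstruction_layer_sizes((1, 3, 32, 32)))
```

If the input size changes and the decoder is left untouched, the reconstruction loss compares tensors of different sizes, which is why the last width has to track tensor_size.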

Generative Adversarial Networks (GAN)

Progressive Growing of GANs

Trained on CIFAR10 (pggan_cifar10.py). The current results need more training, which in turn needs more GPUs!

Details on core (NeuralArchitectures, NeuralEssentials, NeuralLayers)

NeuralArchitectures

ResidualNet -- use type = r18/r34/r50/r101/r152 or rn50/rn101/rn152 for ResNeXt or ser50/ser101/ser152 for Squeeze-and-Excitation Networks or sern50/sern101/sern152 (ResNeXt + Squeeze-and-Excitation Networks). Pretrained weights are available for r18, r34, r50, r101, & r152.
DenseNet -- use type = d121/d169/d201/d264 (see Table 1 in the DenseNet paper). Pretrained weights are available for d121, d169, & d201.
InceptionV4
MobileNetV1
MobileNetV2
ShuffleNet (type = g1, g2, g3, g4, g8, see Table 1 in the ShuffleNet paper)
CapsuleNet (Hinton's version, and an example deep network)
LinearVAE
ConvolutionalVAE
SimpleNet
PGGAN
ContextNet (bottleneck layer of ContextNet is implemented in NeuralLayers/CarryResidue/)
PointNet
UNet
- UNetPatch -- Works on small patches.
NeuralDecisionForest
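The ResidualNet type codes listed above combine a family prefix with a depth. As a reading aid, the sketch below decodes such a string into a (family, depth) pair and rejects combinations the list does not mention. `parse_residual_type` and the family labels are hypothetical; this is not how TensorMONK itself parses the argument.

```python
import re

# Prefix -> family label, following the ResidualNet entry above.
_FAMILIES = {
    "r": "ResNet",
    "rn": "ResNeXt",
    "ser": "SE-ResNet",
    "sern": "SE-ResNeXt",
}
# Depths listed for each prefix in the ResidualNet entry.
_DEPTHS = {
    "r": {18, 34, 50, 101, 152},
    "rn": {50, 101, 152},
    "ser": {50, 101, 152},
    "sern": {50, 101, 152},
}


def parse_residual_type(code):
    """Split a type code such as 'sern101' into (family, depth)."""
    match = re.fullmatch(r"(sern|ser|rn|r)(\d+)", code.lower())
    if match is None:
        raise ValueError("unrecognised type code: %r" % code)
    prefix, depth = match.group(1), int(match.group(2))
    if depth not in _DEPTHS[prefix]:
        raise ValueError("depth %d is not listed for prefix %r" % (depth, prefix))
    return _FAMILIES[prefix], depth


print(parse_residual_type("r18"))
print(parse_residual_type("sern101"))
```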

NeuralEssentials

BaseModel -- A base class that contains networks (embedding, loss, or any other), meters (loss, accuracy, etc.), fileName, and isCUDA
CudaModel -- Converts any model (PyTorch module) to run on a single GPU, multiple GPUs, or the CPU
LoadModel -- Loads pretrained models (usually from ./models)
SaveModel -- Saves models (usually the state_dict of anything that starts with net in BaseModel, and the rest as is)
MakeModel -- Builds model using base class
- MakeCNN -- Creates a CNN (netEmbedding) and loss layer (netLoss)
- MakeAE -- Creates an auto-encoder/vae in netAE
FolderITTR -- PyTorch image folder iterator with a few extras.
MNIST -- MNIST train and test dataset loader
CIFAR10 -- CIFAR10 train and test dataset loader
MakeGIF -- Given a list of images creates a gif
VisPlots -- Visdom wrapper to visualize weight histograms, responses, and weights (see SimpleMNIST.py)
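The SaveModel rule described above (state_dict for attributes whose names start with net, everything else stored as is) can be sketched as follows. This is an illustration of the rule only, not the library's code: `collect_saveable`, `FakeNet`, and `FakeBaseModel` are hypothetical stand-ins, and plain Python objects are used in place of PyTorch modules so the example runs without a GPU or PyTorch installed.

```python
def collect_saveable(model):
    """Build a dict that follows the rule described for SaveModel."""
    saved = {}
    for name, value in vars(model).items():
        if name.startswith("net") and hasattr(value, "state_dict"):
            # Networks are reduced to their parameters.
            saved[name] = value.state_dict()
        else:
            # Meters, file names, flags, etc. are kept unchanged.
            saved[name] = value
    return saved


class FakeNet:
    """Stand-in for a PyTorch module: only provides state_dict()."""

    def __init__(self, weights):
        self.weights = weights

    def state_dict(self):
        return dict(self.weights)


class FakeBaseModel:
    """Stand-in for BaseModel, using the attribute names from the list above."""

    def __init__(self):
        self.netEmbedding = FakeNet({"conv.weight": [0.1, 0.2]})
        self.netLoss = FakeNet({"fc.weight": [0.3]})
        self.fileName = "./models/simplenet"
        self.isCUDA = False


saved = collect_saveable(FakeBaseModel())
print(sorted(saved))
```

With real modules, the resulting dict is the kind of object one would typically hand to torch.save.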

NeuralLayers

Convolution -- A convolution layer with the following parameters:
- tensor_size = a list/tuple of length 4 (BxCxHxW, the usual PyTorch layout - any B should work)
- filter_size = int/list/tuple (if list/tuple, length must be 2)
- out_channels = int
- strides = int/list/tuple (if list/tuple, length must be 2)
- pad = True/False (True keeps the output the same size as the input when strides = 1; False performs a valid convolution, with no padding)
- activation = relu/relu6/lklu (leaky relu)/tanh/sigm (sigmoid)/maxo (maxout)/swish
- dropout = 0. to 1. (adds dropout layer)
- normalization = None/"batch"/"group"/"instance"/"layer"/"pixelwise"
- pre_nm = True/False (when True with normalization = "batch", the order is batch normalization + activation + convolution; otherwise it is convolution + batch normalization + activation)
- groups = 1 (default)
- weight_nm = True/False
- equalized = True/False
- shift = True/False -- requires a 3x3 kernel and at least 9 input channels
ConvolutionTranspose -- A convolution transpose layer with parameters same as Convolution layer
CarryResidue -- Contains several layers that require residual connections or concatenation
- ResidualOriginal
- ResidualComplex
- ResidualNeXt
- SEResidualComplex
- SEResidualNeXt
- ResidualInverted
- ResidualShuffle
- SimpleFire
- DenseBlock
- Stem2
- InceptionA
- InceptionB
- InceptionC
- ReductionA
- ReductionB
- ContextNet_Bottleneck
PrimaryCapsule
RoutingCapsule
LossFunctions
- CapsuleLoss
- CategoricalLoss -- Cross entropy / softmax / taylor softmax / large margin cosine loss / large-margin Gaussian Mixture
- CenterLoss
- TripletLoss
- DiceLoss / Tversky Loss
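Returning to the pad and strides parameters of Convolution: the effect on the spatial size can be checked with the standard convolution arithmetic. The sketch below is not TensorMONK code. `conv_output_size` is a hypothetical helper, and it assumes that pad = True means a padding of filter_size // 2 on each side, which gives the "same size at strides = 1" behaviour only for odd filter sizes.

```python
def conv_output_size(size, filter_size, strides=1, pad=True):
    """Spatial output size of a convolution along one dimension.

    Assumption (not confirmed by the library): pad=True adds
    filter_size // 2 pixels on each side; pad=False adds none.
    """
    padding = filter_size // 2 if pad else 0
    return (size + 2 * padding - filter_size) // strides + 1


# pad=True with strides=1 keeps the size for an odd filter.
print(conv_output_size(28, 3, strides=1, pad=True))   # 28
# Valid convolutions from the capsule paper on 28x28 MNIST:
# a 9x9 filter at stride 1, then a 9x9 filter at stride 2.
first = conv_output_size(28, 9, strides=1, pad=False)
print(first)                                           # 20
print(conv_output_size(first, 9, strides=2, pad=False))  # 6
```

The 20 and 6 match the 20x20 feature map and 6x6 primary capsule grid reported in the capsule paper, which is a quick way to sanity-check a tensor_size before stacking further layers.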
