
MoisesNNFS

This is my attempt to write a Deep Learning framework from scratch using only pure Numpy.

Warning: In progress.

Inspiration:

Challenges:

  • 😶 The biggest challenge is not just how to implement the operations, but how to organize the code so that the implementation won't come back to bite you later down the line.

Design philosophy:

  • 🥇 Modularity: everything should be modular and independent; this goes for layers, activation functions, losses, optimizers...
  • Simplicity: the code must remain simple and as easy to read as possible.
  • Efficiency: not the star of the show here, but I've been trying to optimize the code as much as possible without violating the second design principle (Simplicity).

Main Components:

  • Network: a container for the layers; takes in an optimizer and a loss.
  • Layers: the building blocks of the NN:
    • Usual NN layers: Dense, Conv2d, MaxPool2d ...
    • Activation layers: Relu, LeakyRelu, ... Note: the reason for formulating activations as layers is to make the gradient calculation much simpler and cleaner.
  • Losses: for measuring the error.
  • Optimizers: responsible for updating the layers' parameters.

Layers:

Layers are the main building block of any NN framework, and this one is no exception: everything is treated as a layer, whether it's a concrete NN layer (Dense/Conv2D/Pool...), a utility layer (Dropout/Reshape) or an activation function (the non-linearity part of the NN layer).

  • Concrete NN layers should be simple: they should only perform the basic linear operation; the activation layer is the part responsible for the non-linearity during the forward pass. Each concrete NN layer must define two fundamental methods (a sketch follows this list):
    • forward: takes an input, performs the linear part, and returns the output.
    • backward: takes the accumulated gradient (AG) from the next layer, calculates the gradients w.r.t. its weights and biases, and passes them to the optimizer; returns the accumulated gradient (aka error) for the previous layer.
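
A minimal sketch of what a concrete layer could look like under these assumptions; the class layout, attribute names and the way gradients are handed to the optimizer are illustrative, not the repo's exact API:

import numpy as np

class Dense:
    def __init__(self, input_size, output_size):
        # rows = nodes in the current layer, columns = nodes in the previous layer
        self.weights = np.random.randn(output_size, input_size) * 0.01
        self.biases = np.zeros((output_size, 1))

    def forward(self, x):
        self.input = x  # cache the input, it is needed in the backward pass
        return self.weights @ x + self.biases  # linear part only, the activation layer adds the non-linearity

    def backward(self, accumulated_grad):
        # gradients w.r.t. the parameters, to be handed to the optimizer
        self.weights_grad = accumulated_grad @ self.input.T
        self.biases_grad = accumulated_grad
        # accumulated gradient (aka error) returned to the previous layer
        return self.weights.T @ accumulated_grad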

Activation functions are treated as layers. They are responsible for (see the sketch below):

+ The non-linearity during the forward pass.
+ Calculating the accumulated gradient during the backward pass.
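
A minimal sketch of an activation layer under the same assumptions (illustrative, not the repo's exact code):

import numpy as np

class Tanh:
    def forward(self, x):
        self.output = np.tanh(x)  # the non-linearity applied during the forward pass
        return self.output

    def backward(self, accumulated_grad):
        # chain rule: multiply the incoming accumulated gradient by tanh'(x) = 1 - tanh(x)**2
        return accumulated_grad * (1 - self.output ** 2)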

Weights:

It doesn't matter much how they are represented; you're going to perform the transpose during the backward pass anyway. I like Michael's implementation: rows represent the nodes in the current layer.
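
A quick illustration of that convention, assuming column-vector inputs (the variable names are only for illustration):

import numpy as np

input_size, output_size = 784, 100
weights = np.random.randn(output_size, input_size)  # rows = nodes in the current layer
x = np.random.randn(input_size, 1)
output = weights @ x                                # forward pass: shape (output_size, 1)
output_grad = np.random.randn(output_size, 1)
input_grad = weights.T @ output_grad                # backward pass: the transpose shows up anyway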

Optimizers:

The optimization step (updating the network parameters) is performed by the optimizer.
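
A minimal sketch of what such an optimizer could look like, assuming layers expose their parameters and gradients as attributes (the update interface is an assumption, not the repo's exact one):

class SGD:
    def __init__(self, learning_rate=0.01):
        self.learning_rate = learning_rate

    def update(self, layer):
        # plain gradient-descent step on the layer's parameters
        layer.weights -= self.learning_rate * layer.weights_grad
        layer.biases -= self.learning_rate * layer.biases_grad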

Naming convention:

All names are underscore separated. Losses and activations come as function/derivative pairs: mse/mse_prime, cross_entropy/cross_entropy_prime. p / y_pred: the predicted value (probability); y / y_truth: the target. (See the loss sketch below.)

input_size
output_size
layers_name
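
For example, following that convention, a loss and its derivative could look like this (a sketch, not necessarily the repo's exact formulation):

import numpy as np

def mse(y_pred, y_truth):
    return np.mean((y_pred - y_truth) ** 2)

def mse_prime(y_pred, y_truth):
    # derivative of the MSE w.r.t. the prediction, used as the first accumulated gradient
    return 2 * (y_pred - y_truth) / y_truth.size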

Guidelines:

  • __init__ methods should never raise NotImplementedError.
  • I don't want to bother too much with type hinting; only use it where it's easy.
  • The code must remain simple. I don't want to deal with exception and error handling, and keeping it simple also makes translating the code to another library easy.

Tips:

  • Use a method instead of a function call whenever possible for faster computation (see the example below).
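
For example (the speed difference is usually small, but the method call avoids the dispatch overhead of the module-level wrapper):

import numpy as np

x = np.random.randn(32, 784)
s1 = np.sum(x)   # module-level function call
s2 = x.sum()     # equivalent method call, slightly less overhead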

Helper functions:

As is always the case, you're going to need helper functions. Since there will be many of them, they should be organized into data-manipulation utils and misc utils.
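
A possible layout, with hypothetical helpers just to show the split (none of these names are taken from the repo):

import numpy as np

# data-manipulation utils
def one_hot(labels, num_classes):
    out = np.zeros((labels.size, num_classes))
    out[np.arange(labels.size), labels] = 1
    return out

# misc utils
def shuffle_in_unison(x, y):
    idx = np.random.permutation(len(x))
    return x[idx], y[idx]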

import numpy as np

from MoisesNNFS.layers import Dense, Reshape
from MoisesNNFS.activations import Tanh, LeakyReLU
from MoisesNNFS.losses import MSE
from MoisesNNFS.optimizers import SGD
from MoisesNNFS.Network import Network

MLP = Network(optimizer=SGD(learning_rate=0.001, epochs=20, batch_size=32), loss=MSE())
MLP.add(Reshape((np.prod(data.shape), 1), input_shape=data.shape))
MLP.add(Dense(100))
MLP.add(Tanh())
MLP.add(Dense(100))
MLP.add(Tanh())

MLP.fit(training_data, learning_rate=0.001, epochs=20, batch_size=32)
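
# The layers can also be passed directly to the Network constructor (here, an autoencoder-style stack on 28x28 inputs):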
optimizer = SGD()
layers = [Reshape((1, 784), input_shape=(28, 28)),
    Dense(30),
    LeakyReLU(0.2),
    Dense(16),
    LeakyReLU(0.2),
    Dense(30),
    LeakyReLU(0.2),
    Dense(784),
    Reshape((28, 28))]
MLP = Network(layers=layers, optimizer=optimizer, loss=MSE())

MLP.fit(training_data, learning_rate=0.001, epochs=20, batch_size=32)
