Neural Machine Translation

TensorFlow 2.0 implementation of the NLP paper by Bahdanau et al., "Neural Machine Translation by Jointly Learning to Align and Translate" (ICLR 2015).

For the detailed implementation with replicated results, see NMT_6400000.ipynb.

The implementation follows the specifications given by the authors:

AdaDelta Optimizer with epsilon = 10^-6, rho = 0.95
Minibatch SGD with batch_size = 80
Embedding Dimension = 620
Hidden Layer Size = 1000
Output Layer Size = 500
Weights initialization = RandomNormal with Mean = 0 and Standard Deviation = 0.001
Bias initialization = Zeros
L2 Regularization
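
To make the optimizer setting concrete, here is a minimal pure-Python sketch of a single AdaDelta update (Zeiler, 2012) using the epsilon and rho values listed above. It is not taken from the notebooks: the function name `adadelta_step` and the toy objective f(x) = x^2 are illustrative assumptions. In TensorFlow 2 the built-in counterpart is `tf.keras.optimizers.Adadelta`, which accepts `rho` and `epsilon` arguments.

```python
import math

RHO = 0.95       # decay rate from the specification list
EPSILON = 1e-6   # numerical stabiliser from the specification list


def adadelta_step(x, grad, avg_sq_grad, avg_sq_delta, rho=RHO, eps=EPSILON):
    """One AdaDelta update for a scalar parameter (hypothetical helper)."""
    # Running average of squared gradients.
    avg_sq_grad = rho * avg_sq_grad + (1 - rho) * grad ** 2
    # Step: ratio of RMS of past updates to RMS of gradients, times the gradient.
    delta = -math.sqrt(avg_sq_delta + eps) / math.sqrt(avg_sq_grad + eps) * grad
    # Running average of squared updates.
    avg_sq_delta = rho * avg_sq_delta + (1 - rho) * delta ** 2
    return x + delta, avg_sq_grad, avg_sq_delta


# Toy usage (assumption): minimise f(x) = x^2, whose gradient is 2x.
x, g2, d2 = 1.0, 0.0, 0.0
history = [x]
for _ in range(100):
    x, g2, d2 = adadelta_step(x, 2 * x, g2, d2)
    history.append(x)
```

Because both accumulators start at zero, the first steps are very small (on the order of the square root of epsilon), which is the usual slow start of AdaDelta.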

Total number of parameters = 28,332,000 (Encoder) + 3,003,001 (Attention) + 59,496,000 (Decoder) = 90,831,001
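
The encoder and attention counts can be reproduced by hand. The sketch below is an assumption about how the layers are laid out, not a description read from the notebooks: a 30,000-word vocabulary (the shortlist size used in the paper), a bidirectional GRU encoder with two bias vectors per gate (the Keras `reset_after=True` layout), and an additive attention block built from three dense layers with biases. The helper names are hypothetical.

```python
VOCAB = 30_000   # assumption: vocabulary size, as in the paper's shortlist
EMB = 620        # embedding dimension from the specification list
HID = 1000       # hidden layer size from the specification list


def gru_params(input_dim, units, biases_per_gate=2):
    """Parameters of one GRU layer: 3 gate blocks, each with an input kernel,
    a recurrent kernel, and (assumed) two bias vectors."""
    return 3 * (input_dim * units + units * units + biases_per_gate * units)


def dense_params(input_dim, units):
    """Parameters of a dense layer with bias."""
    return input_dim * units + units


# Encoder: embedding table plus forward and backward GRUs.
encoder = VOCAB * EMB + 2 * gru_params(EMB, HID)

# Attention: projection of the decoder state (HID -> HID), projection of the
# bidirectional annotations (2*HID -> HID), and the scoring vector (HID -> 1).
attention = dense_params(HID, HID) + dense_params(2 * HID, HID) + dense_params(HID, 1)

print(f"encoder   = {encoder:,}")
print(f"attention = {attention:,}")
```

Under these assumptions the two values agree with the encoder and attention figures quoted above.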

See Presentation.pdf for a more detailed walkthrough.
