This project is a simple from-scratch implementation of the Transformer model in Python. It is intended for educational use, to deepen understanding of the Transformer architecture. Feel free to experiment with and modify the code.
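For orientation, the core operation of the Transformer architecture is scaled dot-product attention, softmax(QK^T / sqrt(d_k))V. The sketch below is a standalone illustration in plain Python under simplifying assumptions (a single head, no masking, no batching). It is not taken from this repository, and the names `softmax` and `scaled_dot_product_attention` are hypothetical.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def scaled_dot_product_attention(queries, keys, values):
    """Compute softmax(Q K^T / sqrt(d_k)) V for lists of vectors.

    Illustrative only: single head, no masking, no batching.
    """
    d_k = len(keys[0])
    outputs = []
    for q in queries:
        # Dot product of this query with every key, scaled by sqrt(d_k).
        scores = [
            sum(qi * ki for qi, ki in zip(q, k)) / math.sqrt(d_k)
            for k in keys
        ]
        weights = softmax(scores)
        # Output is the attention-weighted sum of the value vectors.
        outputs.append([
            sum(w * v[j] for w, v in zip(weights, values))
            for j in range(len(values[0]))
        ])
    return outputs


if __name__ == "__main__":
    q = [[1.0, 0.0]]
    k = [[1.0, 0.0], [0.0, 1.0]]
    v = [[1.0, 2.0], [3.0, 4.0]]
    print(scaled_dot_product_attention(q, k, v))
```

Each output row is a weighted average of the value vectors, with more weight on values whose keys align with the query.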
- Clone the repository:
  git clone https://github.com/rushizirpe/transformer-from-zero.git
  cd transformer-from-zero
- Run the main script:
  python train.py
- This project is not optimized for production use.