GitHub - MootezSaaD/csci6908-a1-pt2

The following repository contains the replication package to train an LSTM model for next word generation conditioned on literary genres.

Files and Folder Structure

data: contains training, test and validation data in form of pickle files. It also contains vocab dictionary and the genres mapping. They can be downloaded from the following Google Drive folder: https://drive.google.com/drive/folders/1SaBaq1KmAvj9K-4a9vvsdxzp8iOniRYL?usp=sharing
logs: Contains Tensorboard logs from my experiment.
model2: This package contains the model implementation, data classes and utility functions for I/O operations and training.
checkpoint: Contains the model's weights after training.
Data_Preparation.ipynb: Jupyter notebook used to generate the data provided in the Google Drive folder.
train_attn.py Main script that contains the logging, hyperparemeters values, and training/testing loops.
M2Inference.ipynb: Jupyter notebook used for inference (text generation).

Training

(Option) First step is to generate the data files. This an optional step as training files are already provided.
To train the model, execute the the following:

python train_attn.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Files and Folder Structure

Training

About

Releases

Packages

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
checkpoint		checkpoint
data		data
logs		logs
model2		model2
Data_Preparation.ipynb		Data_Preparation.ipynb
LICENSE		LICENSE
M2Inference.ipynb		M2Inference.ipynb
README.md		README.md
train_attn.py		train_attn.py

License

MootezSaaD/csci6908-a1-pt2

Folders and files

Latest commit

History

Repository files navigation

Files and Folder Structure

Training

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages