An implementation of the Single Headed Attention Recurrent Neural Network (SHA-RNN) in Julia and Knet.
Stephen Merity. Single Headed Attention RNN: Stop Thinking With Your Head. arXiv preprint arXiv:1911.11423, 2019.
https://arxiv.org/abs/1911.11423v2
After downloading and preprocessing the data with

```sh
sh getdata.sh
```

you can train the main model of the SHA-RNN paper either by running sharnn-main.jl from the shell:

```sh
cd examples
julia sharnn-main.jl
```

or by using the SHA-RNN notebook.
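
If you prefer to stay inside a Julia session, the training script can also be included directly. This is a minimal sketch, not part of the repository's documented workflow; it assumes the repository root is the working directory and that a Project.toml with the dependencies exists there:

```julia
# Hypothetical REPL session; the environment setup below is an assumption.
using Pkg
Pkg.activate(".")      # activate the repository's environment, if it ships a Project.toml
Pkg.instantiate()      # install Knet and the other dependencies

include("examples/sharnn-main.jl")  # runs the same training entry point as the shell command
```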
This implementation is equivalent to Smerity's original sha-rnn implementation, but it is slower, since it does not use the performance tricks of the original PyTorch version:

- Fused layer normalization (it remains to be checked whether Apex's CUDA code can be used with Knet); a plain, unfused sketch is shown after this list
- Half-precision floating point (Float16) for memory efficiency
- A checkpointing feature similar to PyTorch's checkpoint
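
For reference, layer normalization as used throughout SHA-RNN can be written in a few lines of plain Julia. The sketch below is a hypothetical, unfused version (the struct and field names are assumptions, not this repository's API); a fused implementation such as Apex's computes the same result in a single CUDA kernel instead of several separate broadcast passes:

```julia
# Minimal, unfused layer normalization in plain Julia (hypothetical helper).
# In Knet, the gain and bias would typically be wrapped with `param` so that
# gradients are tracked; here they are plain arrays for clarity.
struct LayerNorm
    g             # learnable gain, one entry per feature
    b             # learnable bias, one entry per feature
    eps::Float64  # small constant for numerical stability
end

LayerNorm(dim::Integer; eps=1e-5) = LayerNorm(ones(Float32, dim), zeros(Float32, dim), eps)

function (ln::LayerNorm)(x)
    # Normalize over the feature dimension (assumed to be the first dimension).
    mu     = sum(x; dims=1) ./ size(x, 1)
    sigma2 = sum(abs2, x .- mu; dims=1) ./ size(x, 1)
    return ln.g .* (x .- mu) ./ sqrt.(sigma2 .+ ln.eps) .+ ln.b
end
```

On a GPU, each of these broadcast operations launches its own kernel, which is why a fused implementation is noticeably faster even though the arithmetic is identical.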