ExpressNet is an autoregressive BiLSTM-based decoder-only model with Additive Attention Mechanism (Bahdanau Attention). Developed by Anar Lavrenov, Head of AI at SPUNCH. For now ExpressNet is made for binary/multi classification tasks and there are plans for adding other tasks as well. The main distinguishing feature of this model is high perfomance without text preprocessing.
Clone this repository
!git clone https://github.com/anarlavrenov/ExpressNet
%cd ExpressNet
Import model
from ExpressNet.model import ExpressNet
Initialize model with your own hyperparameters
model = ExpressNet(
d_model=256,
vocab_size=len(vocab),
classification_type="multiclass",
n_classes=4
).to(device)
ExpressNet showed decent results on validatation on most of torchtext datasets. General parameters were used:
- No text preprocessing at all: no stopwords removal, no lemmatization etc.
basic english
torch tokenizer everywhere- d_model: 256 everywhere
- Primarily usage: playground for Machine Learning Researches and Data Scientists. You are very welcome to share your insights and recommendations.
- Secondary usage: baseline for most of classification tasks without any text preprocessing. If you want to achieve instant high validation accuracy - you are welcome to use ExpressNet.