Repository of the paper "Accelerating Transformer Inference for Translation via Parallel Decoding"
natural-language-processing
deep-learning
neural-network
transformers
jacobi-iteration
decoding-algorithm
parallel-decoding
jacobi-decoding
Updated Mar 15, 2024 - Python