Skip to content

A collection of tricks to speed up transformer inference

Notifications You must be signed in to change notification settings

s3nh/transformer-tricks

 
 

Repository files navigation

About

A collection of tricks to speed up transformer inference

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • TeX 90.5%
  • Jupyter Notebook 8.6%
  • Shell 0.9%