Skip to content
forked from ROCm/Tensile

Stretching GPU performance for GEMMs and tensor contractions.

License

Notifications You must be signed in to change notification settings

rosenrodt/Tensile

 
 

Repository files navigation

A tool for creating a benchmark-driven backend library for GEMMs, GEMM-like problems (such as batched GEMM), N-dimensional tensor contractions, and anything else that multiplies two multi-dimensional objects together on a GPU.

See Tensile Wiki for documentation.

About

Stretching GPU performance for GEMMs and tensor contractions.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Python 58.1%
  • C++ 38.5%
  • CMake 2.5%
  • Shell 0.8%
  • Makefile 0.1%
  • Emacs Lisp 0.0%