-
Notifications
You must be signed in to change notification settings - Fork 21.7k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Add auto-tuning for sparse semi-structured MM operator #123742
base: gh/alexsamardzic/27/base
Are you sure you want to change the base?
Conversation
[ghstack-poisoned]
🔗 Helpful Links🧪 See artifacts and rendered test results at hud.pytorch.org/pr/123742
Note: Links to docs will display an error until the docs builds have been completed. ✅ You can merge normally! (1 Unrelated Failure)As of commit c32f481 with merge base 747b38c (): BROKEN TRUNK - The following job failed but were present on the merge base:👉 Rebase onto the `viable/strict` branch to avoid these failures
This comment was automatically generated by Dr. CI and updates every 15 minutes. |
cc voznesenskym penguinwu EikanWang jgong5 Guobing-Chen XiaobingSuper zhuhaozhe blzheng wenzhe-nrv jiayisunx peterbell10 ipiszy yf225 chenyang78 kadeng muchulee8 aakhundov ColinPeppler amjames desertfire chauhang [ghstack-poisoned]
ghstack-source-id: 2b6a9f919c421d217f9c8db387bb2ebb8a968ac1 Pull Request resolved: #123742
cc voznesenskym penguinwu EikanWang jgong5 Guobing-Chen XiaobingSuper zhuhaozhe blzheng wenzhe-nrv jiayisunx peterbell10 ipiszy yf225 chenyang78 kadeng muchulee8 aakhundov ColinPeppler amjames desertfire chauhang [ghstack-poisoned]
Note that the generated kernels will still sometimes fail because of alignment issues, am investigating this. |
cc voznesenskym penguinwu EikanWang jgong5 Guobing-Chen XiaobingSuper zhuhaozhe blzheng wenzhe-nrv jiayisunx peterbell10 ipiszy yf225 chenyang78 kadeng muchulee8 aakhundov ColinPeppler amjames desertfire chauhang [ghstack-poisoned]
ghstack-source-id: f446525b13764d8e3015319edd874df4ca65ecb5 Pull Request resolved: #123742
cc voznesenskym penguinwu EikanWang jgong5 Guobing-Chen XiaobingSuper zhuhaozhe blzheng wenzhe-nrv jiayisunx peterbell10 ipiszy yf225 chenyang78 kadeng muchulee8 aakhundov ColinPeppler amjames desertfire chauhang [ghstack-poisoned]
ghstack-source-id: a168fbb345de8133fbd0e6eac377f0bbaa12865c Pull Request resolved: #123742
This PR is ready for a review now. |
ghstack-source-id: f12b5627ced184a554a22b43acc06080946b078e Pull Request resolved: #123742
ghstack-source-id: 99f16df9425237ef19ffcb634776092d15258a72 Pull Request resolved: #123742
ghstack-source-id: 638222db69943b0be953f74f3965c179bc946bf6 Pull Request resolved: #123742
ghstack-source-id: 92c190ae61417cdb46333fb09e83accf2e124d91 Pull Request resolved: #123742
ghstack-source-id: 33648e99ee3837ebd72d5163d0965825cdbf297a Pull Request resolved: #123742
ghstack-source-id: 24f8733a2fcbf5230e89ccf1bd87efd49bdf670d Pull Request resolved: #123742
Added test validating that, on SM80 arch, at least one working CUTLASS candidate kernel is produced. |
ghstack-source-id: 38d1b8625d1c0caf83351e279c2d03c864111f9c Pull Request resolved: #123742
ghstack-source-id: 4cd28e35f49e0035c8409c459f762835a43e9292 Pull Request resolved: #123742
ghstack-source-id: 8860aa6a5190f7581b079f0a44cf028c478f7c13 Pull Request resolved: #123742
ghstack-source-id: 4f374aafef837ceaa6dca83c414567b64108c42c Pull Request resolved: #123742
ghstack-source-id: d85d11ffda50af05933eb49c28ed652f0ecd6f4a Pull Request resolved: #123742
Stack from ghstack (oldest at bottom):
cc @voznesenskym @penguinwu @EikanWang @jgong5 @Guobing-Chen @XiaobingSuper @zhuhaozhe @blzheng @wenzhe-nrv @jiayisunx @peterbell10 @ipiszy @yf225 @chenyang78 @kadeng @muchulee8 @aakhundov @ColinPeppler @amjames @desertfire @chauhang