block-sparse flash attention support #851
I saw flash attention was recently merged.
This approximate attention would be cool to have as well for training very large sequence lengths. https://github.com/HazyResearch/flash-attention/blob/main/flash_attn/flash_blocksparse_attention.py
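For anyone skimming the linked file, here is a rough, unoptimized sketch of the computation block-sparse attention performs, in plain PyTorch rather than the fused kernel the linked module wraps. The shapes, block size, and diagonal-plus-random layout below are illustrative assumptions, and the dense score matrix is materialized only for clarity; the whole point of the real kernel is to avoid exactly that.

```python
# Reference sketch of block-sparse attention (assumed shapes/layout, not the
# HazyResearch kernel): attention scores are masked so that only query/key
# blocks enabled in a boolean block-layout matrix attend to each other.
import math
import torch

def blocksparse_attention(q, k, v, block_layout, block_size):
    # q, k, v: (batch, heads, seq_len, head_dim)
    # block_layout: (heads, seq_len // block_size, seq_len // block_size) bool,
    #   True where a query block may attend to a key block.
    scores = q @ k.transpose(-2, -1) / math.sqrt(q.size(-1))
    # Expand the block-level layout to a token-level mask.
    mask = block_layout.repeat_interleave(block_size, dim=-2)
    mask = mask.repeat_interleave(block_size, dim=-1)  # (heads, seq, seq)
    scores = scores.masked_fill(~mask.unsqueeze(0), float("-inf"))
    return torch.softmax(scores, dim=-1) @ v

batch, heads, seq_len, head_dim, block_size = 2, 4, 256, 64, 32
q, k, v = (torch.randn(batch, heads, seq_len, head_dim) for _ in range(3))
n_blocks = seq_len // block_size
# Example layout: local (diagonal) blocks plus a few random global blocks,
# so every query row always has at least one enabled key block.
layout = torch.eye(n_blocks, dtype=torch.bool).expand(heads, -1, -1).clone()
layout |= torch.rand(heads, n_blocks, n_blocks) < 0.1
out = blocksparse_attention(q, k, v, layout, block_size)
print(out.shape)  # torch.Size([2, 4, 256, 64])
```

With a fixed block budget per row, the score/softmax work scales with the number of enabled blocks rather than the full seq_len x seq_len grid, which is what makes this attractive for very long sequences.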
Comments

Hello,
@natek-1 Welcome! Thank you for your contribution.
Hey @natek-1, do you have any updates on this? It's totally alright if you haven't gotten a chance to look at it. Would it be alright if we assigned it to someone else?