Releases: bdashore3/flash-attention

v2.6.3

26 Jul 00:14

Synced to the upstream version.

NOTE: Backward and dropout are disabled, meaning that this release is INFERENCE ONLY.

This is because including these features more than doubles the build time and makes the GitHub Action time out. If you want these features, please raise an issue on the parent repo to help reduce the build times.
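Since backward and dropout are compiled out, usage should stay inference-only. A minimal sketch under that assumption, using the standard flash_attn Python API (shapes are illustrative):

```python
# Inference-only use of this build. Backward is disabled, so keep calls
# under torch.no_grad(); dropout is disabled, so dropout_p must stay 0.0.
import torch
from flash_attn import flash_attn_func

batch, seqlen, nheads, headdim = 2, 1024, 8, 64
q = torch.randn(batch, seqlen, nheads, headdim, device="cuda", dtype=torch.float16)
k = torch.randn_like(q)
v = torch.randn_like(q)

with torch.no_grad():
    out = flash_attn_func(q, k, v, dropout_p=0.0, causal=True)
print(out.shape)  # (batch, seqlen, nheads, headdim)
```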

v2.6.1

12 Jul 00:06
Actions: Switch to CUDA 12.3

Signed-off-by: kingbri <[email protected]>

v2.5.9.post2

09 Jul 23:28
Pre-release

A quick release to add the softcapping commits. Does not include backward, dropout, or ALiBi support.
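For reference, a minimal sketch of the softcapping path, assuming the `softcap` keyword that the upstream flash_attn API exposes for this feature (the cap value 30.0 is only an example):

```python
# Softcapping sketch: with softcap > 0, attention scores are capped as
# softcap * tanh(scores / softcap) before the softmax.
# Assumes the upstream flash_attn API's `softcap` keyword.
import torch
from flash_attn import flash_attn_func

q = torch.randn(1, 512, 8, 64, device="cuda", dtype=torch.bfloat16)
k = torch.randn_like(q)
v = torch.randn_like(q)

out = flash_attn_func(q, k, v, causal=True, softcap=30.0)
```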

v2.5.9.post1

28 May 01:45
Actions: Clarify dispatch formatting

Signed-off-by: kingbri <[email protected]>

v2.5.8

28 Apr 07:55

Same as the upstream tag.

Now built only for torch 2.2.2 and 2.3.0.

v2.5.6

30 Mar 20:34

v2.5.2

07 Feb 22:37

Same as the upstream tag

Adds a PR to help fix building on Windows.

v2.4.2

03 Feb 00:29

In line with the parent repo's tag.

Built for CUDA 12.x and PyTorch 2.1.2 and 2.2.

v2.4.3 and up cannot be built on Windows at this time.

v2.4.1

25 Dec 06:26
Add Windows workflows

2.3.3-windows

18 Nov 23:37

In parity with the original tag

Built with PyTorch 2.1.1 and CUDA 12.2. This wheel will work with PyTorch 2.1+ and CUDA 12+.
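Before installing, a quick sanity check (a convenience sketch, not part of the release) that the local PyTorch and CUDA versions fall in that range:

```python
# Environment sanity check before installing the prebuilt wheel.
# Expects PyTorch 2.1+ built against CUDA 12.x.
import torch

print(torch.__version__)   # expect 2.1 or newer
print(torch.version.cuda)  # expect "12.x"
assert torch.cuda.is_available(), "A CUDA device is required for flash-attention"
```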

Full Changelog: https://github.com/bdashore3/flash-attention/commits/2.3.3