Kernel Case Study: Flash Attention | Towards Data Science

Understanding all versions of Flash Attention through a Triton implementation

Source: Towards Data Science
