Core operations for flash-linear-attention

JSON →
library 0.4.2 ·python
verified May 24, 2026

fla-core is a Python library providing efficient, Triton-based implementations of core operations and kernels for state-of-the-art linear attention and state-space models. It serves as a minimal-dependency subset of the larger 'flash-linear-attention' project, focusing on the fundamental computational building blocks. It is currently at version 0.4.2 and follows a regular release cadence, often in conjunction with its parent project, flash-linear-attention.

total hits 63
actors 14 distinct systems
last hit 1d ago human
ByteDance
10
OAI-SearchBot
6
Amazonbot
4
MetaBot
4
GPTBot
2
Script
2
ChatGPT-User
2
Sogou
1
Google-Ads
1
Search engines
11
Humans
8

top countries 🇺🇸 United States · 🇸🇬 Singapore · 🇩🇪 Germany · 🇨🇦 Canada · 🇫🇷 France