Core operations for flash-linear-attention

0.4.2 · active · verified Wed Apr 15

fla-core is a Python library providing efficient, Triton-based implementations of core operations and kernels for state-of-the-art linear-attention and state-space models. It is a minimal-dependency subset of the larger flash-linear-attention project, focusing on the fundamental computational building blocks. It is currently at version 0.4.2 and follows a regular release cadence, often in step with its parent project.

Warnings

fla-core requires a CUDA-capable GPU (its kernels are written in Triton); the quickstart below raises a RuntimeError on CPU-only machines.

Install
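Assuming the package is published on PyPI under the name shown above, it can be installed with pip:

```shell
# Install fla-core from PyPI (pulls in torch and triton as dependencies)
pip install fla-core
```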

Imports

Quickstart

This quickstart demonstrates the use of a fused normalization module from `fla-core`. It initializes `FusedRMSNormGated` and applies it to a dummy tensor on a CUDA-enabled GPU. This illustrates how to integrate low-level, optimized operations provided by `fla-core`.

import torch
from fla.modules import FusedRMSNormGated

# fla-core operations require a CUDA-enabled GPU
if not torch.cuda.is_available():
    raise RuntimeError("CUDA not available. fla-core requires a CUDA-enabled GPU.")

device = torch.device("cuda")

# Define model parameters
hidden_size = 768
batch_size = 4
sequence_length = 512

# Initialize FusedRMSNormGated module from fla-core
norm_layer = FusedRMSNormGated(hidden_size).to(device)

# Create dummy input and gate tensors; the gated norm normalizes the
# input and modulates the result with the gate
input_tensor = torch.randn(batch_size, sequence_length, hidden_size, device=device, dtype=torch.float16)
gate_tensor = torch.randn(batch_size, sequence_length, hidden_size, device=device, dtype=torch.float16)

# Perform a forward pass (FusedRMSNormGated expects both the input and a gate)
output_tensor = norm_layer(input_tensor, gate_tensor)

print(f"Input tensor shape: {input_tensor.shape}")
print(f"Output tensor shape: {output_tensor.shape}")
print("FusedRMSNormGated operation successful, demonstrating fla-core usage.")