Core operations for flash-linear-attention
JSON →fla-core is a Python library providing efficient, Triton-based implementations of core operations and kernels for state-of-the-art linear attention and state-space models. It serves as a minimal-dependency subset of the larger 'flash-linear-attention' project, focusing on the fundamental computational building blocks. It is currently at version 0.4.2 and follows a regular release cadence, often in conjunction with its parent project, flash-linear-attention.
Traffic · last 30 days ↑147% vs prev 7d
total hits 63
actors 14 distinct systems
last hit 1d ago human
top countries 🇺🇸 United States · 🇸🇬 Singapore · 🇩🇪 Germany · 🇨🇦 Canada · 🇫🇷 France
API endpoints
full doc /v1/registry/fla-core
install /v1/registry/fla-core/install
compatibility /v1/registry/fla-core/compatibility