SageAttention

JSON →
library 2.0.1 ·python
verified May 27, 2026

SageAttention is a Python library providing accurate and efficient 8-bit plug-and-play attention mechanisms, including Mixture-of-Experts (MoE) implementations. It aims to accelerate large language models with minimal performance drop. The current bleeding-edge version is 2.0.1, though the PyPI package might lag behind GitHub releases. Releases typically occur when major architectural changes or significant features are implemented.

total hits 16
actors 7 distinct systems
last hit 3d ago MetaBot
GPTBot
5
MetaBot
4
Script
1
ClaudeBot
1
ChatGPT-User
1

top countries 🇺🇸 United States · 🇨🇦 Canada · 🇫🇷 France · 🇩🇪 Germany