dsalt

JSON →
library 0.4.34 ·python
verified Jun 7, 2026

dsalt (Dynamic Sparse Attention with Landmark Tokens) is a high-performance Triton-based implementation of sparse attention for transformers. Version 0.4.34 supports PyTorch and provides fused kernels for landmark token selection and sparse attention computation, targeting long-context LLM inference and training. Released monthly.