dsalt
JSON →dsalt (Dynamic Sparse Attention with Landmark Tokens) is a high-performance Triton-based implementation of sparse attention for transformers. Version 0.4.34 supports PyTorch and provides fused kernels for landmark token selection and sparse attention computation, targeting long-context LLM inference and training. Released monthly.