NVIDIA CUTLASS Python DSL
NVIDIA CUTLASS Python DSL (version 4.4.2) is a Python-based domain-specific language (DSL) for writing high-performance CUDA kernels. It provides a Pythonic interface to CUTLASS's CuTe library, and kernels are JIT-compiled automatically to optimized PTX/SASS for NVIDIA GPUs (Ampere, Hopper, and Blackwell architectures). It aims for zero-cost abstraction, performance comparable to C++ kernels, and seamless integration with deep learning frameworks such as PyTorch and JAX. The library is under active development, with frequent updates and minor version releases.
Resources
API endpoints
full doc /v1/registry/nvidia-cutlass-dsl
compatibility /v1/registry/nvidia-cutlass-dsl/compatibility
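The two paths above are relative, so a client has to join them to a host before requesting them. The following is a minimal sketch of that step, not the registry's own client: the host `https://registry.example.invalid` is a placeholder, and treating `nvidia-cutlass-dsl` as a substitutable slug is an assumption made for illustration. Only the two path shapes come from the list above.

```python
from urllib.parse import urljoin

# Path shapes copied from the "API endpoints" list. Turning the package
# name into a {slug} placeholder is an assumption for illustration.
ENDPOINT_PATHS = {
    "full_doc": "/v1/registry/{slug}",
    "compatibility": "/v1/registry/{slug}/compatibility",
}


def registry_urls(base_url: str, slug: str = "nvidia-cutlass-dsl") -> dict:
    """Return an absolute URL for each listed endpoint under base_url."""
    return {
        name: urljoin(base_url, path.format(slug=slug))
        for name, path in ENDPOINT_PATHS.items()
    }


# Hypothetical host: the page does not say where the registry is served.
urls = registry_urls("https://registry.example.invalid")
for name, url in urls.items():
    print(f"{name}: {url}")
```

Because both paths start with `/`, `urljoin` replaces any path already on the base URL, so the result is the same whether or not the host is given with a trailing slash.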