Cut Cross Entropy
JSON →Cut Cross Entropy provides a highly memory-efficient implementation of the linear-cross-entropy loss function, primarily optimized for large language models and high-throughput inference scenarios. It is part of the vLLM project. The current version is 25.1.1, indicating a rapid development cycle, likely following a date-based or frequent release cadence, designed for NVIDIA GPUs.
Traffic · last 30 days ↓29% vs prev 7d
total hits 34
actors 8 distinct systems
last hit 23h ago human
top countries 🇸🇬 Singapore · 🇩🇪 Germany · 🇫🇷 France · 🇺🇸 United States · 🇨🇦 Canada
API endpoints
full doc /v1/registry/cut-cross-entropy
compatibility /v1/registry/cut-cross-entropy/compatibility