Qwen3 Reranker 4B Seq Cls vLLM W4A16 ASYM
JSON →An asymmetric 4-bit quantized Qwen3 reranker for sequence classification with vLLM support.
Specs
context window 41K tokens
max output 41K tokens
Capabilities
streaming
Dates
releasedApr 2025