Qwen3 Reranker 4B Seq Cls vLLM W4A16
JSON →A 4-bit quantized Qwen3 reranker optimized for sequence classification with vLLM support.
Specs
context window 41K tokens
max output 41K tokens
Capabilities
streaming
Dates
releasedApr 2025
A 4-bit quantized Qwen3 reranker optimized for sequence classification with vLLM support.