Qwen3 Reranker 4B Seq Cls vLLM W4A16

JSON →
alibaba reranking
text

A 4-bit quantized Qwen3 reranker optimized for sequence classification with vLLM support.

context window 41K tokens
max output 41K tokens
streaming
releasedApr 2025