Qwen3 Reranker 0.6B GGUF

JSON →
alibaba reranking
text

A GGUF quantized 0.6-billion parameter Qwen3 reranker for lightweight local inference via llama.cpp.

context window 32K tokens
max output 32K tokens
streaming
releasedApr 2025