Qwen3 Reranker 0.6B MLX 8Bit

JSON →
mku64 reranking
text

An 8-bit quantized MLX reranker derived from Qwen3-0.6B, optimized for Apple Silicon.

context window 32K tokens
max output 32K tokens
fine-tunableopen-weights