BGE Reranker v2 M3 Q8_0 GGUF

pqnet reranking
text

An 8-bit quantized GGUF version of the BGE-Reranker-v2-M3 model for efficient reranking.
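To illustrate what "8-bit quantized" means here, the following is a minimal sketch of Q8_0-style block quantization as commonly described for GGUF/GGML: weights are grouped into blocks of 32, and each block stores one scale plus 32 signed 8-bit integers. The function names are hypothetical and written for illustration only. The real format stores the scale as a 16-bit float, whereas this sketch keeps a plain Python float, so it is not byte-compatible with actual GGUF files.

```python
BLOCK_SIZE = 32  # Q8_0 groups weights into blocks of 32 values


def quantize_block(values):
    """Quantize one block of floats to (scale, list of ints in [-127, 127])."""
    amax = max(abs(v) for v in values)
    scale = amax / 127.0
    if scale == 0.0:
        # All-zero block: nothing to scale.
        return 0.0, [0] * len(values)
    quants = [max(-127, min(127, round(v / scale))) for v in values]
    return scale, quants


def dequantize_block(scale, quants):
    """Recover approximate floats from a quantized block."""
    return [scale * q for q in quants]


def quantize(weights):
    """Quantize a flat list of weights whose length is a multiple of BLOCK_SIZE."""
    if len(weights) % BLOCK_SIZE != 0:
        raise ValueError("length must be a multiple of BLOCK_SIZE")
    return [
        quantize_block(weights[i:i + BLOCK_SIZE])
        for i in range(0, len(weights), BLOCK_SIZE)
    ]
```

Each value is reconstructed to within half a quantization step (scale / 2) of the original, which is why this scheme usually costs little accuracy while roughly halving storage relative to 16-bit weights.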

context window: 8K tokens
max output: 8K tokens
input price: $0.01 / 1M tokens
output price: $0.01 / 1M tokens
fine-tunable, open-weights
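As a rough guide to the listed pricing, the sketch below estimates the cost of a request from its token counts. The function and constant names are hypothetical, and it assumes billing is strictly linear in tokens at the listed rates, which this listing does not confirm.

```python
# Listed rates, in US dollars per one million tokens.
INPUT_PRICE_PER_MILLION = 0.01
OUTPUT_PRICE_PER_MILLION = 0.01


def estimate_cost(input_tokens, output_tokens=0):
    """Estimate request cost in dollars, assuming linear per-token billing."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = (
        input_tokens * INPUT_PRICE_PER_MILLION
        + output_tokens * OUTPUT_PRICE_PER_MILLION
    )
    return total / 1_000_000
```

For example, `estimate_cost(500 * 100)` would approximate the cost of reranking 100 query-document pairs of about 500 tokens each.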