Qwen3 Embedding 4B W4A16 G128

JSON →
boboliu embedding
text

A 4-bit quantized embedding model based on Qwen3 with 4B parameters, using W4A16 and group size 128 for efficient retrieval.

context window 41K tokens
max output 41K tokens
fine-tunable
releasedApr 2025