PyMilvus Model Components

0.3.2 · active · verified Thu Apr 16

PyMilvus Model is a Python library that provides model components, primarily for generating dense and sparse embeddings, intended for use with Milvus. It leverages popular deep learning frameworks like Hugging Face Transformers and Sentence-Transformers to offer a unified interface for various pre-trained models. The current version is 0.3.2, and it typically releases updates as new features or model integrations become available, often in sync with PyMilvus SDK developments.

Common errors

Warnings

Install

Imports

Quickstart

This quickstart demonstrates how to initialize a `DenseEncoder` and use it to generate embeddings for a list of texts. It uses a popular sentence transformer model. Remember to specify `device='cuda'` if you have a GPU for faster encoding and have PyTorch installed with CUDA support.

from pymilvus_model.dense.encoder import DenseEncoder

# Initialize a DenseEncoder with a common embedding model
# Ensure the model name is valid and accessible (e.g., from Hugging Face Model Hub)
encoder = DenseEncoder(model_name='BAAI/bge-small-en-v1.5', device='cpu') # Change to 'cuda' if GPU available and configured

texts = [
    'The quick brown fox jumps over the lazy dog.',
    'Artificial intelligence is rapidly advancing.',
    'Milvus is an open-source vector database.'
]

# Encode documents to get dense embeddings
embeddings = encoder.encode_documents(texts)

print(f"Encoded {len(texts)} texts.")
print(f"Shape of embeddings: {embeddings.shape}")
print(f"First embedding (truncated): {embeddings[0][:5]}...")

view raw JSON →