Pinecone Text Client

0.11.0 · active · verified Thu Apr 16

The Pinecone Text Client is a Python package that provides text utilities for generating sparse, dense, and hybrid vector embeddings. It is designed for seamless integration with Pinecone's vector database to facilitate sparse-dense (hybrid) semantic search. Currently, it is a public preview ('Beta') version, with the latest release being 0.11.0. Release cadence is infrequent, focusing on feature additions and improvements within its beta phase.

Common errors

Warnings

Install

Imports

Quickstart

This quickstart demonstrates how to initialize the `OpenAIEncoder` and use it to generate dense vector embeddings for both documents and queries. It highlights the necessity of setting the `OPENAI_API_KEY` environment variable.

import os
from pinecone_text.dense import OpenAIEncoder

# Ensure OPENAI_API_KEY is set in your environment
# For quick testing, you can uncomment and set it directly, but prefer environment variables
# os.environ["OPENAI_API_KEY"] = "YOUR_OPENAI_API_KEY"

if not os.environ.get("OPENAI_API_KEY"):
    print("Error: OPENAI_API_KEY environment variable not set.")
    print("Please set it or uncomment the line above for testing.")
else:
    try:
        encoder = OpenAIEncoder() # Defaults to 'text-embedding-3-small'
        documents = [
            "The quick brown fox jumps over the lazy dog",
            "Artificial intelligence is transforming industries"
        ]
        queries = [
            "Who jumped over the lazy dog?",
            "What is AI doing?"
        ]

        document_vectors = encoder.encode_documents(documents)
        query_vectors = encoder.encode_queries(queries)

        print(f"Encoded document 1 vector (first 5 elements): {document_vectors[0][:5]}...")
        print(f"Encoded query 1 vector (first 5 elements): {query_vectors[0][:5]}...")
    except Exception as e:
        print(f"An error occurred during encoding: {e}")

view raw JSON →