Vortex Data

0.68.0 · active · verified Thu Apr 16

Vortex Data (`vortex-data`) provides Python bindings for Vortex, an Apache Arrow-compatible toolkit for working with compressed array data. It targets efficient storage, processing, and retrieval of large datasets in high-performance analytics and database workloads. The library is under active development, with frequent releases (often monthly); the current version is 0.68.0.

Quickstart

Demonstrates creating Vortex arrays from scalars and NumPy arrays, and converting them to Apache Arrow arrays.

import numpy as np
import pyarrow as pa
import vortex as vx

# Create a Vortex scalar with an explicit dtype.
# Assumes the top-level `vx.scalar` constructor and the `vx.int_` dtype factory.
scalar = vx.scalar(10, dtype=vx.int_(32))
print(f"Scalar: {scalar}\n")

# Create a Vortex array from a NumPy array, going through Arrow.
# Assumes `vx.array` accepts a pyarrow.Array.
numpy_arr = np.arange(10, dtype=np.int32)
vortex_arr = vx.array(pa.array(numpy_arr))
print(f"Array from NumPy: {vortex_arr}\n")

# Get its size and convert to Apache Arrow.
# Assumes the `nbytes` property and the `to_arrow_array()` method on vortex arrays.
print(f"Size in bytes: {vortex_arr.nbytes}\n")
arrow_array = vortex_arr.to_arrow_array()
print(f"Converted to Arrow: {arrow_array}")
