NIXL Python API (CUDA 12)

1.0.0 · active · verified Tue Apr 14

NIXL (NVIDIA Inference Xfer Library) is an open-source library, with a Python API, designed to accelerate point-to-point communication in AI inference frameworks. It provides a unified abstraction over memory types (CPU, GPU) and storage types (file, block, object store) through a modular plugin architecture. The `nixl-cu12` package targets CUDA 12 environments. NIXL is actively maintained and releases frequently; version 1.0.0 is the current stable release.
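Because the CUDA 12 build ships under its own distribution name, it can help to confirm which version is actually installed before going further. The sketch below uses only the standard library's `importlib.metadata`; the helper name `installed_version` is hypothetical and not part of NIXL.

```python
from importlib import metadata


def installed_version(dist_name):
    """Return the installed version string of a distribution, or None if absent."""
    try:
        return metadata.version(dist_name)
    except metadata.PackageNotFoundError:
        return None


# "nixl-cu12" is the distribution name given on this page.
version = installed_version("nixl-cu12")
print(version if version else "nixl-cu12 is not installed")
```

Returning `None` instead of raising keeps the check usable in setup scripts that want to fall back or print install guidance.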

Warnings

Install

Imports

Quickstart

This quickstart demonstrates how to import the `nixl` library and initialize a NIXL agent. The agent is the core component for managing data transfers within the NIXL framework.

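# Upstream NIXL examples import the agent class from nixl._api
from nixl._api import nixl_agent

# Initialize a NIXL agent
try:
    agent = nixl_agent('my_inference_agent')
    print(f"NIXL agent '{agent.name}' initialized successfully.")
    # Further NIXL operations would follow here, e.g., memory registration, transfer requests.
except Exception as e:
    # Broad catch on purpose: backend, plugin, or driver problems all surface here.
    print(f"Error initializing NIXL agent: {e}")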
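A single broad `except` cannot tell a missing package apart from a failed initialization. The sketch below separates the two cases. The helper `create_agent` is hypothetical, and the import path is an assumption: it looks for `nixl_agent` in `nixl._api` first and then at the top level of `nixl`.

```python
import importlib


def create_agent(name):
    """Return (agent, error); exactly one of the two is None."""
    try:
        nixl = importlib.import_module("nixl")
    except (ImportError, OSError) as exc:
        return None, f"nixl could not be imported: {exc}"

    # Assumption: the constructor lives in nixl._api or at the package top level.
    factory = None
    try:
        api = importlib.import_module("nixl._api")
        factory = getattr(api, "nixl_agent", None)
    except (ImportError, OSError):
        pass
    if factory is None:
        factory = getattr(nixl, "nixl_agent", None)
    if factory is None:
        return None, "nixl is installed but nixl_agent was not found"

    try:
        return factory(name), None
    except Exception as exc:  # backend or driver failures surface here
        return None, f"agent initialization failed: {exc}"


agent, error = create_agent("my_inference_agent")
print(error if agent is None else "agent ready")
```

The error string says whether to fix the installation or the runtime environment, which a single catch-all message does not.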
