chDB

4.1.6 · active · verified Thu Apr 09

chDB is an in-process OLAP SQL Engine powered by ClickHouse, enabling users to embed a powerful analytical database directly within their Python applications. It allows running SQL queries on various data formats (Parquet, CSV, JSON, Pandas DataFrames) without needing a separate database server. Currently at version 4.1.6, chDB maintains an active development and release cadence, frequently adding features and improvements.

Warnings

Install

Imports

Quickstart

This quickstart demonstrates how to execute a basic SQL query using `chdb.query` and receive the results directly as a Pandas DataFrame. It also shows how to query an existing Pandas DataFrame using ClickHouse SQL syntax via the `python(df_name)` table function.

import chdb
import pandas as pd

# Run a simple SQL query and get results as a Pandas DataFrame
result_df = chdb.query("SELECT 1 as id, 'Hello chDB!' as message, version() as chdb_version", "DataFrame")
print("Query Result (DataFrame):\n", result_df)

# Query an existing Pandas DataFrame directly
data = {'col1': [1, 2, 3], 'col2': ['A', 'B', 'C']}
mypandas_df = pd.DataFrame(data)
sql_on_df = "SELECT col1, upper(col2) FROM python(mypandas_df) WHERE col1 > 1"
queried_df_from_pandas = chdb.query(sql_on_df, "DataFrame")
print("\nQuery Result from Pandas DataFrame (DataFrame):\n", queried_df_from_pandas)

view raw JSON →