Databricks SQL

1.0.0 · active · verified Fri Apr 17

Databricks SQL is a Python library for working with Databricks SQL endpoints. It provides a fluent API for building and executing SQL queries, so a query is composed through chained method calls. The library recently reached version 1.0.0, indicating a stable API after rapid initial development.
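
To illustrate what "fluent" means here, the toy builder below chains methods that each return the object itself. It is only a sketch of the pattern, not the library's implementation: `QuerySketch` and `to_sql` are hypothetical names, and the method names simply mirror the calls used in the quickstart.

```python
class QuerySketch:
    """Toy fluent builder (hypothetical; not part of databricks_sql)."""

    def __init__(self):
        self._columns = []
        self._table = None
        self._limit = None

    def select(self, *columns):
        self._columns = list(columns)
        return self  # returning self is what makes chaining possible

    def from_table(self, table):
        self._table = table
        return self

    def limit(self, n):
        self._limit = int(n)
        return self

    def to_sql(self):
        # Render the accumulated state as a SQL string.
        if self._table is None:
            raise ValueError("from_table() must be called before to_sql()")
        cols = ", ".join(self._columns) or "*"
        sql = f"SELECT {cols} FROM {self._table}"
        if self._limit is not None:
            sql += f" LIMIT {self._limit}"
        return sql


sql = (
    QuerySketch()
    .select("id", "name")
    .from_table("your_schema.users")
    .limit(5)
    .to_sql()
)
print(sql)
```

Each call only records state; nothing is rendered until the final method runs, which is why the chain can be built in any convenient number of steps.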

Common errors

Warnings

Install

Imports

Quickstart

This quickstart connects to Databricks SQL using environment variables for authentication and runs a simple SELECT query built with the fluent API. Replace `your_schema.users` with an actual table in your Databricks workspace.

import os
from databricks_sql import DatabricksSQL

# Ensure these environment variables are set for authentication
# DATABRICKS_SERVER_HOSTNAME (e.g., 'dbc-xxxx.cloud.databricks.com')
# DATABRICKS_HTTP_PATH (e.g., '/sql/1.0/endpoints/xxxx')
# DATABRICKS_ACCESS_TOKEN (Databricks personal access token)

# Initialize the DatabricksSQL client
db_sql = DatabricksSQL(
    server_hostname=os.environ.get("DATABRICKS_SERVER_HOSTNAME", ""),
    http_path=os.environ.get("DATABRICKS_HTTP_PATH", ""),
    access_token=os.environ.get("DATABRICKS_ACCESS_TOKEN", "")
)

try:
    # Example: Select data from a table named 'users'
    # Replace 'your_schema.users' with an actual table in your Databricks workspace
    result = db_sql.select("id", "name").from_table("your_schema.users").limit(5).fetch_all()
    print("Fetched data:")
    for row in result:
        print(row)

    # Example: Insert data (if table allows)
    # db_sql.insert().into_table("your_schema.new_users").columns("id", "name").values(1, "Alice").execute()
    # print("Data inserted.")

except Exception as e:
    # Any failure lands here, including connection, authentication, and query errors
    print(f"An error occurred: {e}")
    print("If this is a connection error, check that server_hostname, http_path, and access_token are correctly configured.")
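
The quickstart falls back to empty strings when an environment variable is unset, so a missing value only shows up later as a connection failure. One way to surface the problem earlier is to validate the three variables before building the client. The sketch below uses only the standard library; `load_connection_settings` and `REQUIRED_VARS` are hypothetical helper names, not part of the library.

```python
import os

# Environment variable names taken from the quickstart comments.
REQUIRED_VARS = (
    "DATABRICKS_SERVER_HOSTNAME",
    "DATABRICKS_HTTP_PATH",
    "DATABRICKS_ACCESS_TOKEN",
)


def load_connection_settings(environ=None):
    """Return connection settings, or raise if any variable is missing or blank.

    Hypothetical helper for illustration; pass a dict to test without
    touching the real environment.
    """
    env = os.environ if environ is None else environ
    missing = [name for name in REQUIRED_VARS if not env.get(name, "").strip()]
    if missing:
        raise RuntimeError(
            "Missing required environment variables: " + ", ".join(missing)
        )
    return {
        "server_hostname": env["DATABRICKS_SERVER_HOSTNAME"].strip(),
        "http_path": env["DATABRICKS_HTTP_PATH"].strip(),
        "access_token": env["DATABRICKS_ACCESS_TOKEN"].strip(),
    }


# Demonstration with placeholder values instead of the real environment.
example_env = {
    "DATABRICKS_SERVER_HOSTNAME": "dbc-xxxx.cloud.databricks.com",
    "DATABRICKS_HTTP_PATH": "/sql/1.0/endpoints/xxxx",
    "DATABRICKS_ACCESS_TOKEN": "placeholder-token",
}
settings = load_connection_settings(example_env)
print(sorted(settings))
```

The returned keys match the keyword arguments used in the quickstart, so under that assumption the dictionary could be passed along as `DatabricksSQL(**settings)`.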
