polyglot-sql (Python)

Rust-powered SQL transpiler for 30+ dialects.

The polyglot-sql Python package exposes an API backed by the Rust polyglot-sql engine for fast parse/transpile/generate/format/validate workflows.

Installation

pip install polyglot-sql

Quick Start

import polyglot_sql

polyglot_sql.transpile(
    "SELECT IFNULL(a, b) FROM t",
    read="mysql",
    write="postgres",
)
# ["SELECT COALESCE(a, b) FROM t"]

ast = polyglot_sql.parse_one("SELECT 1 + 2", dialect="postgres")
polyglot_sql.generate(ast, dialect="mysql")

polyglot_sql.format_sql("SELECT a,b FROM t WHERE x=1", dialect="postgres")

Format Guard Behavior

format_sql uses Rust core formatting guards with default limits:

input bytes: 16 * 1024 * 1024
tokens: 1_000_000
AST nodes: 1_000_000
set-op chain: 256

import polyglot_sql

try:
    pretty = polyglot_sql.format_sql("SELECT 1", dialect="generic")
except polyglot_sql.GenerateError as exc:
    # Guard failures contain E_GUARD_* codes in the message.
    print(str(exc))

Per-call guard overrides:

pretty = polyglot_sql.format_sql(
    "SELECT 1 UNION ALL SELECT 2",
    dialect="generic",
    max_set_op_chain=1024,
    max_input_bytes=32 * 1024 * 1024,
)

result = polyglot_sql.validate("SELECT 1", dialect="postgres")
if result:
    print("valid")

API Reference

All functions are exported from polyglot_sql.

transpile(sql: str, read: str = "generic", write: str = "generic", *, pretty: bool = False) -> list[str]
parse(sql: str, dialect: str = "generic") -> list[dict]
parse_one(sql: str, dialect: str = "generic") -> dict
generate(ast: dict | list[dict], dialect: str = "generic", *, pretty: bool = False) -> list[str]
format_sql(sql: str, dialect: str = "generic", *, max_input_bytes: int | None = None, max_tokens: int | None = None, max_ast_nodes: int | None = None, max_set_op_chain: int | None = None) -> str
format(sql: str, dialect: str = "generic", *, max_input_bytes: int | None = None, max_tokens: int | None = None, max_ast_nodes: int | None = None, max_set_op_chain: int | None = None) -> str (alias of format_sql)
validate(sql: str, dialect: str = "generic") -> ValidationResult
optimize(sql: str, dialect: str = "generic") -> str
lineage(column: str, sql: str, dialect: str = "generic") -> dict
source_tables(column: str, sql: str, dialect: str = "generic") -> list[str]
diff(sql1: str, sql2: str, dialect: str = "generic") -> list[dict]
dialects() -> list[str]
__version__: str

Supported Dialects

Current dialect names returned by polyglot_sql.dialects():

athena, bigquery, clickhouse, cockroachdb, datafusion, databricks, doris, dremio, drill, druid, duckdb, dune, exasol, fabric, generic, hive, materialize, mysql, oracle, postgres, presto, redshift, risingwave, singlestore, snowflake, solr, spark, sqlite, starrocks, tableau, teradata, tidb, trino, tsql.

Error Handling

Exception hierarchy:

PolyglotError
ParseError
GenerateError
TranspileError
ValidationError

Unknown dialect names raise built-in ValueError.

validate(...) returns ValidationResult:

result.valid: bool
result.errors: list[ValidationErrorInfo]
bool(result) works (True when valid)

Each ValidationErrorInfo has:

message: str
line: int
col: int
code: str
severity: str

Performance Note

The package uses Rust internals directly via PyO3 and has zero runtime Python dependencies for SQL processing.

Development

cd crates/polyglot-sql-python
uv sync --group dev
uv run maturin develop
uv run pytest
uv run pyright python/polyglot_sql/
uv run maturin build --release
uv run --with mkdocs mkdocs build --strict --clean --config-file mkdocs.yml --site-dir ../../packages/python-docs/dist

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

polyglot-sql (Python)

Installation

Quick Start

Format Guard Behavior

API Reference

Supported Dialects

Error Handling

Performance Note

Development

Links

FilesExpand file tree

README.md

Latest commit

History

README.md

File metadata and controls

polyglot-sql (Python)

Installation

Quick Start

Format Guard Behavior

API Reference

Supported Dialects

Error Handling

Performance Note

Development

Links