Add general-purpose SQL Statement Execution engine by pietern · Pull Request #5416 · databricks/cli

pietern · 2026-06-02T18:56:43Z

Summary

Adds libs/sqlexec, a general-purpose, non-interactive engine for running SQL through the Databricks SQL Statement Execution API, and refactors the experimental aitools query commands to use it instead of each re-implementing the submit/poll/fetch loop.

A Client binds to a single SQL warehouse and exposes the full lifecycle: Submit (async, returns immediately with a statement ID so callers can wire up cancellation), Poll (additive backoff between status checks), Get, Cancel, Results, and the convenience wrappers Execute and ExecuteScalar. Failures surface as a typed *StatementError carrying the terminal State, Code, and Message, so callers compare with errors.As rather than string-matching. The Client holds no mutable state and is safe for concurrent use — aitools fans many statements out through one instance.

The engine speaks only the INLINE disposition with JSON_ARRAY format (the API caps this at 25 MiB per result set), which covers every caller today; EXTERNAL_LINKS is intentionally left out as a separate concern. It exists to be shared by programmatic callers such as bundle deploy resources (e.g. metric views, which have no REST API and are managed via SQL DDL) and the aitools commands.

The aitools consumers (query.go, batch.go, discover_schema.go, statement*.go) are reworked to delegate to the engine, removing the duplicated polling and result-assembly code (net ~549 lines deleted from those files).

Test plan

Hermetic unit + HTTP tests in libs/sqlexec covering path/response decoding, polling, cancellation, parameters, and error mapping.
Live integration coverage in integration/libs/sqlexec (skips without CLOUD_ENV / TEST_DEFAULT_WAREHOUSE_ID); all 6 tests verified green against a real workspace.

This pull request and its description were written by Isaac.

Add libs/sqlexec, a reusable client over the Databricks SQL Statement Execution API: submit (sync or async), poll to terminal with additive backoff, paginate result chunks, cancel, and turn terminal non-success states into a typed StatementError. INLINE + JSON_ARRAY only (the 25 MiB cap covers every caller); EXTERNAL_LINKS is intentionally out of scope. Migrate the experimental aitools query/batch/discover-schema/statement commands onto the engine, deleting their duplicated poll/fetch/terminal /error helpers. CLI-specific concerns (signal handling, spinner, batch JSON shapes, the UNRESOLVED_MAP_KEY hint) stay in aitools as a thin presenter over the engine's structured results and errors. Co-authored-by: Isaac

Add two durable test layers beyond the mock-interface unit tests: - libs/sqlexec/sqlexec_http_test.go: drives the engine through a real SDK client over HTTP against testserver with the statement endpoints programmed per test. Covers the request/response JSON round-trip for sync success, polling across GetStatement, result-chunk pagination, the FAILED-as-HTTP-200 quirk, and submit+cancel. Hermetic; runs every PR. - integration/libs/sqlexec: exercises the same paths against a live warehouse in the nightly suite (skips without CLOUD_ENV / TEST_DEFAULT_WAREHOUSE_ID). Also address review: Statement.Columns() exposes column metadata from the manifest so `statement get` still surfaces columns when a later result chunk fetch fails (restores the previously-dropped behavior + assertion). Co-authored-by: Isaac

eng-dev-ecosystem-bot · 2026-06-02T21:18:26Z

Commit: eef24de

Run: 26875352021

simonfaltum

LGTM. Reviewed the new sqlexec engine and aitools migration; focused tests passed locally. The external Integration Tests check currently reports "Report generation failed" without test annotations, so I don't see a code-review blocker here.

The shared classic warehouse on the non-UC azure-prod workspace fails to launch clusters, so the statement sat pending ~75 min and timed out the job. Gate newClient on TEST_METASTORE_ID, which is set only on the *-ucws environments, so the tests run only where a working UC warehouse is available. CLOUD_ENV can't make this distinction because azure-prod-ucws and the non-UC azure-prod both report CLOUD_ENV=azure. Co-authored-by: Isaac

eng-dev-ecosystem-bot · 2026-06-03T15:58:59Z

Commit: 4008bbe

Run: 26879815490

pietern added 2 commits June 2, 2026 19:46

pietern requested a review from simonfaltum June 2, 2026 18:57

pietern temporarily deployed to test-trigger-is June 2, 2026 18:57 — with GitHub Actions Inactive

simonfaltum approved these changes Jun 2, 2026

View reviewed changes

pietern temporarily deployed to test-trigger-is June 3, 2026 09:15 — with GitHub Actions Inactive

pietern added this pull request to the merge queue Jun 3, 2026

Merged via the queue into main with commit 4008bbe Jun 3, 2026
28 checks passed

pietern deleted the libs-sqlexec branch June 3, 2026 10:46

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add general-purpose SQL Statement Execution engine#5416

Add general-purpose SQL Statement Execution engine#5416
pietern merged 3 commits into
mainfrom
libs-sqlexec

pietern commented Jun 2, 2026

Uh oh!

eng-dev-ecosystem-bot commented Jun 2, 2026 •

edited

Loading

Uh oh!

simonfaltum left a comment

Uh oh!

Uh oh!

eng-dev-ecosystem-bot commented Jun 3, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

pietern commented Jun 2, 2026

Summary

Test plan

Uh oh!

eng-dev-ecosystem-bot commented Jun 2, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

simonfaltum left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

eng-dev-ecosystem-bot commented Jun 3, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

eng-dev-ecosystem-bot commented Jun 2, 2026 •

edited

Loading