feat(conversation): hard-cut text2sql v2 workflow spine and RAG runtime gates by LienJack · Pull Request #23 · LienJack/text2sql

LienJack · 2026-04-28T17:24:09Z

Summary

hard-cut the conversation flow onto the Text2SQL v2 workflow spine and LangGraph runtime, including new stage contracts, artifact references, delivery mapping, and evaluation gates
add RAG task configuration, provider health probing, retrieval and rerank routing, and the Prisma-backed task config model needed for runtime selection
update the settings experience, shared API contracts, smoke coverage, and graphify artifacts so the new RAG configuration and runtime intelligence paths are visible end to end

Testing

Not run in this session

- Added `AGENT_RAG_RETRIEVAL_ENABLED` to the environment configuration, allowing for toggling of RAG retrieval functionality. - Updated `AppConfigService` to include a method for retrieving the new RAG retrieval setting. - Enhanced `RetrieveKnowledgeNode` to respect the RAG retrieval configuration, providing a fallback mechanism when retrieval is disabled. - Removed unused `GraphService` and related components to streamline the codebase and improve maintainability.

- Updated the `generatedAt` timestamps in gate summary JSON files for relationship platform, semantic spine, and modeling parity shadows to reflect the latest generation time. - Changed `strictMode` from `true` to `false` in the modeling parity shadow gate summary to allow for more flexible processing. - Enhanced the semantic plan service to handle degraded contexts without grounding evidence, ensuring the SQL path is maintained in such scenarios. - Introduced a tolerance for gate comparison in the text2sql evaluation service to prevent false negatives due to rounding errors.

- Added new endpoints for managing RAG task configurations, including listing, upserting, and health checking. - Implemented a new `RagTaskConfig` model and service to handle RAG task settings. - Enhanced the `SettingsService` to integrate RAG task configuration management. - Updated the `EmbeddingRouterService` and related components to utilize RAG task configurations for embedding runtime resolution. - Introduced coverage gap handling in the semantic plan service to improve error reporting and decision-making in SQL generation.

- Introduced new evaluation gates for Text2SQL v2, including strict modes for evaluation and focused coverage. - Updated CI references in documentation to reflect new gate commands and their expected outputs. - Enhanced the text2sql evaluation service to include detailed traceability metrics, ensuring comprehensive coverage reporting. - Added new traceability fields in evaluation cases to track fixture families, behavior tests, and flow nodes. - Improved error handling in SQL generation to account for metadata and general queries, preventing unnecessary SQL execution. - Updated test fixtures to include traceability information for better integration testing.

…runtime paths

- Introduced a new method `runOperationalFailure` in `FormatAnswerNode` to handle operational failures with user guidance. - Updated `RetrieveKnowledgeNode` to support datasource and allowed tables in its input, improving context handling. - Enhanced `SqlGenerationService` to utilize semantic shortcuts for generating SQL queries, improving efficiency and error recovery. - Implemented checks in `SqlReadonlyTool` to prevent introspection of schema system tables, ensuring security compliance. - Added new tests for RAG provider model previews and SQL generation shortcuts, ensuring robust functionality and error handling.

Introduce trace.v2 runtimePlan, artifactRefs, and Smart Defaults evidence across the shared contract, LangGraph runtime, delivery projection, and stream state summaries.

Extend focused and eval gates so runtime intelligence requires producer-backed artifact refs, stream lifecycle coverage, Smart Defaults evidence, and no-legacy owner scans.

Define additive plan ledger obligations, generation claims, validation fulfillment, and delivery summary fields so the v2 trace can carry SQL intent contracts without changing runtime plan semantics.

Build semantic plan obligations, block failed hard blockers before SQL generation, record fulfillment claims on generated SQL, and validate SQL against the ledger before execution or correction.

Add ledger fixture families, rollout metrics, and unit coverage for obligation creation, generation claims, validation fulfillment, correction grounding, and safe delivery surfaces.

LienJack added 30 commits April 26, 2026 01:13

refactor(conversation): add text2sql workflow spine runner

52638d0

refactor(text2sql): hard cut to direct v2 runtime

371f44d

refactor(text2sql): make v2 runtime stage-owned and policy-driven

a494422

feat(text2sql): strengthen retrieval context contract and reindex flow

d12c233

feat(text2sql): enforce semantic plan and validation correction gates

2b99f43

refactor(text2sql): hard-cut delivery read-model and strengthen v2 gates

f9c810c

feat(conversation): cut over text2sql v2 seam to langgraph runtime

2395c2e

refactor(backend): hard-cut text2sql v2 langgraph runtime

09323ae

refactor(backend): align langgraph v2 contracts and tracing gates

33fd264

feat(text2sql-v2): complete full mermaid strict-completion closeout

15c0dcf

feat(conversation): land v2 runtime seam and rag config onboarding

f2cea37

refactor(conversation): finalize hard-cut guard layering

19dde31

refactor(conversation): hard-cut v2 compatibility layers and flatten …

c1f6b67

…runtime paths

feat(text2sql-v2): add runtime intelligence artifacts

35a5be1

Introduce trace.v2 runtimePlan, artifactRefs, and Smart Defaults evidence across the shared contract, LangGraph runtime, delivery projection, and stream state summaries.

test(text2sql-v2): gate runtime intelligence coverage

af14e7c

Extend focused and eval gates so runtime intelligence requires producer-backed artifact refs, stream lifecycle coverage, Smart Defaults evidence, and no-legacy owner scans.

chore(graphify): refresh text2sql runtime graph

b4d9774

fix(frontend): stabilize relationship edit dialog test

a131c60

feat(text2sql-v2): add plan ledger contracts

19f235b

Define additive plan ledger obligations, generation claims, validation fulfillment, and delivery summary fields so the v2 trace can carry SQL intent contracts without changing runtime plan semantics.

feat(text2sql-v2): enforce plan ledger fulfillment

ef2c367

Build semantic plan obligations, block failed hard blockers before SQL generation, record fulfillment claims on generated SQL, and validate SQL against the ledger before execution or correction.

test(text2sql-v2): cover plan ledger accuracy gates

9848141

Add ledger fixture families, rollout metrics, and unit coverage for obligation creation, generation claims, validation fulfillment, correction grounding, and safe delivery surfaces.

chore(graphify): refresh text2sql plan ledger graph

668d1f9

fix(text2sql-v2): record real stage durations

4c2551b

fix(text2sql-v2): narrow ledger column obligations

e5b6715

chore(graphify): refresh text2sql ledger graph

d65fb03

fix(settings): clear rag api key on runtime target changes

3bdd7b2

LienJack added 2 commits April 29, 2026 02:52

fix(architecture): route artifact refs through knowledge entry

5558daf

docs(ce-review): add summary for PR #23 findings and residual risks

1a974be

LienJack merged commit 2c1dd43 into dev Apr 28, 2026
2 of 3 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat(conversation): hard-cut text2sql v2 workflow spine and RAG runtime gates#23

feat(conversation): hard-cut text2sql v2 workflow spine and RAG runtime gates#23
LienJack merged 32 commits into
devfrom
codex/refactor-text2sql-workflow-spine

LienJack commented Apr 28, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

LienJack commented Apr 28, 2026

Summary

Testing

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant