Sanitize MCP tool output items in openai-agents-sdk template by dhruv0811 · Pull Request #119 · databricks/app-templates

dhruv0811 · 2026-02-12T23:25:09Z

Summary

MCP tools (e.g. Genie) can return output items where the output field is a list instead of a string, causing Pydantic validation errors in ResponsesAgentResponse
Adds sanitize_output_items / _sanitize_item to utils.py and wires it into both invoke and stream handlers

Temporary workaround until mlflow/mlflow#20777 is released.

MCP tools (e.g. Genie) can return list outputs that fail Pydantic validation. Add sanitize_output_items to both invoke and stream paths. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

Copilot

Pull request overview

This PR adds sanitization of MCP tool output items to fix Pydantic validation errors that occur when the output field is a list instead of a string. The changes introduce helper functions to convert non-string output values to JSON strings before passing them to ResponsesAgentResponse.

Changes:

Added _sanitize_item and sanitize_output_items helper functions to convert list-type output fields to JSON strings
Applied sanitization in both the invoke handler (for non-streaming responses) and the stream event handler (for streaming responses)
Reorganized imports in agent.py to be alphabetically sorted
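
For readers who do not have the diff open, the two helpers can be sketched roughly as follows. The body of `_sanitize_item` follows the hunk quoted in the review comments on this PR; the body of `sanitize_output_items`, the type hints, and the sample items are assumptions made for illustration and are not taken from `utils.py`.

```python
import json
from typing import Any


def _sanitize_item(input_item: dict[str, Any]) -> dict[str, Any]:
    """Serialise a list-valued ``output`` field to a JSON string (in place)."""
    if isinstance(input_item.get("output"), list):
        input_item["output"] = json.dumps(input_item["output"])
    return input_item


def sanitize_output_items(items: list[dict[str, Any]]) -> list[dict[str, Any]]:
    """Assumed wrapper: apply ``_sanitize_item`` to every output item."""
    return [_sanitize_item(item) for item in items]


# Hypothetical items: one with a list-valued output (as an MCP tool might
# return), one with an ordinary string output.
items = [
    {"type": "function_call_output", "call_id": "call_1",
     "output": [{"type": "text", "text": "42 rows"}]},
    {"type": "function_call_output", "call_id": "call_2", "output": "ok"},
]
sanitized = sanitize_output_items(items)
```

After the call, every `output` value in `sanitized` is a string, which is the shape the summary says the MLflow Pydantic models expect.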

Reviewed changes

Copilot reviewed 2 out of 2 changed files in this pull request and generated 4 comments.

File	Description
agent-openai-agents-sdk/agent_server/utils.py	Added sanitization functions `_sanitize_item` and `sanitize_output_items` to handle MCP tool outputs with list-type output fields; integrated `_sanitize_item` into the stream event processing
agent-openai-agents-sdk/agent_server/agent.py	Updated imports to include `sanitize_output_items` and applied it to the `invoke` response output; reorganized imports alphabetically

Copilot · 2026-02-13T00:28:07Z

+    if isinstance(input_item.get("output"), list):
+        input_item["output"] = json.dumps(input_item["output"])
+    return input_item


The function mutates the input dictionary directly, which can cause unintended side effects if the same dictionary is used elsewhere. Consider creating a copy of the dictionary before modifying it to avoid mutating the caller's data. For example, create a shallow copy at the start of the function with item = input_item.copy() and then modify and return the copy.

Suggested change

-    if isinstance(input_item.get("output"), list):
-        input_item["output"] = json.dumps(input_item["output"])
-    return input_item
+    item = input_item.copy()
+    if isinstance(item.get("output"), list):
+        item["output"] = json.dumps(item["output"])
+    return item
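
The side effect described in this comment can be reproduced with a small standalone sketch. The function names `sanitize_in_place` and `sanitize_copy` and the sample dictionary are hypothetical; only the two function bodies mirror the before and after of the suggestion.

```python
import json


def sanitize_in_place(input_item: dict) -> dict:
    # Original behaviour: rewrites the caller's dictionary.
    if isinstance(input_item.get("output"), list):
        input_item["output"] = json.dumps(input_item["output"])
    return input_item


def sanitize_copy(input_item: dict) -> dict:
    # Suggested behaviour: work on a shallow copy and return that.
    item = input_item.copy()
    if isinstance(item.get("output"), list):
        item["output"] = json.dumps(item["output"])
    return item


original = {"type": "function_call_output", "output": [{"text": "hi"}]}

copied = sanitize_copy(original)
# The caller's dict still holds the list after the copying variant runs.
still_list_after_copy = isinstance(original["output"], list)

mutated = sanitize_in_place(original)
# The in-place variant returns the very same object, now holding a string.
same_object = mutated is original
```

A shallow copy is enough here because the function rebinds the `output` key rather than modifying the nested list itself.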

Copilot · 2026-02-13T00:28:08Z

+    a *list* of content objects instead of a plain string. MLflow's Pydantic
+    models expect ``output`` to be a string, so this serialises any non-string
+    values to JSON.
+
+    TODO: Remove once https://github.com/mlflow/mlflow/pull/20777 is released.
+    """
+    if isinstance(input_item.get("output"), list):
+        input_item["output"] = json.dumps(input_item["output"])


The sanitization only handles the case where output is a list. However, other non-string types (e.g., dict, int, bool, None) could also cause Pydantic validation errors. Consider checking if output is not a string rather than checking if it's specifically a list. For example: if "output" in input_item and not isinstance(input_item["output"], str): to handle all non-string types consistently.

Suggested change

-    a *list* of content objects instead of a plain string. MLflow's Pydantic
-    models expect ``output`` to be a string, so this serialises any non-string
-    values to JSON.
-    TODO: Remove once https://github.com/mlflow/mlflow/pull/20777 is released.
-    """
-    if isinstance(input_item.get("output"), list):
-        input_item["output"] = json.dumps(input_item["output"])
+    a collection or other non-string value instead of a plain string. MLflow's
+    Pydantic models expect ``output`` to be a string, so this serialises any
+    non-string values to JSON.
+    TODO: Remove once https://github.com/mlflow/mlflow/pull/20777 is released.
+    """
+    output_value = input_item.get("output")
+    if "output" in input_item and not isinstance(output_value, str):
+        input_item["output"] = json.dumps(output_value)
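
To make the effect of the broader check concrete, here is a minimal sketch that runs the suggested condition over several value types. The function name `sanitize_non_string` and the sample inputs are illustrative assumptions; the sketch also copies the item, which this suggestion does not do.

```python
import json


def sanitize_non_string(input_item: dict) -> dict:
    item = input_item.copy()
    # The membership test matters: .get() returns None for a missing key,
    # and None is not a str, so without it an "output" key would be added.
    if "output" in item and not isinstance(item["output"], str):
        item["output"] = json.dumps(item["output"])
    return item


samples = [
    {"output": ["a", "b"]},            # list
    {"output": {"rows": 3}},           # dict
    {"output": 7},                     # int
    {"output": True},                  # bool
    {"output": None},                  # None
    {"output": "already a string"},    # left untouched
    {"type": "message"},               # no output key at all
]
results = [sanitize_non_string(sample) for sample in samples]
```

One consequence worth noting: `json.dumps(None)` yields the string `"null"`, so an explicit `None` output becomes the four-character string `null` rather than being dropped. Whether that is the desired behaviour depends on how downstream consumers read the field.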

Signed-off-by: Bryan Qiu <bryan.qiu@databricks.com>

… with PR databricks#119
- Move Genie from standalone GENIE_SPACE_ID into SUBAGENTS as type "genie" with a description for the MCP server
- Comment out all SUBAGENTS entries by default; single assert catches unconfigured templates
- Update _sanitize_item to handle any non-string output (not just lists) with try/except fallback, matching the base template from PR databricks#119

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
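
The commit message above mentions a try/except fallback but does not show it. One plausible shape is sketched below; the function name `sanitize_with_fallback`, the choice of `str()` as the fallback, the caught exception types, and the `Opaque` class are all assumptions for illustration, not the merged code.

```python
import json
from typing import Any


def sanitize_with_fallback(input_item: dict[str, Any]) -> dict[str, Any]:
    item = input_item.copy()
    if "output" in item and not isinstance(item["output"], str):
        try:
            item["output"] = json.dumps(item["output"])
        except (TypeError, ValueError):
            # Assumed fallback: json.dumps raises TypeError for values it
            # cannot serialise and ValueError for circular references, so
            # fall back to the plain string form of the value.
            item["output"] = str(item["output"])
    return item


class Opaque:
    """Hypothetical value that json.dumps cannot serialise."""

    def __repr__(self) -> str:
        return "<Opaque result>"


serialised = sanitize_with_fallback({"output": {"rows": 3}})
fell_back = sanitize_with_fallback({"output": Opaque()})
```

With a fallback like this, the sanitizer always produces a string and cannot itself raise while handling an unusual tool result, at the cost of the fallback text not being valid JSON.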

Sanitize MCP tool output items in openai-agents-sdk template

7e70d83

dhruv0811 requested review from bbqiu and Copilot February 13, 2026 00:24

Copilot started reviewing on behalf of dhruv0811 February 13, 2026 00:25

Copilot AI reviewed Feb 13, 2026

bbqiu reviewed Feb 13, 2026

Comment thread agent-openai-agents-sdk/agent_server/utils.py

.

64c35a5

Signed-off-by: Bryan Qiu <bryan.qiu@databricks.com>

bbqiu approved these changes Feb 13, 2026

bbqiu merged commit 9cc741d into databricks:main Feb 13, 2026

bbqiu mentioned this pull request Feb 13, 2026

Add agent-openai-multiagent template #117

Merged

3 tasks

bbqiu mentioned this pull request Mar 5, 2026

Agent templates: e2e tests + remove unnecessary app.yaml files #143

Merged

bbqiu merged 2 commits into databricks:main from dhruv0811:sanitize-mcp-output
