Support for Codex CLI by skipping unsupported Responses tools by SidShaytay · Pull Request #23041 · ggml-org/llama.cpp

SidShaytay · 2026-05-14T04:59:32Z

Overview

This enables support for codex CLI, which now uses the Responses API. As per https://platform.openai.com/docs/guides/tools?api-mode=responses type's can be beyond just function, like file_search, web_search , mcp, image_generation, namespace, etc. llama.cpp can't support each type but instead of breaking down entirely, we only pass the ones we can support to the backend. The patch is intentionally minimal as there isn't a full implementation of Responses in llama.cpp as far as I can tell. This is merely making the compatibility shim (Responses <-> Chat completion) less brittle.

Issue faced by users @ #20156

Requirements

I have read and agree with the contributing guidelines
AI usage disclosure: Codex 5.5 (API) medium used to rapidly query/understand the high level flow and cross-verify Responses API vs Chat Completion API handling within llama.cpp. The unit tests were entirely written by Codex as per guidance to cover both +ve and -ve cases.

aldehir

I think this is better than the other PR, which tries to add support for everything.

OpenAI is constantly changing Codex, it is infeasible to expect llama.cpp to maintain feature parity.

pwilkin · 2026-05-14T08:59:01Z

Agreed with @aldehir , we should be selective about what we add. Might be able to add more later once we figure out the shape of the tooling protocols.

SidShaytay · 2026-05-14T19:29:30Z

To clarify, the goal was never to add 'full functionality' of Responses API here. That's a moving target as OpenAI continually adds server-hosted tools (web_search , code_interpreter, even mcp which are mcps on openai's side, not your client machine etc). Plus it's a major design change, better served by something where llama-server handles just the API/handshake with clients, while offloading various tools invoked within Responses API to another service (say, a separately maintained tools container). However, complete Responses API functionality is outside the scope of this PR.

With that out of the way, this PR's intent is to narrowly allow llama.cpp to be more resilient / less brittle to Responses API clients like codex. @aldehir 's suggestions of

emitting warnings when unsupported tools are discovered
=> I'm aligned, it's added with minimal code
continue to throw for the codex + gpt-oss combo
=> It's now added to the PR because I'd like to get done and move on, BUT unlike @aldehir's comment, I am not seeing codex 0.130.0 advertise apply_patch at my end even with codex exec + gpt-oss-20b model. The apply_patch string is found inside the codex v0.130 rust binary but it's trigger point is opaque. I'm in slight favor to not adding any legacy support in fresh code when the target (codex) itself appears to have moved on. LMK, I can reverse just this bit as handled in the original PR (no special treatment for apply_patch)

Action: Reviewers to review

CC: @pwilkin

aldehir · 2026-05-14T20:08:28Z

Yes, it does seem that was changed in Codex. Previously it would define apply_patch as a freeform tool for gpt-oss-120b.

Strip the logic out, I apologize for the misdirection. Everything else looks good.

…ction

SidShaytay · 2026-05-14T22:30:34Z

@aldehir - no worries, all good. I've reverted gpt-oss apply_patch special handling (and it's 2x tests). Also rebased to latest master, should merge cleanly.

If all looks good, proceed?

pwilkin · 2026-05-15T07:03:19Z

@SidShaytay no, it's fine, we were referring to the other huge Responses API PR :)

…rg#23041) * Support for Codex CLI by skipping unsupported Responses tools * Warn on skipped Responses tools and preserve gpt-oss apply_patch rejection * Revert gpt-oss apply_patch special handling

SidShaytay requested review from a team and pwilkin as code owners May 14, 2026 04:59

SidShaytay mentioned this pull request May 14, 2026

Eval bug: Codex, 'type' of tool must be 'function' #20156

Open

github-actions Bot added testing Everything test related examples server labels May 14, 2026

aldehir reviewed May 14, 2026

View reviewed changes

Comment thread tools/server/server-chat.cpp

SidShaytay requested review from aldehir May 14, 2026 17:04

SidShaytay added 3 commits May 14, 2026 15:25

Support for Codex CLI by skipping unsupported Responses tools

57066ab

Warn on skipped Responses tools and preserve gpt-oss apply_patch reje…

1d042ad

…ction

Revert gpt-oss apply_patch special handling

28b7457

SidShaytay force-pushed the fix-responses-non-function-tools branch from de6562f to 28b7457 Compare May 14, 2026 22:26

aldehir approved these changes May 14, 2026

View reviewed changes

pwilkin approved these changes May 15, 2026

View reviewed changes

pwilkin merged commit 91e84fe into ggml-org:master May 15, 2026
50 checks passed

This was referenced May 17, 2026

tools[].type = "namespace" is silently dropped — MCP tools are unusable with Codex CLI #23229

Open

Codex wraps MCP tools in type:"namespace" for custom/local providers — backends with strict tools schemas cannot unwrap, MCP unusable openai/codex#23186

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Support for Codex CLI by skipping unsupported Responses tools#23041

Support for Codex CLI by skipping unsupported Responses tools#23041
pwilkin merged 3 commits into
ggml-org:masterfrom
SidShaytay:fix-responses-non-function-tools

SidShaytay commented May 14, 2026

Uh oh!

aldehir left a comment

Uh oh!

Uh oh!

pwilkin commented May 14, 2026

Uh oh!

SidShaytay commented May 14, 2026 •

edited

Loading

Uh oh!

aldehir commented May 14, 2026

Uh oh!

SidShaytay commented May 14, 2026

Uh oh!

pwilkin commented May 15, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

SidShaytay commented May 14, 2026

Overview

Requirements

Uh oh!

aldehir left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

pwilkin commented May 14, 2026

Uh oh!

SidShaytay commented May 14, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

aldehir commented May 14, 2026

Uh oh!

SidShaytay commented May 14, 2026

Uh oh!

pwilkin commented May 15, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

SidShaytay commented May 14, 2026 •

edited

Loading