🤖 fix: stabilize CoderProvisioner reconcile and startup#73
Conversation
|
@codex review |
There was a problem hiding this comment.
💡 Codex Review
Here are some automated review suggestions for this pull request.
Reviewed commit: 8118ac233d
ℹ️ About Codex in GitHub
Your team has set up Codex to review pull requests in this repo. Reviews are triggered when you
- Open a pull request for review
- Mark a draft as ready
- Comment "@codex review".
If Codex has suggestions, it will comment; otherwise it will react with 👍.
Codex can also answer questions or update the PR. Try commenting "@codex address that feedback".
Fixes CoderProvisioner bring-up failures and reconcile churn by: - granting manager RBAC verbs required to create delegated provisioner Roles - preventing coderd key metadata-backfill retries from repeatedly rotating keys - launching provisioners with explicit command/args compatible with coder images - avoiding CODER_ORGANIZATION injection for key-auth provisioners - updating tests and generated CRD/docs to match behavior --- _Generated with `mux` • Model: `openai:gpt-5.3-codex` • Thinking: `xhigh` • Cost: `$0.00`_ <!-- mux-attribution: model=openai:gpt-5.3-codex thinking=xhigh costs=0.00 -->
8118ac2 to
2bbf959
Compare
|
@codex review Addressed the review feedback by restoring metadata revalidation/rotation in the empty-status branch and persisting key-related status immediately after key+secret reconciliation to avoid repeated coderd churn on later retries. |
|
Codex Review: Didn't find any major issues. Bravo. ℹ️ About Codex in GitHubYour team has set up Codex to review pull requests in this repo. Reviews are triggered when you
If Codex has suggestions, it will comment; otherwise it will react with 👍. Codex can also answer questions or update the PR. Try commenting "@codex address that feedback". |
Summary
This PR fixes CoderProvisioner bring-up and reconcile stability issues discovered in smoke testing.
Background
During an end-to-end smoke test,
CoderProvisionerfailed to reach Ready due to RBAC escalation issues and provisioner pod startup failures. Logs also showed repeated metadata backfill/rotation activity that could repeatedly hit coderd and trigger rate limiting when reconcile retries occurred.Implementation
pods(get/list/watch/create/update/patch/delete)persistentvolumeclaims(get/list/watch/create/update/patch/delete)config/rbac/role.yamlincludes those verbs.command: ["coder"]coder provisioner startCODER_ORGANIZATIONinjection for key-auth daemon startup (avoids CLI rejection when--keyis used).Validation
make verify-vendormake testmake buildmake lintmake test-integrationmake manifestsmake docs-referenceCoderProvisionerreachesphase=Ready,readyReplicas=1successfully connected to coderdRisks
CoderProvisionerreconcile/deployment composition and generated RBAC/docs.Generated with
mux• Model:openai:gpt-5.3-codex• Thinking:xhigh• Cost:$0.00