
feat(homelab): set flinker/qwen3.6-35b-a3b as default model with llama.cpp config #69

Merged: mrsimpson merged 4 commits into main from feat/set-default-model-flinker-llama-cpp-config on May 19, 2026


@mrsimpson (Owner) commented May 19, 2026

Summary

  • Set flinker/qwen3.6-35b-a3b as the default model for all new session pods
  • Inject top_k=20, top_p=0.95, temperature=0.6 into the qwen model config for llama.cpp (per Qwen model card coding recommendations)
  • Switch from Anthropic API keys to OpenRouter (free + paid tiers)
  • Add dynamic model fetching from OpenRouter at deploy time
  • Add attach service for router attach port (4096)
  • Add admin secret for CI endpoints (e.g. /api/admin/pull-image)
  • Update operator sidecar env vars (ROUTER_ADMIN_URL, attach config, admin secret)
  • Change oauth2Proxy group from users to developers
  • Route session traffic through Traefik for proper oauth2 middleware chain
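
The free/paid split for the OpenRouter models could look roughly like the sketch below. It is not the code from this PR: the OpenRouterModel interface, isFree, and selectModels are hypothetical names, the pricing fields are assumed to be numeric strings as in OpenRouter's model list, and the input is assumed to be already ordered by popularity, since the PR does not say how that ranking is obtained.

```typescript
// Hypothetical subset of one entry in the OpenRouter model list.
// Only the fields this sketch needs are declared.
interface OpenRouterModel {
  id: string;
  pricing: { prompt: string; completion: string };
}

// Assumption: a model counts as free when both prices parse to zero.
function isFree(m: OpenRouterModel): boolean {
  return Number(m.pricing.prompt) === 0 && Number(m.pricing.completion) === 0;
}

// Keep every free model, plus the first `paidLimit` paid models.
// Assumption: `all` arrives sorted by popularity, most popular first.
function selectModels(
  all: OpenRouterModel[],
  paidLimit = 20,
): { free: OpenRouterModel[]; paid: OpenRouterModel[] } {
  const free = all.filter(isFree);
  const paid = all.filter((m) => !isFree(m)).slice(0, paidLimit);
  return { free, paid };
}
```

Keeping the selection a pure function over an already-fetched list means the deploy-time fetch can fail or be mocked without touching the filtering logic.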

Key Decisions

  1. Default model — Root model field in dynamic ConfigMap's opencode.json
  2. llama.cpp options — Per Qwen model card "Thinking mode for precise coding tasks": temperature=0.6, top_p=0.95, top_k=20. Injected via model.options in parseFlinkerModel for qwen3.6-35b-a3b only.
  3. Provider switch — Anthropic → OpenRouter (free tier + 20 most-popular paid programming models)
  4. Traefik routing — Session traffic now goes through Traefik to ensure ForwardAuth → oauth2-chain middleware runs
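
Decisions 1 and 2 can be illustrated with a minimal sketch. Only the model id, the sampling values, the root model field, and the parseFlinkerModel name come from this PR; the function signature, the ModelEntry shape, and buildOpencodeConfig are assumptions for illustration, and the real code in deployment/homelab/src/index.ts may be structured differently.

```typescript
// Hypothetical shapes; the actual opencode.json schema may carry more fields.
interface ModelOptions {
  temperature: number;
  top_p: number;
  top_k: number;
}

interface ModelEntry {
  name: string;
  options?: ModelOptions;
}

const DEFAULT_MODEL_ID = "qwen3.6-35b-a3b";

// Sampling values quoted in the PR description (Qwen model card,
// "Thinking mode for precise coding tasks").
const QWEN_CODING_OPTIONS: ModelOptions = { temperature: 0.6, top_p: 0.95, top_k: 20 };

// Assumed signature: maps a model id to its config entry and injects
// the sampling options for the default qwen model only.
function parseFlinkerModel(id: string): ModelEntry {
  const entry: ModelEntry = { name: id };
  if (id === DEFAULT_MODEL_ID) {
    entry.options = { ...QWEN_CODING_OPTIONS };
  }
  return entry;
}

// Hypothetical helper: builds the object serialized into the ConfigMap's
// opencode.json, with the default model in the root `model` field.
function buildOpencodeConfig(modelIds: string[]) {
  const models: Record<string, ModelEntry> = {};
  for (const id of modelIds) {
    models[id] = parseFlinkerModel(id);
  }
  return {
    model: `flinker/${DEFAULT_MODEL_ID}`,
    provider: { flinker: { models } },
  };
}
```

Scoping the options to a single model id keeps other flinker models on their server-side defaults, which matches the "for qwen3.6-35b-a3b only" note above.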

Changes

  • deployment/homelab/src/index.ts — ConfigMap, providers, secrets, services, env vars
  • .vibe/development-plan-feat-set-default-model-flinker-llama-cpp-config.md — Development plan

mrsimpson added 4 commits May 18, 2026 18:51
…a.cpp config

- Add root model field to dynamic ConfigMap for flinker/qwen3.6-35b-a3b
- Inject top_k and temperature options for qwen model via parseFlinkerModel
- Switch from Anthropic to OpenRouter API keys
- Add dynamic OpenRouter model fetching (free + paid)
- Add attach service for router attach port
- Add admin secret for CI endpoints
- Update operator sidecar env vars (ROUTER_ADMIN_URL, attach config)
- Change oauth2Proxy group to developers
- Update ROUTER_SERVICE_URL to point to Traefik
…a.cpp config

- Add root model field to dynamic ConfigMap for flinker/qwen3.6-35b-a3b
- Inject top_k: 20, top_p: 0.95, temperature: 0.6 for qwen model (per Qwen model card coding recommendations)
- Switch from Anthropic to OpenRouter API keys
- Add dynamic OpenRouter model fetching (free + paid)
- Add attach service for router attach port
- Add admin secret for CI endpoints
- Update operator sidecar env vars (ROUTER_ADMIN_URL, attach config)
- Change oauth2Proxy group to developers
- Update ROUTER_SERVICE_URL to point to Traefik
@mrsimpson mrsimpson merged commit a87861b into main May 19, 2026
4 checks passed
@mrsimpson mrsimpson deleted the feat/set-default-model-flinker-llama-cpp-config branch May 19, 2026 06:26