kleidiai: added kleidiai-server to server-self-hosted workflow #22435

Merged: CISC merged 3 commits into ggml-org:master from martin-klacer-arm:feature/kleidiai_server_workflow on May 18, 2026

@martin-klacer-arm
Contributor

Overview

This patch adds a KleidiAI-enabled Arm64 Linux llama-server CI integration test to the server-self-hosted.yml workflow file.
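For illustration only, a job of this kind might look roughly like the sketch below. The job id follows the PR title; the runner labels, action version, build flags, and test steps are assumptions and are not taken from the merged diff.

```yaml
# Hypothetical sketch of a KleidiAI server test job; not the merged change.
jobs:
  kleidiai-server:                          # job id assumed from the PR title
    runs-on: [self-hosted, Linux, ARM64]    # assumed labels; the approved runner label is not shown in this thread
    steps:
      - name: Clone
        uses: actions/checkout@v4

      - name: Build
        run: |
          # GGML_CPU_KLEIDIAI enables the KleidiAI kernels in the CPU backend
          cmake -B build -DGGML_CPU_KLEIDIAI=ON
          cmake --build build --config Release -j $(nproc) --target llama-server

      - name: Tests
        run: |
          # assumed location of the server test suite
          cd tools/server/tests
          pip install -r requirements.txt
          ./tests.sh
```

Routing the job to an existing self-hosted runner by label, as in this sketch, avoids having to register a new runner.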

Additional information

Adding a server-based CI workflow as per the discussion under this PR: #19357 (comment).

Requirements

  • I have read and agree with the contributing guidelines
  • AI usage disclosure: YES - consultation about GitHub actions syntax (no directly AI-generated code)

 * Added KleidiAI-enabled Arm64 Linux llama-server CI/integration test
   workflow into the server-self-hosted.yml configuration file

Signed-off-by: Martin Klacer <martin.klacer@arm.com>
Change-Id: I032e33c525b7e26bc5d53719f638bee610cec1ee
martin-klacer-arm requested a review from a team as a code owner on April 27, 2026 13:26
github-actions (bot) added the devops label (improvements to build systems and github actions) on Apr 27, 2026
@taronaeo
Member

I'm assuming your self-hosted runner is being onboarded to the ggml-org organization to run this workflow, right? I don't think I see a runner fitting the Linux, ARM64, llama-server requirement :)

@martin-klacer-arm
Contributor Author

Thanks for the reply, and sorry for the wait. We are currently in the process of getting approval to use a self-hosted executor for this workflow; I'll update the PR once that becomes available.

Signed-off-by: Martin Klacer <martin.klacer@arm.com>
@martin-klacer-arm
Contributor Author

Hi, I added the correct approved self-hosted executor label. The runner should already be in use in another workflow, so this just adds another job to it rather than adding an entirely new runner.

taronaeo requested review from a team and CISC on May 17, 2026 13:16
Comment thread .github/workflows/server-self-hosted.yml Outdated
Co-authored-by: Sigbjørn Skjæret <sigbjorn.skjaeret@scala.com>
CISC merged commit 053e01d into ggml-org:master on May 18, 2026
3 checks passed
Jcfunk added a commit to Jcfunk/llama.cpp that referenced this pull request May 19, 2026
* master: (100 commits)
  Agent update
  hexagon: add support for TRI op (ggml-org#22822)
  ggml-hexagon: add PAD op HVX kernel (ggml-org#23078)
  docker : add OCI image labels for version and build date (ggml-org#21653)
  common : remove hf cache migration (ggml-org#23266)
  ui: Update KaTeX package and clean up logs from `sass` warnings (ggml-org#23275)
  feat: add scroll-to-bottom button to chat + prevent forced scroll down (ggml-org#23270)
  ui: Refactor models store, MCP service, and gate logs behind VITE_DEBUG (ggml-org#23236)
  ui: Centralize monospace font styles in app.css (ggml-org#23272)
  webui: fix Tailwind v4 utility classes missing when built via cmake (ggml-org#23253)
  llama: initialize pre-norm embedding mask flag (ggml-org#23256)
  add myself to conversion (ggml-org#23261)
  ci : added kleidiai-server to server-self-hosted workflow (ggml-org#22435)
  scripts : allow wc2wt with an existing branch (ggml-org#23189)
  sycl: scalar SWAR byte-subtract in Q6_K MMVQ dot product (ggml-org#22156)
  sycl: route small f32 matmuls to oneMKL, bypass oneDNN (ggml-org#22150)
  sycl : fix error when use -mg 1 error (ggml-org#23140)
  update bid to match each layers MTP source (ggml-org#23237)
  cmake : do not check for bin install dir (ggml-org#23234)
  feat: Support d_conv=15 for ssm-conv.cu (ggml-org#23017)
  ...
kgrama pushed a commit to kgrama/llama.cpp that referenced this pull request May 19, 2026
…2435)

* kleidiai: added kleidiai-server to server-self-hosted workflow

 * Added KleidiAI-enabled Arm64 Linux llama-server CI/integration test
   workflow into the server-self-hosted.yml configuration file

Signed-off-by: Martin Klacer <martin.klacer@arm.com>
Change-Id: I032e33c525b7e26bc5d53719f638bee610cec1ee

* Added self-hosted executor for KleidiAI server workflow

Signed-off-by: Martin Klacer <martin.klacer@arm.com>

* Update .github/workflows/server-self-hosted.yml

Co-authored-by: Sigbjørn Skjæret <sigbjorn.skjaeret@scala.com>

---------

Signed-off-by: Martin Klacer <martin.klacer@arm.com>
Co-authored-by: Sigbjørn Skjæret <sigbjorn.skjaeret@scala.com>