Skip to content

Actions: ggml-org/llama.cpp

Actions

Pull Request Labeler

Actions

Loading...
Loading

Show workflow options

Create status badge

Loading
17,750 workflow runs
17,750 workflow runs

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

Add EXAONE MoE implementations
Pull Request Labeler #22981: Pull request #18543 synchronize by nuxlear
Queued
HIP: add fattn-mma-f16 for RDNA4
Pull Request Labeler #22980: Pull request #18481 synchronize by zhang-hui-yulo
17s
ggml-cuda: extend concat support for more types
Pull Request Labeler #22977: Pull request #18690 opened by Lourdle
1m 0s
vulkan: fix push constant size for quantize_q8_1
Pull Request Labeler #22974: Pull request #18687 synchronize by jeffbolznv
3m 12s
Add EXAONE MoE implementations
Pull Request Labeler #22972: Pull request #18543 synchronize by nuxlear
22s
full modern bert support
Pull Request Labeler #22971: Pull request #18330 synchronize by ryan-mangeno
10s
ggml webgpu: initial flashattention implementation
Pull Request Labeler #22968: Pull request #18610 synchronize by reeselevine
1h 28m 57s
implement adaptive-p sampler
Pull Request Labeler #22967: Pull request #17927 synchronize by ddh0
1h 0m 25s
model: try to improve Qwen3 Next
Pull Request Labeler #22966: Pull request #18683 opened by ngxson
1h 1m 55s
opencl: add fill op
Pull Request Labeler #22965: Pull request #18682 opened by shaofeiqi
44m 32s
Autoparser - complete refactoring of parser architecture
Pull Request Labeler #22964: Pull request #18675 synchronize by pwilkin
19m 16s
implement adaptive-p sampler
Pull Request Labeler #22963: Pull request #17927 synchronize by ddh0
15m 40s
Autoparser - complete refactoring of parser architecture
Pull Request Labeler #22958: Pull request #18675 synchronize by pwilkin
12m 27s
Add EXAONE MoE implementations
Pull Request Labeler #22957: Pull request #18543 synchronize by nuxlear
27m 37s