Source https://github.com/ggerganov/llama.cpp/pull/4309
Source ggml-org/llama.cpp#4309