github
GitHub APIKeeps: repo, release, stars delta
- 2026-05-18ggerganov/llama.cpp b9221: b9221
<details open> ggml-hexagon: add PAD op HVX kernel (#23078) * ggml-hexagon: add PAD op HVX kernel Implements GGML_OP_PAD on the Hexagon HTP backend using HVX vectorized kernels. Supports zero-padding and circular padding across all 4 tensor dimensions. * hex-ggml: remove dupl
AAPLgithub:ggerganov/llama.cpp - 2026-05-18ggerganov/llama.cpp b9219: b9219
<details open> common : remove hf cache migration (#23266) Signed-off-by: Adrien Gallouët <angt@huggingface.co> </details> **macOS/iOS:** - [macOS Apple Silicon (arm64)](https://github.com/ggml-org/llama.cpp/releases/download/b9219/llama-b9219-bin-macos-arm64.tar.gz) - [macOS
AAPLgithub:ggerganov/llama.cpp - 2026-05-18ggerganov/llama.cpp b9216: b9216
<details open> ui: Refactor models store, MCP service, and gate logs behind VITE_DEBUG (#23236) * refactor: Scope console logs to `DEV` + `VITE_DEBUG` env vars * refactor: skip MCP proxy probe when no server requires it * refactor: suppress expected disconnect errors during M
AAPLgithub:ggerganov/llama.cpp - 2026-05-18ggerganov/llama.cpp b9213: b9213
<details open> llama: initialize pre-norm embedding mask flag (#23256) </details> **macOS/iOS:** - [macOS Apple Silicon (arm64)](https://github.com/ggml-org/llama.cpp/releases/download/b9213/llama-b9213-bin-macos-arm64.tar.gz) - [macOS Apple Silicon (arm64, KleidiAI enabled)](
AAPLgithub:ggerganov/llama.cpp - 2026-05-18ggerganov/llama.cpp b9209: b9209
<details open> sycl: scalar SWAR byte-subtract in Q6_K MMVQ dot product (#22156) Signed-off-by: Chun Tao <chun.tao@intel.com> Co-authored-by: Chun Tao <chun.tao@intel.com> </details> **macOS/iOS:** - [macOS Apple Silicon (arm64)](https://github.com/ggml-org/llama.cpp/releases
AAPLgithub:ggerganov/llama.cpp - 2026-05-18ggerganov/llama.cpp b9208: b9208
<details open> sycl: route small f32 matmuls to oneMKL, bypass oneDNN (#22150) Signed-off-by: Chun Tao <chun.tao@intel.com> Co-authored-by: Chun Tao <chun.tao@intel.com> </details> **macOS/iOS:** - [macOS Apple Silicon (arm64)](https://github.com/ggml-org/llama.cpp/releases/d
AAPLgithub:ggerganov/llama.cpp - 2026-05-18ggerganov/llama.cpp b9204: b9204
<details open> feat: Support d_conv=15 for ssm-conv.cu (#23017) Branch: ModalityConditionalAdapters AI-usage: none Signed-off-by: Gabe Goodhart <ghart@us.ibm.com> </details> **macOS/iOS:** - [macOS Apple Silicon (arm64)](https://github.com/ggml-org/llama.cpp/releases/download
AAPLgithub:ggerganov/llama.cpp - 2026-05-18ggerganov/llama.cpp b9203: b9203
<details open> cmake : fix LLAMA_BUILD_UI logic (#23190) </details> **macOS/iOS:** - [macOS Apple Silicon (arm64)](https://github.com/ggml-org/llama.cpp/releases/download/b9203/llama-b9203-bin-macos-arm64.tar.gz) - [macOS Apple Silicon (arm64, KleidiAI enabled)](https://github
AAPLgithub:ggerganov/llama.cpp