github
GitHub APIKeeps: repo, release, stars delta
- 2026-05-01ggerganov/llama.cpp b8999: b8999
<details open> llama-quant : fix `--tensor-type` when default `qtype` is overriden (#22572) fix #22544 (my fault!) Credit to @Anai-Guo, ref #22559 - since that one was closed due to the new contributor policy I am taking the liberty of re-submitting that PR here. </details>
github:ggerganov/llama.cpp - 2026-05-01ggerganov/llama.cpp b8998: b8998
<details open> hexagon: enable non-contiguous row tensor support for unary ops (#22574) </details> **macOS/iOS:** - [macOS Apple Silicon (arm64)](https://github.com/ggml-org/llama.cpp/releases/download/b8998/llama-b8998-bin-macos-arm64.tar.gz) - [macOS Apple Silicon (arm64, Kl
github:ggerganov/llama.cpp - 2026-05-01ggerganov/llama.cpp b8996: b8996
<details open> ggml-webgpu: Fix vectorized handling in mul-mat and mul-mat-id (#22578) * Fix vectorized condition of mul-mat-fast pipeline and add vectorized variant to mul-mat-id * Apply suggestion from @CISC Co-authored-by: Sigbjørn Skjæret <sigbjorn.skjaeret@scala.com> --
github:ggerganov/llama.cpp - 2026-05-01ggerganov/llama.cpp b8994: b8994
<details open> ggml-webgpu: add the upscale shader (#22419) * shader(upscale): add the upscale shader with nearest, bilinear and bicubic implementations * shader(upscale): use macro </details> **macOS/iOS:** - [macOS Apple Silicon (arm64)](https://github.com/ggml-org/llama.c
github:ggerganov/llama.cpp