github
GitHub APIKeeps: repo, release, stars delta
- 2026-06-03ggerganov/llama.cpp b9496: b9496
<details open> mtmd: fix Gemma 4 unified FPE (#24088) </details> **macOS/iOS:** - [macOS Apple Silicon (arm64)](https://github.com/ggml-org/llama.cpp/releases/download/b9496/llama-b9496-bin-macos-arm64.tar.gz) - macOS Apple Silicon (arm64, KleidiAI enabled) [DISABLED](https://
github:ggerganov/llama.cpp - 2026-06-03ggerganov/llama.cpp b9495: b9495
<details open> qwen35: use post-norm hidden state for MTP (#24025) * qwen35: use post-norm hidden state for MTP * rename pre_norm to nextn * fix step35 </details> **macOS/iOS:** - [macOS Apple Silicon (arm64)](https://github.com/ggml-org/llama.cpp/releases/download/b9495/ll
github:ggerganov/llama.cpp - 2026-06-03ggerganov/llama.cpp b9494: b9494
<details open> mtmd: enable non-causal vision for gemma 4 unified (#24082) </details> **macOS/iOS:** - [macOS Apple Silicon (arm64)](https://github.com/ggml-org/llama.cpp/releases/download/b9494/llama-b9494-bin-macos-arm64.tar.gz) - macOS Apple Silicon (arm64, KleidiAI enabled
github:ggerganov/llama.cpp - 2026-06-03ggerganov/llama.cpp b9493: b9493
<details open> mtmd, model: allow skip build_vit() (#24077) * add model * nits </details> **macOS/iOS:** - [macOS Apple Silicon (arm64)](https://github.com/ggml-org/llama.cpp/releases/download/b9493/llama-b9493-bin-macos-arm64.tar.gz) - macOS Apple Silicon (arm64, KleidiAI e
github:ggerganov/llama.cpp # Release v5.10.1 v5.10.0 was yanked as we publish on a corrupted branch. Sorry everyone, this happens when we rush a release!!! ## New Model additions ### Gemma4 unified+ Gemma4 MTP <img width="2000" height="400" alt="image" src="https://github.com/user-attachments/asse
github:huggingface/transformers- 2026-06-03ggerganov/llama.cpp b9491: b9491
<details open> Avoid PDL race conditions by disabling __restrict__ when PDL is used (#24030) * Removes __restrict__ from PDL kernel headers due to incompatibility with PDL. Adds preprocessor directives based on arch in kernel body to add __restrict__ to retain performance on ol
github:ggerganov/llama.cpp - 2026-06-03ggerganov/llama.cpp b9490: b9490
<details open> ggml-cpu: use runtime SVE width in FWHT (#24059) </details> **macOS/iOS:** - [macOS Apple Silicon (arm64)](https://github.com/ggml-org/llama.cpp/releases/download/b9490/llama-b9490-bin-macos-arm64.tar.gz) - macOS Apple Silicon (arm64, KleidiAI enabled) [DISABLED
github:ggerganov/llama.cpp - 2026-06-03ggerganov/llama.cpp b9489: b9489
<details open> cuda: reserve space for quantize kv-cache at startup (#23907) * cuda: reserve space for quantize kv-cache at startup * address review comments * remove forward decl Co-authored-by: Johannes Gäßler <johannesg@5d6.de> * remove assert in ggml-cuda.cu Co-authore
github:ggerganov/llama.cpp - 2026-06-03ggerganov/llama.cpp b9488: b9488
<details open> tests : add support for qwen3 SSM archs (#24031) * tests : add support for qwen3 SSM archs * arch : add LLM_KV_ATTENTION_RECURRENT_LAYERS * cont : naming + TODOs </details> **macOS/iOS:** - [macOS Apple Silicon (arm64)](https://github.com/ggml-org/llama.cpp/r
github:ggerganov/llama.cpp - 2026-06-03ggerganov/llama.cpp b9487: b9487
<details open> update BoringSSL to 0.20260526.0 (#23794) </details> **macOS/iOS:** - [macOS Apple Silicon (arm64)](https://github.com/ggml-org/llama.cpp/releases/download/b9487/llama-b9487-bin-macos-arm64.tar.gz) - macOS Apple Silicon (arm64, KleidiAI enabled) [DISABLED](https
github:ggerganov/llama.cpp - 2026-06-03ggerganov/llama.cpp b9486: b9486
<details open> ci : disable ccache for msvc windows release jobs (#23911) </details> **macOS/iOS:** - [macOS Apple Silicon (arm64)](https://github.com/ggml-org/llama.cpp/releases/download/b9486/llama-b9486-bin-macos-arm64.tar.gz) - macOS Apple Silicon (arm64, KleidiAI enabled)
github:ggerganov/llama.cpp - 2026-06-03ggerganov/llama.cpp b9485: b9485
<details open> arg : removed unecesary mmproj download when users pass --no-mmproj (#23425) </details> **macOS/iOS:** - [macOS Apple Silicon (arm64)](https://github.com/ggml-org/llama.cpp/releases/download/b9485/llama-b9485-bin-macos-arm64.tar.gz) - macOS Apple Silicon (arm64,
github:ggerganov/llama.cpp