github
GitHub APIKeeps: repo, release, stars delta
- 2026-05-15ggerganov/llama.cpp b9173: b9173
<details open> ci : fix release symlinks (#23119) </details> **macOS/iOS:** - [macOS Apple Silicon (arm64)](https://github.com/ggml-org/llama.cpp/releases/download/b9173/llama-b9173-bin-macos-arm64.tar.gz) - [macOS Apple Silicon (arm64, KleidiAI enabled)](https://github.com/gg
AAPLgithub:ggerganov/llama.cpp - 2026-05-15ggerganov/llama.cpp b9172: b9172
<details open> webui: Use lowercase hash for HF checksum check (#23107) </details> **macOS/iOS:** - [macOS Apple Silicon (arm64)](https://github.com/ggml-org/llama.cpp/releases/download/b9172/llama-b9172-bin-macos-arm64.tar.gz) - [macOS Apple Silicon (arm64, KleidiAI enabled)]
AAPLgithub:ggerganov/llama.cpp - 2026-05-15ggerganov/llama.cpp b9169: b9169
<details open> mtmd: add chunks and fix preproc for qwen3a (#23073) * mtmd: add chunks and fix preproc for qwen3a * add attn_mask * limit mtmd_chunk size (avoid blow up memory) * correct audio tokens * re-order the set_input case * remove attn_mask </details> **macOS/iOS
AAPLgithub:ggerganov/llama.cpp - 2026-05-15ggerganov/llama.cpp b9165: b9165
<details open> ci : fix transform of top . entry in release archive (#23080) * fix transform of top . entry in release archive * simplify </details> **macOS/iOS:** - [macOS Apple Silicon (arm64)](https://github.com/ggml-org/llama.cpp/releases/download/b9165/llama-b9165-bin-m
github:ggerganov/llama.cpp - 2026-05-15ggerganov/llama.cpp b9163: b9163
<details open> reasoning-budget: clone should do a deep-copy (#23095) </details> **macOS/iOS:** - [macOS Apple Silicon (arm64)](https://github.com/ggml-org/llama.cpp/releases/download/b9163/llama-b9163-bin-macos-arm64.tar.gz) - [macOS Apple Silicon (arm64, KleidiAI enabled)](h
github:ggerganov/llama.cpp - 2026-05-15ggerganov/llama.cpp b9161: b9161
<details open> Support for Codex CLI by skipping unsupported Responses tools (#23041) * Support for Codex CLI by skipping unsupported Responses tools * Warn on skipped Responses tools and preserve gpt-oss apply_patch rejection * Revert gpt-oss apply_patch special handling </
github:ggerganov/llama.cpp - 2026-05-15vllm-project/vllm v0.21.0: v0.21.0
## Highlights This release features 367 commits from 202 contributors (49 new)! * **Transformers v4 deprecated**: This release formally deprecates `transformers` v4 support (#40389). Users should migrate to `transformers` v5. * **C++20 build requirement**: vLLM now require
github:vllm-project/vllm - 2026-05-15ggerganov/llama.cpp b9159: b9159
<details open> ggml-hexagon: cpy: add contiguous fast-path in reshape copy (#23076) </details> **macOS/iOS:** - [macOS Apple Silicon (arm64)](https://github.com/ggml-org/llama.cpp/releases/download/b9159/llama-b9159-bin-macos-arm64.tar.gz) - [macOS Apple Silicon (arm64, Kleidi
github:ggerganov/llama.cpp