github
GitHub API
Keeps: repo, release, stars delta
- 2026-07-01 ggerganov/llama.cpp b9859: b9859
<details open> opencl: allow loading precompiled binary kernels from library (#23042) * opencl: allow loading binary kernel * opencl: add libdl.h * ggml-backend-dl is in ggml, which depends backend libs, thus ggml-opencl cannot depend on ggml-backend-dl * add libdl.h to bre
github:ggerganov/llama.cpp
- 2026-07-01 ggerganov/llama.cpp b9858: b9858
<details open> common : use hf primary split as model path (#25194) Fixes #25181 </details> **macOS/iOS:** - [macOS Apple Silicon (arm64)](https://github.com/ggml-org/llama.cpp/releases/download/b9858/llama-b9858-bin-macos-arm64.tar.gz) - macOS Apple Silicon (arm64, KleidiAI
github:ggerganov/llama.cpp
- 2026-07-01 ggerganov/llama.cpp b9857: b9857
<details open> hexagon: flash attention rework (optimizations, accuracy improvements, etc) (#25085) * hex-mm: fold mm quant tasks into the main matmul threads * hex-mm: minor formatting fixes * hex-mm: cleanup is_quant checks in dma dispatch * hex-mm: fix dst-spad alignment
github:ggerganov/llama.cpp
- 2026-07-01 ggerganov/llama.cpp b9856: b9856
<details open> CUDA: consistent use of __restrict__ + PDL for FA (#25185) </details> **macOS/iOS:** - [macOS Apple Silicon (arm64)](https://github.com/ggml-org/llama.cpp/releases/download/b9856/llama-b9856-bin-macos-arm64.tar.gz) - macOS Apple Silicon (arm64, KleidiAI enabled)
github:ggerganov/llama.cpp
- 2026-07-01 ggerganov/llama.cpp b9855: b9855
<details open> ggml-cpu: add AVX2 optimization for nvfp4 dot product and use UE4M3 LUT (#23961) </details> **macOS/iOS:** - [macOS Apple Silicon (arm64)](https://github.com/ggml-org/llama.cpp/releases/download/b9855/llama-b9855-bin-macos-arm64.tar.gz) - macOS Apple Silicon (ar
github:ggerganov/llama.cpp
- 2026-07-01 ggerganov/llama.cpp b9852: b9852
<details open> opencl: initial q1_0 support (#25160) * opencl: general q1_0 support * opencl: add Adreno GEMM/GEMV for q1_0 </details> **macOS/iOS:** - [macOS Apple Silicon (arm64)](https://github.com/ggml-org/llama.cpp/releases/download/b9852/llama-b9852-bin-macos-arm64.tar
github:ggerganov/llama.cpp