github
GitHub APIKeeps: repo, release, stars delta
- 2026-05-30ggerganov/llama.cpp b9437: b9437
<details open> Support `-fa auto` in llama-bench (#23714) * Support `-fa auto` in llama-bench Make the default value of `-ngl` -1, similar to other tools. Update README with latest usage and examples * Address review comments </details> **macOS/iOS:** - [macOS Apple Silico
github:ggerganov/llama.cpp - 2026-05-30ggerganov/llama.cpp b9436: b9436
<details open> opencl: support bf16 by converting to f16 (#23839) </details> **macOS/iOS:** - [macOS Apple Silicon (arm64)](https://github.com/ggml-org/llama.cpp/releases/download/b9436/llama-b9436-bin-macos-arm64.tar.gz) - macOS Apple Silicon (arm64, KleidiAI enabled) [DISABL
github:ggerganov/llama.cpp - 2026-05-30ggerganov/llama.cpp b9434: b9434
<details open> TP: fix granularity for Qwen 3.5/3.6 + 3 GPUs (#23843) * TP: fix granularity for Qwen 3.5/3.6 + 3 GPUs * fix afmoe TP </details> **macOS/iOS:** - [macOS Apple Silicon (arm64)](https://github.com/ggml-org/llama.cpp/releases/download/b9434/llama-b9434-bin-macos-
github:ggerganov/llama.cpp - 2026-05-30ggerganov/llama.cpp b9433: b9433
<details open> metal : restore im2col implementation for large kernels (#23901) </details> **macOS/iOS:** - [macOS Apple Silicon (arm64)](https://github.com/ggml-org/llama.cpp/releases/download/b9433/llama-b9433-bin-macos-arm64.tar.gz) - macOS Apple Silicon (arm64, KleidiAI en
github:ggerganov/llama.cpp - 2026-05-30ggerganov/llama.cpp b9432: b9432
<details open> test: (test-llama-archs) log the config name first (#23885) </details> **macOS/iOS:** - [macOS Apple Silicon (arm64)](https://github.com/ggml-org/llama.cpp/releases/download/b9432/llama-b9432-bin-macos-arm64.tar.gz) - macOS Apple Silicon (arm64, KleidiAI enabled
github:ggerganov/llama.cpp - 2026-05-30ggerganov/llama.cpp b9431: b9431
<details open> ci : update ios-xcode release job to macos-26 (#23906) * ci : disable libcommon build from xcframework * ocd : fix name * ci : ios-xcode change to macos-26 * cont : pin xcode * cont : pin xcode to minor version </details> **macOS/iOS:** - [macOS Apple Silic
github:ggerganov/llama.cpp - 2026-05-30ggerganov/llama.cpp b9430: b9430
<details open> ggml : add some lsx support (#23798) * loongarch : optimize LSX fp16 load/store with native intrinsics Use __lsx_vfcvtl_s_h and __lsx_vfcvt_h_s instead of scalar loops in __lsx_f16x4_load and __lsx_f16x4_store. * loongarch : add LSX implementation for q8_0 dot
github:ggerganov/llama.cpp - 2026-05-30ggerganov/llama.cpp b9428: b9428
<details open> ci : fix s390x release job (#23898) * ci : fix s390x release job * ci : multi-thread build for `ios-xcode` * ocd : names </details> **macOS/iOS:** - [macOS Apple Silicon (arm64)](https://github.com/ggml-org/llama.cpp/releases/download/b9428/llama-b9428-bin-ma
github:ggerganov/llama.cpp