github
GitHub APIKeeps: repo, release, stars delta
- 2026-06-27ggerganov/llama.cpp b9828: b9828
<details open> opencl: flash attention improvement (#25069) * opencl: rework FA kernel for f16 and f32 * opencl: flash-attention prefill prepass kernels - flash_attn_kv_pad_f16 pads the tail KV tile to a BLOCK_N multiple - flash_attn_mask_pad_f16 pads the matching mask ti
github:ggerganov/llama.cpp - 2026-06-27ggerganov/llama.cpp b9827: b9827
<details open> [CUDA] Added a cudaMemcpy2DAsync fast path to ggml_cuda_cpy (#25057) * [CUDA] Added a cudaMemcpy2DAsync fast path to ggml_cuda_cpy Add a CUDA ggml_cpy fast path for same-type, same-shape strided copies that are just 2D pitched block copies. When tensors are not
github:ggerganov/llama.cpp - 2026-06-27ggerganov/llama.cpp b9826: b9826
<details open> sycl : fix failed ut cases of norm (#25044) </details> **macOS/iOS:** - [macOS Apple Silicon (arm64)](https://github.com/ggml-org/llama.cpp/releases/download/b9826/llama-b9826-bin-macos-arm64.tar.gz) - macOS Apple Silicon (arm64, KleidiAI enabled) [DISABLED](htt
github:ggerganov/llama.cpp - 2026-06-27ggerganov/llama.cpp b9825: b9825
<details open> vulkan: fix step operator for 0 input (#25036) </details> **macOS/iOS:** - [macOS Apple Silicon (arm64)](https://github.com/ggml-org/llama.cpp/releases/download/b9825/llama-b9825-bin-macos-arm64.tar.gz) - macOS Apple Silicon (arm64, KleidiAI enabled) [DISABLED](
github:ggerganov/llama.cpp - 2026-06-27ggerganov/llama.cpp b9824: b9824
<details open> binaries : Improve rpc-server and export-graph-ops names. (#25045) Tests are generally prefixed with -test, so rename export-graph-ops accordingly. rpc-server is probably too generic a name for /usr/bin. Because it should work with any ggml application, it is re
github:ggerganov/llama.cpp - 2026-06-27ggerganov/llama.cpp b9823: b9823
<details open> ci : add windows-openvino to check-release (#25022) </details> **macOS/iOS:** - [macOS Apple Silicon (arm64)](https://github.com/ggml-org/llama.cpp/releases/download/b9823/llama-b9823-bin-macos-arm64.tar.gz) - macOS Apple Silicon (arm64, KleidiAI enabled) [DISAB
github:ggerganov/llama.cpp - 2026-06-27ggerganov/llama.cpp b9822: b9822
<details open> tests : fix test-chat-template --no-common option (#25075) </details> **macOS/iOS:** - [macOS Apple Silicon (arm64)](https://github.com/ggml-org/llama.cpp/releases/download/b9822/llama-b9822-bin-macos-arm64.tar.gz) - macOS Apple Silicon (arm64, KleidiAI enabled)
github:ggerganov/llama.cpp