github
GitHub APIKeeps: repo, release, stars delta
- 2026-05-07ggerganov/llama.cpp b9066: b9066
<details open> CUDA: batch out_prod inner loop with cublasSgemmStridedBatched (#22651) * CUDA: batch out_prod inner loop with cublasSgemmStridedBatched * CUDA: batch out_prod inner loop with cublasSgemmStridedBatched * CUDA: add cublasSgemmStridedBatched mapping for HIP and M
github:ggerganov/llama.cpp - 2026-05-07ggerganov/llama.cpp b9064: b9064
<details open> llama : fix device state save/load (#22805) </details> **macOS/iOS:** - [macOS Apple Silicon (arm64)](https://github.com/ggml-org/llama.cpp/releases/download/b9064/llama-b9064-bin-macos-arm64.tar.gz) - [macOS Apple Silicon (arm64, KleidiAI enabled)](https://gith
github:ggerganov/llama.cpp - 2026-05-07ggerganov/llama.cpp b9063: b9063
<details open> opencl: add opfilter regex for debugging (#22782) </details> **macOS/iOS:** - [macOS Apple Silicon (arm64)](https://github.com/ggml-org/llama.cpp/releases/download/b9063/llama-b9063-bin-macos-arm64.tar.gz) - [macOS Apple Silicon (arm64, KleidiAI enabled)](https:
github:ggerganov/llama.cpp ## Table of Contents - [Dialect & Frontend](#dialect--frontend) - [Backend & Compiler](#backend--compiler) - [AMD/HIP Backend](#amdhip-backend) - [NVIDIA Backend](#nvidia-backend) - [Gluon & Layout Improvements](#gluon--layout-improvements) - [Kernels & Benchmarks](#kernels
github:openai/triton## Table of Contents - [Dialect & Frontend](#dialect--frontend) - [Backend & Compiler](#backend--compiler) - [AMD/HIP Backend](#amdhip-backend) - [NVIDIA Backend](#nvidia-backend) - [Gluon & Layout Improvements](#gluon--layout-improvements) - [Kernels & Benchmarks](#kernels
github:triton-lang/triton- 2026-05-07ggerganov/llama.cpp b9062: b9062
<details open> common/chat : preserve media markers for typed-content templates (#22634) </details> **macOS/iOS:** - [macOS Apple Silicon (arm64)](https://github.com/ggml-org/llama.cpp/releases/download/b9062/llama-b9062-bin-macos-arm64.tar.gz) - [macOS Apple Silicon (arm64, K
github:ggerganov/llama.cpp - 2026-05-07ggerganov/llama.cpp b9061: b9061
<details open> tests: add long-sequence cases and fix inputs for gated_delta_net (#22794) * tests : add long-seq + tail cases for gated_delta_net * tests : realistic input ranges for gated_delta_net </details> **macOS/iOS:** - [macOS Apple Silicon (arm64)](https://github.com
github:ggerganov/llama.cpp - 2026-05-07ggerganov/llama.cpp b9060: b9060
<details open> sycl: add FILL, CUMSUM, DIAG, SOLVE_TRI, SSM_SCAN, GATED_DELTA_NET (#22149) * sycl: add FILL, CUMSUM, DIAG, SOLVE_TRI, SSM_SCAN, GATED_DELTA_NET Signed-off-by: Chun Tao <chun.tao@intel.com> * Fix abort during test-backend-ops Signed-off-by: Todd Malsbary <todd
github:ggerganov/llama.cpp - 2026-05-07ggerganov/llama.cpp b9058: b9058
<details open> llama : remove unnecessary seq_id check during state restore (#22797) </details> **macOS/iOS:** - [macOS Apple Silicon (arm64)](https://github.com/ggml-org/llama.cpp/releases/download/b9058/llama-b9058-bin-macos-arm64.tar.gz) - [macOS Apple Silicon (arm64, Kleid
github:ggerganov/llama.cpp - 2026-05-07ggerganov/llama.cpp b9057: b9057
<details open> ggml-cpu: Optimized risc-v cpu q1_0 dot </details> **macOS/iOS:** - [macOS Apple Silicon (arm64)](https://github.com/ggml-org/llama.cpp/releases/download/b9057/llama-b9057-bin-macos-arm64.tar.gz) - [macOS Apple Silicon (arm64, KleidiAI enabled)](https://github.c
github:ggerganov/llama.cpp - 2026-05-07ggerganov/llama.cpp b9056: b9056
<details open> mtmd: fix whisper audio tail truncation by exposing padded buffer to FFT (#22770) </details> **macOS/iOS:** - [macOS Apple Silicon (arm64)](https://github.com/ggml-org/llama.cpp/releases/download/b9056/llama-b9056-bin-macos-arm64.tar.gz) - [macOS Apple Silicon (
github:ggerganov/llama.cpp - 2026-05-07ggerganov/llama.cpp b9050: b9050
<details open> llama : add missing call to ggml_backend_load_all() (#22752) Signed-off-by: Adrien Gallouët <angt@huggingface.co> </details> **macOS/iOS:** - [macOS Apple Silicon (arm64)](https://github.com/ggml-org/llama.cpp/releases/download/b9050/llama-b9050-bin-macos-arm64
github:ggerganov/llama.cpp - NVDAgithub:NVIDIA/Megatron-LM