
Conversation

ggerganov (Member)

fix #15215 #15104

When `seq_id == -1`, the tokens are removed from all KV cache buffers (i.e. from every sequence), instead of tripping the `GGML_ASSERT` in `llama-kv-cache-unified.cpp` reported in the linked issue.
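
For context, a caller-side sketch of how this case is reached, assuming the `llama_kv_self_seq_rm` entry point (the public wrapper has been renamed across llama.cpp versions, so treat the exact function name as an assumption rather than the one this PR touches):

```cpp
#include "llama.h"

// Remove all cached tokens from every sequence of a context.
// seq_id == -1 selects all sequences; p0 == -1 and p1 == -1 select the
// full position range, so after this call no tokens remain in any
// KV cache buffer.
//
// Note: the wrapper name is an assumption here; older/newer trees expose
// the same operation as llama_kv_cache_seq_rm or via the memory API.
static void clear_all_sequences(llama_context * ctx) {
    llama_kv_self_seq_rm(ctx, /*seq_id =*/ -1, /*p0 =*/ -1, /*p1 =*/ -1);
}
```

This is the pattern used, for example, when restoring a previously saved prompt/state and the existing cache contents must be discarded first, which is the scenario the linked issue describes.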

ggerganov merged commit 228f724 into master on August 11, 2025 (53 of 55 checks passed).
ggerganov deleted the gg/kv-cache-fix-seq-rm branch on August 11, 2025 at 10:58.
Development

Successfully merging this pull request may close these issues.

Misc. bug: llama-kv-cache-unified.cpp:222: GGML_ASSERT(seq_id >= 0 && (size_t) seq_id < seq_to_stream.size()) failed when loading processed prompt again