llama : fix kv shift bug #3835

ggerganov · 2023-10-28T15:33:09Z

resolve #3825

The KV shift was applied only if there was at least one token "leaving" the cache.
Also, this change accumulates deltas from sequential shifts as proposed in #3825.

ggml-ci

* ggml-org/llama.cpp#3835

ggml-ci

* ggml-org/llama.cpp#3835

llama : fix kv shift bug

fb64583

ggml-ci

ggerganov mentioned this pull request Oct 28, 2023

llama_kv_cache_seq_shift delta does not appear to be calculated properly #3825

Closed

ggerganov merged commit 71a09da into master Oct 29, 2023

ggerganov deleted the fix-kv-shift branch October 29, 2023 16:32

Nexesenex pushed a commit to Nexesenex/croco.cpp that referenced this pull request Oct 30, 2023

llama : fix kv shift bug (ggml-org#3835)

522a1d3

ggml-ci

brittlewis12 added a commit to brittlewis12/llmfarm_core.swift that referenced this pull request Nov 17, 2023

Fix kv shift bug

2a78d85

* ggml-org/llama.cpp#3835

olexiyb pushed a commit to Sanctum-AI/llama.cpp that referenced this pull request Nov 23, 2023

llama : fix kv shift bug (ggml-org#3835)

dae17a4

ggml-ci

brittlewis12 added a commit to brittlewis12/llmfarm_core.swift that referenced this pull request Nov 30, 2023

Fix kv shift bug

d49648f

* ggml-org/llama.cpp#3835

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

llama : fix kv shift bug #3835

llama : fix kv shift bug #3835

ggerganov commented Oct 28, 2023

llama : fix kv shift bug #3835

llama : fix kv shift bug #3835

Conversation

ggerganov commented Oct 28, 2023