Skip to content

Commit 6391817

Browse files
authored
llama : document logits_all deprecation (#4418)
llama_context_params.logits_all is a parameter for controlling llama_eval. This documents that logits_all should not be used with llama_decode and llama_batch.
1 parent d9d4cfe commit 6391817

File tree

1 file changed

+1
-1
lines changed

1 file changed

+1
-1
lines changed

llama.h

+1-1
Original file line numberDiff line numberDiff line change
@@ -216,7 +216,7 @@ extern "C" {
216216

217217
// Keep the booleans together to avoid misalignment during copy-by-value.
218218
bool mul_mat_q; // if true, use experimental mul_mat_q kernels (DEPRECATED - always true)
219-
bool logits_all; // the llama_eval() call computes all logits, not just the last one
219+
bool logits_all; // the llama_eval() call computes all logits, not just the last one (DEPRECATED - set llama_batch.logits instead)
220220
bool embedding; // embedding mode only
221221
bool offload_kqv; // whether to offload the KQV ops (including the KV cache) to GPU
222222
};

0 commit comments

Comments
 (0)