1 parent ed9a54e, commit b472f3f
README.md
@@ -11,6 +11,7 @@ Inference of [LLaMA](https://arxiv.org/abs/2302.13971) model in pure C/C++
**Hot topics:**
+- Simple web chat example: https://github.com/ggerganov/llama.cpp/pull/1998
- k-quants now support super-block size of 64: https://github.com/ggerganov/llama.cpp/pull/2001
- New roadmap: https://github.com/users/ggerganov/projects/7
- Azure CI brainstorming: https://github.com/ggerganov/llama.cpp/discussions/1985