So far, 10 different models across 5 different architectures (including OpenAssistant and OpenChatKit models) are supported by nolanoorg/cformers.
You can now interface with the models in just 3 lines of Python:
```python
from interface import AutoInference as AI
ai = AI('OpenAssistant/oasst-sft-1-pythia-12b')
x = ai.generate("<|prompter|>What's the Earth's total population?<|endoftext|><|assistant|>", num_tokens_to_generate=100); print(x['token_str'])
```
Generation speed is the same as this repo's (75 ms/token for a 12B model on a MacBook Pro).
I have no clue about this, but I saw that chatglm-6b was published, which should run on a CPU with 16 GB of RAM, albeit very slowly.
https://huggingface.co/THUDM/chatglm-6b/tree/main
Would it be possible to substitute the LLaMA model with it?
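I'm not sure whether chatglm-6b maps onto any of cformers' supported architectures, but for reference, here is a minimal sketch of running it CPU-only through the upstream Hugging Face transformers API (this is THUDM's documented interface, not cformers'; fp32 on CPU is memory-hungry and slow):

```python
# Minimal CPU-only sketch using the upstream transformers API (not cformers).
# chatglm-6b ships custom modeling code, so trust_remote_code=True is required.
from transformers import AutoModel, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("THUDM/chatglm-6b", trust_remote_code=True)
# .float() keeps the weights in fp32 for CPU inference; expect heavy RAM use
# and slow generation compared to the quantized paths discussed above.
model = AutoModel.from_pretrained("THUDM/chatglm-6b", trust_remote_code=True).float().eval()

response, history = model.chat(tokenizer, "What's the Earth's total population?", history=[])
print(response)
```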