-
Run "main --help" to see an explanation of each parameter.
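For the GPU part of the question: the help output lists an `-ngl` / `--n-gpu-layers` option, which sets how many layers are offloaded. Below is a minimal launcher sketch that adds it to the command quoted in the question. It makes some assumptions: the binary is `main.exe` in the current directory, the model name is the placeholder from the question, and the layer count of 32 is an arbitrary example, not a recommendation. The flag only has an effect in a GPU-enabled build, so check `main --help` for your binary first.

```python
import subprocess
from pathlib import Path


def build_command(binary, model, n_gpu_layers):
    """Return the argument list from the question plus a GPU offload flag."""
    return [
        binary,
        "-i", "--interactive-first",
        "-r", "### Human:",
        "--temp", "0",
        "-c", "2048",
        "-n", "-1",
        "--ignore-eos",
        "--repeat_penalty", "1.2",
        "--instruct",
        "-ngl", str(n_gpu_layers),  # short form of --n-gpu-layers
        "-m", model,
    ]


# Placeholder values: 32 layers is an assumption, tune it to your VRAM.
cmd = build_command("main.exe", "whateverModelFileName", 32)

# Show the command in Windows-style quoting so it can be pasted into a prompt.
print(subprocess.list2cmdline(cmd))

# Launch only if the binary is present in the current directory.
binary_path = Path(cmd[0])
if binary_path.exists():
    subprocess.run([str(binary_path.resolve())] + cmd[1:], check=False)
```

If the startup log still reports 0 layers offloaded after adding the flag, the binary was most likely built without GPU support, and a different release binary is needed.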
-
You can find the documentation for the main binary here: https://github.com/ggerganov/llama.cpp/blob/master/examples/main/README.md
-
Maybe I'm dumb and missed it, but there seems to be effectively zero documentation on using the Windows binaries. I was able to piece together enough to get it running, but there are still some things I don't get. For example, it would have been nice to have a description of which binary to use and what each version means.
And I still haven't found a list of the parameters for launching main.exe. I found another post where someone used the following:
main.exe -i --interactive-first -r "### Human:" --temp 0 -c 2048 -n -1 --ignore-eos --repeat_penalty 1.2 --instruct -m whateverModelFileName
This works, but I want to be able to use my GPU, yet it says
offloading 0 layers to GPU,
total VRAM used: 0 MB
and I can't find anything that explains how to change that.