Hello!
I've been trying to add the ability for the oobabooga webui to choose the number of Mixtral experts when using the llama.cpp loader, just like the ExLlamav2 loader can. I asked on the llama.cpp GitHub, and I was told there is a way to do this from the command line when using llama.cpp directly, via the --override-kv parameter.
However, the oobabooga webui is written in Python and uses llama-cpp-python. It loads the model through that library's Python interface, passing it a list of params.
I can't see how to pass the --override-kv command line parameter into this Python interface. Putting it in the list of params there doesn't seem to work. Can anyone tell me how to do this?
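For reference, here is a minimal sketch of what I'm hoping is possible. It assumes the installed llama-cpp-python is recent enough that the `Llama` constructor accepts a `kv_overrides` dict, and that the metadata key for Mixtral is `llama.expert_used_count`; I haven't confirmed either against the webui. `parse_override_kv` and `load_with_experts` are hypothetical helper names of my own, not part of any library.

```python
from typing import Dict, Union

KVValue = Union[bool, int, float]


def parse_override_kv(spec: str) -> Dict[str, KVValue]:
    """Convert a CLI-style 'KEY=TYPE:VALUE' string into a {KEY: VALUE} dict.

    Hypothetical helper: mirrors the KEY=TYPE:VALUE shape used by
    llama.cpp's --override-kv flag, for the int, float and bool types.
    """
    key, eq, typed = spec.partition("=")
    if not eq or not key:
        raise ValueError(f"expected KEY=TYPE:VALUE, got {spec!r}")
    type_name, colon, raw = typed.partition(":")
    if not colon:
        raise ValueError(f"expected TYPE:VALUE after '=', got {typed!r}")
    if type_name == "int":
        return {key: int(raw)}
    if type_name == "float":
        return {key: float(raw)}
    if type_name == "bool":
        if raw not in ("true", "false"):
            raise ValueError(f"bool value must be 'true' or 'false', got {raw!r}")
        return {key: raw == "true"}
    raise ValueError(f"unsupported type {type_name!r}")


def load_with_experts(model_path: str, n_experts: int):
    """Load a GGUF model with the active-expert count overridden.

    Assumption: Llama(...) accepts kv_overrides, and the key below is the
    one Mixtral reads. Imported lazily so the parser above can be used
    without llama-cpp-python installed.
    """
    from llama_cpp import Llama

    overrides = parse_override_kv(f"llama.expert_used_count=int:{n_experts}")
    return Llama(model_path=model_path, kv_overrides=overrides)
```

If this is right, the webui would only need to add a `kv_overrides` entry to the params it already passes, rather than trying to forward a command line flag.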