server : re-enable completion and embedded at the same time, fixes #3815 #3876
I tested this branch with
which returns
and
which returns an embedding vector
LGTM
I apologize for the inconvenience. I am running the server with the following parameters: -cb -v --embedding -np 3 -c 8192 --host "0.0.0.0" -ngl 64. When I send multiple embedding requests, a segmentation fault occurs. It seems to happen whenever 2 slots are processing an embedding task at the same time. I hope a fix is possible soon. Thank you very much.
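For anyone trying to reproduce the report above, here is a minimal sketch of a client that sends several embedding requests in parallel so that more than one slot is busy at once. It is not taken from the original comment. It assumes the server is reachable on the default port 8080 of the local machine and that the `/embedding` endpoint accepts a JSON body with a `content` field; the helper names (`build_request`, `post_embedding`, `run_concurrently`) and the sample texts are hypothetical.

```python
import json
import urllib.request
from concurrent.futures import ThreadPoolExecutor

# Assumption: the server listens on the default port 8080 on this machine.
SERVER_URL = "http://127.0.0.1:8080/embedding"


def build_request(text, url=SERVER_URL):
    """Build a POST request carrying one text to embed."""
    body = json.dumps({"content": text}).encode("utf-8")
    return urllib.request.Request(
        url,
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def post_embedding(text):
    """Send one embedding request and return the decoded JSON response."""
    with urllib.request.urlopen(build_request(text), timeout=60) as resp:
        return json.load(resp)


def run_concurrently(texts, send=post_embedding, workers=3):
    """Call `send` on every text using `workers` threads, keeping input order."""
    with ThreadPoolExecutor(max_workers=workers) as pool:
        return list(pool.map(send, texts))


if __name__ == "__main__":
    # Hypothetical inputs: more requests than slots, so several slots overlap.
    texts = [f"sample sentence {i}" for i in range(12)]
    results = run_concurrently(texts)
    print(len(results), "responses received")
```

With `workers=3` the client matches the `-np 3` setting from the report, so up to three embedding tasks can be in flight together. If the server crashes, the script should fail with a connection error instead of printing the response count.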
Fixes #3815