community[patch]: llama cpp stream generation and abort by invoke method #4942
Hi! I have recently been using LangChain to develop my own application. When using Llama CPP, I noticed that the streaming generation approach described in the documentation cannot be aborted. However, after consulting the official node-llama-cpp documentation, I found that streaming and aborting can also be achieved through `invoke`: https://withcatai.github.io/node-llama-cpp/guide/chat-session#response-streaming. It only requires a slight modification of the onToken handling to resolve this issue, so I made some changes to the Llama CPP part.
Here are the test results:
