
feat: adding cache to LLMEndpoint #3555

Merged 1 commit into main from feat/cache_LLMEndpoint on Jan 27, 2025
Conversation

@jacopo-chevallard (Collaborator) commented Jan 27, 2025

This allows us to avoid repeating expensive operations, such as reloading the tokenizers, on each call.

Closes ENT-394
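A minimal sketch of the caching idea described above: the expensive tokenizer load runs once per model name, and subsequent calls return the cached object. The `load_tokenizer` function and its fake loading cost are illustrative assumptions, not the actual `LLMEndpoint` implementation from this PR.

```python
from functools import lru_cache

# Track how many real (uncached) loads happen, for demonstration only.
CALLS = {"count": 0}

@lru_cache(maxsize=None)
def load_tokenizer(model_name: str) -> dict:
    """Hypothetical expensive loader; cached so repeat calls are free."""
    CALLS["count"] += 1
    # Stand-in for the costly part, e.g. reading tokenizer files from disk.
    return {"model": model_name, "vocab_size": 32_000}

if __name__ == "__main__":
    t1 = load_tokenizer("my-model")
    t2 = load_tokenizer("my-model")  # served from the cache, no reload
    print(CALLS["count"], t1 is t2)  # → 1 True
```

Because `lru_cache` keys on the arguments, each distinct model name is loaded once; identical calls return the very same object, which is why `t1 is t2` holds.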

@dosubot dosubot bot added the size:S This PR changes 10-29 lines, ignoring generated files. label Jan 27, 2025
linear bot commented Jan 27, 2025

ENT-394 Cache tokenizer

@dosubot dosubot bot added the lgtm This PR has been approved by a maintainer label Jan 27, 2025
@AmineDiro AmineDiro merged commit 6072907 into main Jan 27, 2025
7 checks passed
@AmineDiro AmineDiro deleted the feat/cache_LLMEndpoint branch January 27, 2025 10:47
StanGirard added a commit that referenced this pull request Jan 27, 2025
🤖 I have created a release *beep* *boop*
---


## [0.0.30](core-0.0.29...core-0.0.30) (2025-01-27)


### Features

* adding cache to LLMEndpoint ([#3555](#3555)) ([6072907](6072907))

---
This PR was generated with [Release
Please](https://github.com/googleapis/release-please). See
[documentation](https://github.com/googleapis/release-please#release-please).