Replace `litellm` with native API implementations. #1252
Omaigá
This pull request is humongous, sorry about that, but it turns out that almost every piece of code with the exception of the database layer was relying on To make it easier to review, you can think of this refactoring as three big chunks of work
Feel free to pull me into a call if necessary, but my last suggestion would be to just try it out locally and see if it works.
6c342b7 to b67b829
Hey @lukehinds, thank you so much for spending time testing and reviewing this! 🙏 I would be inclined to merge this, although we have some integration test failures due to an undesired (or unexpected) compatibility of muxing with Ollama-specific APIs. TL;DR is that, by design, muxing was expecting to talk to OpenAI endpoints, but My proposal is to
Please, folks, chime in if you don't agree. Meanwhile, I'll start making the described changes.
25a35ad to 4aab11c
Refactors client architecture to use native implementations instead of `litellm` dependency. Adds support for OpenAPI, Ollama, OpenRouter, and fixes multiple issues with Anthropic and Copilot providers. Improves message handling and streaming responses. Commit message brought to you by Anthropic Claude 3.7. Co-Authored-By: Jakub Hrozek <[email protected]>
This was missed. Signed-off-by: Juan Antonio Osorio <[email protected]>
This change aims to make it simpler to track down in which step of the pipeline a particular exception occurred.
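For readers unfamiliar with the idea, here is a minimal sketch of one way to attribute an exception to a pipeline step. It is not the code from this PR: `PipelineStepError`, `run_pipeline`, and the step names are hypothetical, and the real pipeline may attach this information differently (for example through logging).

```python
from typing import Any, Callable, Iterable


class PipelineStepError(Exception):
    """Hypothetical wrapper that records which step raised."""

    def __init__(self, step_name: str, original: Exception) -> None:
        super().__init__(f"pipeline step {step_name!r} failed: {original!r}")
        self.step_name = step_name
        self.original = original


def run_pipeline(
    steps: Iterable[tuple[str, Callable[[Any], Any]]], payload: Any
) -> Any:
    """Run each named step in order, feeding each result to the next step."""
    for name, step in steps:
        try:
            payload = step(payload)
        except Exception as exc:
            # Re-raise with the step name attached; `from exc` keeps the
            # original traceback available as __cause__.
            raise PipelineStepError(name, exc) from exc
    return payload
```

With something like this, a caller can catch `PipelineStepError` and read `step_name` to see where the failure happened, while the original exception stays available as `original`.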
4aab11c to 7212f18
bf8d9e8 to 1452eff