Add Ollama as a supported provider #10
base: main
Conversation
This is so that calling `models.refresh!` won't attempt to use every provider API, which otherwise results in errors unless a valid key is given for every one. If this is too disruptive as default behavior, it could instead be a "default to offline" mode that has to be explicitly turned on, perhaps via an environment variable; in its absence, behavior would be the same as before. This is all so that `models.refresh!` can be called freely to populate Ollama models at runtime.
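Roughly, that opt-in variant could look like the sketch below. The env var name, the provider registry, and `configured?` are illustrative only, not code from this branch:

```ruby
# Hypothetical sketch of an opt-in "default offline" mode.
module OfflineMode
  PROVIDERS = {} # name => provider object responding to configured?

  def self.offline_by_default?
    ENV.key?('RUBYLLM_DEFAULT_OFFLINE')
  end

  # Providers that models.refresh! is allowed to hit.
  def self.refreshable_providers
    return PROVIDERS.values unless offline_by_default?

    # Only query providers the user explicitly configured, so refresh!
    # never errors out on missing API keys.
    PROVIDERS.values.select(&:configured?)
  end
end
```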
This was done by copying and adapting the existing Gemini provider. Tool usage and media were excised for now; I still need to research what Ollama offers and how it overlaps with this project.

Also, commit 9b387c1 disables all providers by default. The intention is to be able to work on Ollama (or any other single provider) without being forced to provide valid keys for EVERY provider and incur live calls; in particular, because Ollama doesn't come with any default models, its models have to be populated at runtime via `models.refresh!`. As mentioned in the commit message, this might be too intrusive for the intended usage of this project, so an alternative is to only do this when a specific env var puts ruby-llm into such a "default offline" mode.

This might interfere with the tests and/or the models update rake task, neither of which I have touched yet, since they also require valid keys for all providers. As for tests, I'd like some guidance on how to implement unit testing and eventually integration testing specifically for Ollama, in a way that does not require configuring and using every API (and the attendant cost). Since Ollama does not come with default models, I suggest a separate test suite that first ensures a tiny model is downloaded into the Ollama server via its API.
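For example, the setup step of such a suite could pull a small model through Ollama's `/api/pull` endpoint before the Ollama specs run. The model choice, base URL, and helper name below are assumptions, just to show the shape:

```ruby
require 'json'
require 'net/http'
require 'uri'

OLLAMA_BASE = ENV.fetch('OLLAMA_API_BASE', 'http://localhost:11434')
TEST_MODEL  = 'all-minilm' # tiny embedding model to keep CI downloads small

# Make sure the test model is present on the local Ollama server.
def ensure_test_model!
  uri = URI("#{OLLAMA_BASE}/api/pull")
  # stream: false makes Ollama reply once the pull has completed
  res = Net::HTTP.post(uri, { name: TEST_MODEL, stream: false }.to_json,
                       'Content-Type' => 'application/json')
  raise "could not pull #{TEST_MODEL}: #{res.body}" unless res.is_a?(Net::HTTPSuccess)
end
```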
unless data == '[DONE]'
  parsed_data = JSON.parse(data)
  block.call(parsed_data)
content_type = env.response_headers['content-type']
I'm not super sure about this; as far as I could see, Ollama uses newline-delimited JSON lines rather than standard server-sent event streams.
Of the API providers I only have a Gemini key, and it does work after this commit (both streaming and sync), so this might be correct.
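For reference, handling an NDJSON stream boils down to splitting the body on newlines and parsing each complete line. The buffering below is only a rough illustration, not the actual streaming code in this branch:

```ruby
require 'json'

# Yield one parsed JSON object per complete line; a partial trailing
# line stays in `buffer` until the next chunk arrives.
def each_ndjson_line(raw_chunk, buffer)
  buffer << raw_chunk
  *complete, rest = buffer.split("\n", -1)
  buffer.replace(rest || '')
  complete.reject(&:empty?).each { |line| yield JSON.parse(line) }
end

buffer = +''
each_ndjson_line(%({"done":false}\n{"done":true}\n), buffer) { |obj| p obj }
```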
Was just about to file an issue for this — awesome!
As mentioned in issue #2, Ollama support is valuable for users interested in self-hosted/offline inference.
This PR adds initial support for Ollama, including chat completions, streaming, and embeddings. There is no tool support yet; that needs further investigation.
The PR is a rough draft and will likely take some back and forth to get merge-ready; more comments to follow.
Closes #2
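Once this lands, usage could look roughly like the following. The `ollama_api_base` setting and the model name are assumptions based on this draft, not the final API:

```ruby
require 'ruby_llm'

RubyLLM.configure do |config|
  # point at a local Ollama server; no API key needed
  config.ollama_api_base = ENV.fetch('OLLAMA_API_BASE', 'http://localhost:11434')
end

RubyLLM.models.refresh!                 # discover locally installed models
chat = RubyLLM.chat(model: 'llama3.2')  # assumes this model has been pulled
puts chat.ask('Hello from a self-hosted model!').content
```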