Add Local Model Support via Ollama Integration #2

Open
crmne opened this issue Mar 11, 2025 · 1 comment · May be fixed by #10

crmne (Owner) commented Mar 11, 2025

TL;DR: Add support for running fully local models via Ollama.

Background

While cloud models offer state-of-the-art capabilities, there are compelling reasons to run models locally:

  1. Privacy & Compliance: Keep sensitive data entirely on-premise
  2. Cost Control: Eliminate ongoing API costs for high-volume applications
  3. Latency: Remove network overhead for latency-sensitive applications
  4. Offline Operation: Run AI features without internet connectivity

Ollama provides an excellent way to run models like Llama, Mistral, and others locally with a simple API that's compatible with our existing architecture.

Proposed Solution

Add a new provider interface for Ollama that implements our existing abstractions:

# Configuration
RubyLLM.configure do |config|
  config.ollama_host = "http://localhost:11434" # Default
end

# Usage remains identical to cloud models
chat = RubyLLM.chat(model: 'llama2')
chat.ask("What's the capital of France?")

# Or with embeddings
RubyLLM.embed("Ruby is a programmer's best friend", model: 'nomic-embed-text')

Technical Details

For those looking to help implement this, you'll need to:

  1. Create a new provider module in lib/ruby_llm/providers/ollama/
  2. Implement the core provider interface methods (a rough sketch follows this list):
    • complete - For chat functionality
    • embed - For embeddings
    • api_base - Returns the Ollama API endpoint
    • capabilities - Define model capabilities
  3. Handle the payload formatting differences between Ollama and OpenAI/Claude
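
Here's a rough sketch of what the provider module could look like, assuming Faraday for HTTP and Ollama's native /api/chat and /api/embeddings endpoints. Everything beyond the four method names listed above (the config accessor, helper names, return shapes) is illustrative, not RubyLLM's actual internal interface:

# lib/ruby_llm/providers/ollama/provider.rb -- illustrative sketch only
require "faraday"
require "json"

module RubyLLM
  module Providers
    module Ollama
      module_function

      # Ollama listens on localhost:11434 by default.
      def api_base
        RubyLLM.config.ollama_host || "http://localhost:11434"
      end

      def capabilities
        { chat: true, embeddings: true, streaming: true }
      end

      # Chat completion via Ollama's native /api/chat endpoint.
      # Returns just the response text here for brevity; the real provider
      # would build whatever message object the rest of the gem expects.
      def complete(messages, model:)
        response = connection.post("/api/chat") do |req|
          req.body = {
            model: model,
            messages: messages.map { |m| { role: m[:role], content: m[:content] } },
            stream: false
          }.to_json
        end
        JSON.parse(response.body).dig("message", "content")
      end

      # Embeddings via /api/embeddings.
      def embed(text, model:)
        response = connection.post("/api/embeddings") do |req|
          req.body = { model: model, prompt: text }.to_json
        end
        JSON.parse(response.body)["embedding"]
      end

      def connection
        Faraday.new(url: api_base, headers: { "Content-Type" => "application/json" })
      end
    end
  end
end

Worth noting: Ollama also exposes an OpenAI-compatible API under /v1 on the same port, so an alternative design is to reuse the existing OpenAI payload formatting and only swap the base URL, which would shrink step 3 considerably.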

The PR should include:

  • Provider implementation
  • Configuration option for Ollama host
  • Tests that can be run against a local Ollama instance (see the example sketch after this list)
  • Documentation updates
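
For the tests, something like the following could work as an opt-in local integration spec. This is a sketch only: it assumes RSpec, a running Ollama daemon with the llama2 and nomic-embed-text models pulled, and return-value accessors (.content, .vectors) that are assumptions here rather than confirmed API.

# spec/ruby_llm/providers/ollama_spec.rb -- illustrative sketch only
require "spec_helper"
require "net/http"

RSpec.describe "Ollama provider" do
  def ollama_running?
    Net::HTTP.get(URI("http://localhost:11434/api/tags"))
    true
  rescue StandardError
    false
  end

  before do
    skip "Ollama is not running locally" unless ollama_running?
  end

  it "answers a simple chat prompt" do
    chat = RubyLLM.chat(model: "llama2")
    # Assertion kept loose to avoid flakiness on free-form model output.
    expect(chat.ask("What's the capital of France?").content).to be_a(String)
  end

  it "returns an embedding vector" do
    embedding = RubyLLM.embed("Ruby is a programmer's best friend", model: "nomic-embed-text")
    expect(embedding.vectors).to all(be_a(Float))
  end
end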

Benefits

  • Cost efficiency: Eliminate API costs for many use cases
  • Privacy: Keep sensitive data local
  • Flexibility: Mix and match local and cloud models in the same codebase
  • Performance: Reduce latency for response-time sensitive applications

@Mizokuiam commented

Hey @crmne, this is a fantastic proposal! Local model support via Ollama would be a huge win for ruby_llm users. The benefits you've outlined – privacy, cost control, latency reduction, and offline capabilities – are all spot-on.

I really like the proposed configuration and usage pattern. Keeping the API consistent with existing cloud models minimizes the learning curve for users. The example snippet is clear and concise:

# Configuration
RubyLLM.configure do |config|
  config.ollama_host = "http://localhost:11434" # Default
end

# Usage remains identical to cloud models
chat = RubyLLM.chat(model: 'llama2', provider: :ollama) # Explicitly set provider
chat.ask("What's the capital of France?")

# Or with embeddings
RubyLLM.embed("Ruby is a programmer's best friend", model: 'nomic-embed-text', provider: :ollama) # Explicitly set provider

One minor suggestion: Adding an explicit provider: :ollama argument to chat and embed calls would provide more clarity and control, especially when using a mixed local/cloud setup. It also future-proofs the codebase for potential naming conflicts if cloud providers ever introduce similarly named models.

Regarding the technical details, your outline is solid. The separation into complete and embed methods makes sense, and handling payload formatting within the provider module is the right approach. Ensuring comprehensive tests against a local Ollama instance is crucial.

One thing to consider during implementation is error handling. Ollama might return different error codes and messages compared to cloud providers. The provider should gracefully handle these differences and translate them into consistent ruby_llm exceptions. We should also consider how to handle cases where the Ollama server is unavailable or returns unexpected responses.
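
To make that concrete, error translation inside the provider could look something like this (a sketch only: the exception class names are assumptions about ruby_llm's hierarchy, though Ollama error bodies are generally of the form {"error": "..."}):

# Illustrative sketch: mapping Ollama failures onto consistent exceptions
def parse_error(response)
  body = begin
    JSON.parse(response.body)
  rescue JSON::ParserError
    {}
  end
  message = body["error"] || "Ollama returned HTTP #{response.status}"

  case response.status
  when 404 then RubyLLM::ModelNotFoundError.new(message)  # model not pulled locally
  when 400 then RubyLLM::BadRequestError.new(message)
  when 500..599 then RubyLLM::ServerError.new(message)
  else RubyLLM::Error.new(message)
  end
end

# Connection refused usually means the daemon isn't running at ollama_host,
# which deserves a clear, actionable message rather than a raw socket error.
def handle_connection_error(error)
  raise RubyLLM::Error, "Could not reach Ollama at #{RubyLLM.config.ollama_host}: #{error.message}"
end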

crmne marked this as a duplicate of #24 on Mar 17, 2025