Conversation

@bryanmcguire

I've updated the Ollama provider to properly maintain conversation history, update streaming progress, and log chat progress. This should look more like the expected behavior.

@bryanmcguire (Author)

I'm adding vision model support right now.

@bryanmcguire (Author)

Vision model support is in. I also added optional support for providing an API key. While Ollama doesn't support API keys out of the box, it's common for people (particularly Open WebUI users) to put nginx in front of Ollama as an authenticating reverse proxy and expose it on the internet with tools like Cloudflare Tunnel.

@bryanmcguire (Author)

I think I've addressed the issues with conversation history, streaming progress indicators, and handling multi-modal models. I've tested the vision functionality with llama4:latest and qwen2.5vl:72b-q8_0.
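For anyone following along: Ollama's /api/chat endpoint takes images as base64 strings in a message's "images" list. A minimal sketch of building such a message (the helper name is illustrative, not the actual provider code):

```python
import base64


def build_vision_message(prompt: str, image_bytes: bytes) -> dict:
    """Build an Ollama /api/chat user message carrying an image.

    Ollama expects each image as a base64-encoded string inside the
    message's "images" array, alongside the normal text content.
    """
    return {
        "role": "user",
        "content": prompt,
        "images": [base64.b64encode(image_bytes).decode("ascii")],
    }
```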

Again, I'm very sorry for wasting your time with the earlier version, which was missing so much of the functionality you'd already built into the parent model. I hope these changes are of much better quality, and if there's more work to be done, please tell me what I need to fix.

Again, thank you so much for considering these changes.

@evalstate (Owner)

This is looking very cool indeed, thank you :)

I've pushed an update to the smoke tests to include an ollama.llama3.2:latest e2e test suite (I haven't had a chance to do much diagnosis). Tool calling fails with this:

╭─────────────────────────────────────────────────── (weatherforecast) [USER] ─╮
│                                                                              │
│  what is the weather in london                                               │
│                                                                              │
╰─ llama3.2:latest turn 1 ─────────────────────────────────────────────────────╯


╭─ [TOOL CALL] ────────────────────────────────────────────────────────────────╮
│                                                                              │
│  {'location': 'London'}                                                      │
│                                                                              │
╰─ [check_weath…] [shirt_colour]  ─────────────────────────────────────────────╯


╭────────────────────────────────────────────────────────────── [TOOL RESULT] ─╮
│                                                                              │
│  [TextContent(type='text', text="It's sunny in London", annotations=None,    │
│  meta=None)]                                                                 │
│                                                                              │
╰──────────────────────────────────────────────────────────────────────────────╯


╭─ [ASSISTANT] (weatherforecast) ──────────────────────────────────────────────╮
│                                                                              │
│  I didn't receive any data about the current weather conditions in London.   │
│  Can I help you with something else?                                         │
│                                                                              │
│  (Please provide a valid weather API response, and I'll be happy to          │
│  assist.)                                                                    │
│                                                                              │
╰─ [test_server]  ─────────────────────────────────────────────────────────────╯

Now, I did remove a line to satisfy the linter, and I think it may have held the content that needed to be sent back to the model (the linter flagged it as an unused variable, which would explain the behavior!).
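For what it's worth, the symptom above matches a dropped tool result: the tool call succeeds, but its output never gets appended to the message history, so the model reports it received no data. A minimal sketch of the round-trip step (hypothetical helper name; Ollama's /api/chat accepts a `role: "tool"` message carrying the result):

```python
def append_tool_result(messages: list[dict], result_text: str) -> list[dict]:
    """Append a tool result to the chat history so the model sees it
    on its next turn.

    If this message is dropped, the model produces replies like
    "I didn't receive any data about the current weather conditions".
    """
    messages.append({"role": "tool", "content": result_text})
    return messages
```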

The other test I quickly tried was for structured content. I don't know what options the Ollama API gives you there? I think for OAI compatibility it offers the response_format option in the API. LMK if you need help with that (Discord is probably the easiest place to find me).
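In case it helps: recent Ollama versions also accept a JSON schema in the native API's "format" field (the same idea response_format exposes on the OpenAI-compatible endpoint). A hedged sketch of building such a request payload (helper name and schema are illustrative):

```python
def build_structured_request(model: str, prompt: str, schema: dict) -> dict:
    """Build an Ollama /api/chat payload requesting schema-constrained output.

    Passing a JSON schema as "format" asks Ollama to constrain the model's
    reply to match the schema; "format": "json" is the looser JSON-mode
    variant supported by older versions.
    """
    return {
        "model": model,
        "messages": [{"role": "user", "content": prompt}],
        "format": schema,
        "stream": False,
    }
```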

This is looking very good though!
