Simulation for improvement

If you have been following along with my robot car guided by AI project you’ll know that I got it working with Gemini. I then switched to using Azure AI Foundry. Finally, I changed the code so that the car made most of the decisions locally while the AI just made the major navigation decisions.

Each time navigation did improve, but I would have to observe the movements and tell the AI what issues I’d seen, and then the AI would go off and improve the code. I’d then test and update again, round and round in a loop. Each time there was a small incremental change in navigation ability, but I still faced edge cases the existing algorithm struggled with.

I could have continued this update, observe, update loop ad infinitum, making constant incremental improvements. However, when I stepped back and thought about how AI systems improve I realised that it was via repetitive learning in a simulated, rather than real, environment.

I therefore asked GitHub Copilot to simulate the robot car’s navigation algorithm and then test it via simulation. Any improvements learned via the simulation should then be applied back to the robot car’s code. I then unleashed GitHub Copilot to complete this task, without asking for input, until it reached a near perfect navigation result in the simulator.
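
To make the idea concrete, here is a minimal sketch of that simulate, score, iterate loop. It is not the simulator Copilot produced. It is a toy model I am assuming purely for illustration: the car drives toward a wall in fixed steps, and the only thing being tuned is the distance at which it decides to stop. The names `runTrial`, `scoreThreshold` and `bestThreshold`, and the cost weights, are all hypothetical.

```cpp
#include <vector>

// Hypothetical toy model, not taken from llm-nav-max.ino.
struct TrialResult {
    bool crashed;
    int finalGapCm;  // distance left to the wall when the car stopped
};

// One simulated approach. The car moves stepCm per tick and only checks the
// gap once per tick, so a threshold smaller than the step can overshoot.
TrialResult runTrial(int startGapCm, int stepCm, int stopThresholdCm) {
    int gap = startGapCm;
    while (gap > stopThresholdCm) {
        gap -= stepCm;
        if (gap <= 0) return {true, 0};  // hit the wall
    }
    return {false, gap};
}

// Score one threshold over many start distances. A crash is heavily
// penalised; stopping far short of the wall costs a little (assumed weights).
int scoreThreshold(int stopThresholdCm, int stepCm) {
    int cost = 0;
    for (int start = 30; start <= 200; start += 7) {
        TrialResult r = runTrial(start, stepCm, stopThresholdCm);
        cost += r.crashed ? 1000 : r.finalGapCm;
    }
    return cost;
}

// The "iterate in simulation" step: try every candidate, keep the cheapest.
int bestThreshold(const std::vector<int>& candidates, int stepCm) {
    int best = candidates.front();
    int bestCost = scoreThreshold(best, stepCm);
    for (int c : candidates) {
        int cost = scoreThreshold(c, stepCm);
        if (cost < bestCost) {
            bestCost = cost;
            best = c;
        }
    }
    return best;
}
```

The point is the shape of the loop rather than the numbers: hundreds of trials cost nothing in a simulator, whereas each real-world trial meant me watching the car and describing what went wrong.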

The result is this code:

https://github.com/directorcia/Azure/blob/master/Iot/LLM/llm-nav-max.ino

which I will say, although far from ‘perfect’, achieved much better results than the incremental ones I was obtaining simply by observational feedback.

The success I’ve had here using a simulation to test the algorithm, and then iterating to improve the code, has made me think about what else I could apply this ‘simulation’ process to in other AI work I am performing.

Interesting.

Having the AI make better decisions

After some AI prompting it was suggested that my robot car navigation would improve if the car did more itself with local routines and only used the AI when it could not decide. So I plugged away and ended up with this code:

https://github.com/directorcia/Azure/blob/master/Iot/LLM/llm-nav-max.ino

you will see:

An ESP32 robot car drives forward until its ultrasonic sensor (mounted on a pan servo) detects an obstacle. On each hazard event a two-tier decision pipeline fires:

1. LOCAL PLANNER – scores three candidate maneuvers (strafe, 90° turn, and backup+turn) using live L/F/R distance readings, front-clearance trend, and oscillation history. If the result is “obvious” (high confidence, low risk, no repeated-trap condition) it is executed immediately without any network round-trip.

2. FOUNDRY ARBITER – when the local answer is ambiguous, or a repeated-trap is detected, a structured prompt is posted to an Azure AI Foundry Responses API endpoint. The LLM names one candidate plan and returns a confidence score. The response is validated and sanitized; any failure falls back to the local plan.

After every maneuver the robot re-scans, records the front-clearance delta, and feeds plan quality (improved / worsened) back into the next decision cycle.
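
As a rough illustration of the first tier, here is a minimal sketch of scoring the three candidates and deciding whether the answer is “obvious” enough to skip the network round-trip. The scoring formulas, the `kObviousMargin` value and every name below are my own assumptions for the example and are not copied from llm-nav-max.ino.

```cpp
#include <algorithm>

enum class Plan { Strafe = 0, Turn90 = 1, BackupTurn = 2 };

struct Readings {
    int leftCm;
    int frontCm;
    int rightCm;
    int frontTrendCm;   // change in front clearance since the last scan
    bool repeatedTrap;  // true when recent maneuvers keep oscillating
};

struct Decision {
    Plan plan;
    bool askArbiter;  // true when the local answer is not "obvious"
};

// Assumed threshold: the winner must beat the runner-up by this much.
constexpr int kObviousMargin = 15;

Decision planLocally(const Readings& r) {
    const int side = std::max(r.leftCm, r.rightCm);
    // Illustrative scores only, indexed in the same order as Plan.
    const int score[3] = {
        side + r.frontTrendCm,   // Strafe: favours open space beside the car
        (side + r.frontCm) / 2,  // Turn90: needs room to the side and ahead
        60 - r.frontCm           // BackupTurn: better the closer the obstacle
    };

    int best = 0;
    for (int i = 1; i < 3; ++i) {
        if (score[i] > score[best]) best = i;
    }
    int runnerUp = (best == 0) ? 1 : 0;
    for (int i = 0; i < 3; ++i) {
        if (i != best && score[i] > score[runnerUp]) runnerUp = i;
    }

    const int margin = score[best] - score[runnerUp];
    const bool obvious = margin >= kObviousMargin && !r.repeatedTrap;
    return {static_cast<Plan>(best), !obvious};
}
```

With plenty of room on one side the sketch picks a strafe and acts immediately. When all three readings are similar, or the trap flag is set, it still returns its best local guess but flags that the arbiter should be consulted.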

After a quick test, navigation does seem more intelligent, although it seems to make decisions a long way away and tends to get lost in wide open spaces with objects far away. However, I think that if I invested more time I could improve all that.

I have an additional idea on how I might improve the navigation in an upcoming article, but for now I think I’m pretty much done with the concept of navigation.

Connecting the robot car to Azure AI Foundry

If you have been following along you will know that I’ve connected a robot car to both a local LLM and cloud based Gemini. The last iteration is here:

https://blog.ciaopslabs.com/2026/07/18/controlling-a-robot-car-with-ai/

I decided that I should connect the car to Azure AI Foundry because there are so many more models available as well as everything else that comes with Azure.

So I set up a simple Foundry project:

Step 1: Create an Azure AI Foundry Project

Browse to Azure AI Foundry:
Azure AI Foundry
Sign in with your Azure account.
Select:
- Create Project
- Project Name:
```
RobotNavigation
```
Create or select:
- Azure Subscription
- Resource Group
- Azure AI Services Resource
Wait for deployment to complete.

Step 2: Deploy a Model

Within your Foundry project:

Open:
```
Model Catalog
```
Deploy one of:
- GPT-5-mini
- GPT-5.1-mini
- GPT-4.1-mini

For a robot car, GPT-5-mini is usually sufficient and inexpensive.

Step 3: Obtain Connection Details

From Foundry copy:

Endpoint URL

API Key

Model Name

```
#define FOUNDRY_RESPONSES_URL \
  "https://robot-navigation-resource.services.ai.azure.com/openai/v1/responses"

#define FOUNDRY_MODEL "gpt-5-mini"
```

The recommendation was to use the gpt-5-mini model, so I plugged it into the existing code, made a few improvements and ended up with this:

https://github.com/directorcia/Azure/blob/master/Iot/LLM/llm-foundry.ino

My observation is that navigation is generally better, but the delays are longer when it has to ‘think’ (aka go to the LLM). This has to do with the size of the model, basically gpt-5-mini vs gemini3-flash.

So, the lesson here is I need the smallest possible model for the job. For now I’ll stick with gpt-5-mini.

More research indicates that I should probably offload more processing to the local device and only use the LLM at a much higher level. A better plan seems to be: instead of asking the LLM to invent a manoeuvre plan, ask it to choose from local candidate plans you have already created.
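
Here is a minimal sketch of what “choose from local candidates” could look like on the device side: build a prompt that lists the plans, then accept the reply only if it names one of them. The plan names, the prompt wording and the functions `buildPrompt` and `choosePlan` are hypothetical, and the HTTP call to Foundry is deliberately left out.

```cpp
#include <cctype>
#include <string>
#include <vector>

// Hypothetical prompt builder: the model is only asked to pick a name.
std::string buildPrompt(const std::vector<std::string>& candidates) {
    std::string prompt = "Choose exactly one plan and reply with its name only: ";
    for (std::size_t i = 0; i < candidates.size(); ++i) {
        if (i > 0) prompt += ", ";
        prompt += candidates[i];
    }
    return prompt;
}

// Accept the reply only if, after stripping whitespace and upper-casing, it
// matches a candidate exactly. Anything else falls back to the local plan.
std::string choosePlan(const std::vector<std::string>& candidates,
                       const std::string& llmReply,
                       const std::string& localFallback) {
    std::string cleaned;
    for (unsigned char ch : llmReply) {
        if (!std::isspace(ch)) cleaned += static_cast<char>(std::toupper(ch));
    }
    for (const std::string& c : candidates) {
        if (c == cleaned) return c;
    }
    return localFallback;
}
```

Because the model can only ever select something the car already knows how to do, a slow, empty or strange reply costs time but cannot produce an unexpected manoeuvre.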

So let me go and try that now.

Controlling a robot car with AI

After having AI show my fortune, the next project was having AI navigate my robot car.

To keep things simple I constrained movement to a 5 x 5 grid (X = 0 – 4 and Y = 0 – 4) and the car could only move N, S, E or W one cell at a time. To get the algorithm right and test it with AI before actually applying it to the car, I simulated the result on the OLED screen I had previously configured.
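
The grid rule itself is small enough to sketch. This is a minimal example, assuming that N increases Y and E increases X (the orientation is my assumption, as are the names `Cell` and `step`); it is not lifted from llm-cellmove.ino.

```cpp
// A position on the 5 x 5 grid, with x and y each in the range 0 to 4.
struct Cell {
    int x;
    int y;
};

// Move one cell N, S, E or W. A move that would leave the grid, or an
// unknown direction letter, is refused and the position is left unchanged.
bool step(Cell& pos, char dir) {
    Cell next = pos;
    switch (dir) {
        case 'N': next.y += 1; break;
        case 'S': next.y -= 1; break;
        case 'E': next.x += 1; break;
        case 'W': next.x -= 1; break;
        default: return false;
    }
    if (next.x < 0 || next.x > 4 || next.y < 0 || next.y > 4) return false;
    pos = next;
    return true;
}
```

Refusing the move, rather than trusting whatever direction comes back from the AI, is what lets the same check work in the OLED simulation and on the real car.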

With that all working I upgraded the code to run with the robot car and you’ll find it here:

https://github.com/directorcia/Azure/blob/master/Iot/LLM/llm-cellmove.ino

The main issue I found was more mechanical, with the robot car wheels dragging and not being very precise, so the car would easily wander off in different directions. This has more to do with the quality of the motors and wheels as well as the friction encountered when the wheels start moving. However, aside from that the test worked successfully and the car cycled up the grid and then back down.

Next, I wanted the AI to actually assist with the navigation of the car. I went through plenty of iterations with this. The most important change is that I moved from using a local AI to using Gemini via API calls. The main reason for this was simply speed. As the prompts became larger the local AI model struggled to return the results to the car quickly enough for effective navigation. I had also wanted to integrate a large cloud-based LLM, and this was the opportunity, so I hooked up Gemini.

The robot car also has an ultrasonic sensor connected to a motor at the front so I could sweep it left and right for better object detection. However, initially I kept it simple and all on the car by just using the ultrasonic sensor to detect hazards and try to manoeuvre around them. The code for that is here:

https://github.com/directorcia/Azure/blob/master/Iot/LLM/llm-sweep.ino

I then upgraded the code to integrate the LLM into the navigation process by making decisions on which direction to turn. That code is here:

https://github.com/directorcia/Azure/blob/master/Iot/LLM/llm-sweep.ino

One issue I ran into, that wasted a lot of my time and was totally my own fault, was when I started having problems with the wheels moving the car forward. I blamed the code but in fact, again, it was a hardware issue: the battery charge had become too low to drive the wheels acceptably. It is interesting how quickly the car now drains power when fully running.

With the power issues resolved I upgraded the code a number of times to allow the LLM a much higher level of navigation control. You’ll find the final result here:

https://github.com/directorcia/Azure/blob/master/Iot/LLM/llm-assist-nav.ino

The end result of all these experiments is that I have learned that in the full configuration the car now burns a lot of power as it moves, turns the sensor, communicates over Wi-Fi and more. All the changes I made to the code make the car slightly less likely to crash into objects on the floor, but a lot more thought needs to go into ‘crash free’ navigation. The obvious improvement is to add more sensors to provide the LLM with more information to make better decisions. I also found that the wheels on the car are not precise enough and don’t really provide the best grip. This means they tend to be slow to engage, and the lag causes the car to veer.

I think all of these can be solved iteratively over time and I am confident that I can get to a situation that allows the robot car to move pretty much crash free around the floor just like a robot vacuum can already. However, the time required is probably not something that I’m willing to invest in just now to get a little incrementally better. I’m happy that my ‘proof of concept’ when it comes to navigation with LLMs works. I think it is time to move onto the next project.

Having the device show my fortune

After getting the ESP32 talking to the local LLM the next stage was to do something more than just flashing an LED. I decided that I’d use the LLM to produce a ‘fortune’ for me and then display that on an OLED screen I’d connect to the ESP32.

The OLED screen in question was this White I2C OLED display (SSD1306).

To use this OLED you need the Adafruit_SSD1306 library.

Here is the prompt being sent to the local LLM:

You are a mystical fortune teller. Give one short fortune. Maximum 12 words. No introduction. No quotes.

The result from the LLM is then displayed on the OLED screen which is connected to the ESP32-C3-DevKitM-1 via GPIO6 and GPIO7 acting as SDA and SCL communication ports. I also left the external LED on GPIO4, from the last project, as well to aid troubleshooting.
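
One practical detail is fitting the fortune onto the screen. Here is a minimal word-wrap sketch, assuming a 128 pixel wide panel and the default 6 x 8 pixel font at text size 1, which gives 21 characters per line. The panel width, the font size and the function name `wrapForOled` are my assumptions; the linked sketch may lay the text out differently.

```cpp
#include <sstream>
#include <string>
#include <vector>

// Split text into lines of at most `width` characters, breaking only at
// spaces. A single word longer than `width` is left whole on its own line.
std::vector<std::string> wrapForOled(const std::string& text,
                                     std::size_t width = 21) {
    std::vector<std::string> lines;
    std::istringstream words(text);
    std::string word;
    std::string line;
    while (words >> word) {
        // Start a new line if adding a space plus this word would overflow.
        if (!line.empty() && line.size() + 1 + word.size() > width) {
            lines.push_back(line);
            line.clear();
        }
        if (!line.empty()) line += ' ';
        line += word;
    }
    if (!line.empty()) lines.push_back(line);
    return lines;
}
```

Each returned line could then be printed on its own row of the display. Capping the prompt at 12 words helps keep the result within the handful of rows available.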

The code is here:

https://github.com/directorcia/Azure/blob/master/Iot/LLM/llm-fortune.ino

and the results look like:

Video URL = ESP32 displaying results from local LLM

Getting device talking to LLM

With the LLM now up and working on a separate device on my LAN, the next step is to test it remotely to ensure that it works. For this I used the following simple PowerShell on a remote machine

which you will find here:

https://github.com/directorcia/Azure/blob/master/Iot/LLM/echo-ping.ps1

Next, I ran the following PowerShell script:

https://github.com/directorcia/Azure/blob/master/Iot/LLM/echo-test.ps1

which simply runs a standard prompt of:

“Reply with exactly: Hello from Ollama”

and then ensures that I get that reply back from the remote LLM server. This means I have communications to the actual server as well as the LLM.

With all the remote communications confirmed, the next step was to get a device talking to the LLM. For this I had an ESP32-C3-DevKitM-1 hanging around.

The main benefit of this device is that it has inbuilt Wi-Fi. I connected up an LED and resistor to GPIO4 like so:

I then used this code in the device:

https://github.com/directorcia/Azure/blob/master/Iot/LLM/llm-flash.ino

to prompt the LLM for a number of flashes from 1 – 4, which the device would then perform on the LED. I could also monitor the progress using the terminal, which would look like:

Connecting to Wi-Fi…..
Wi-Fi connected. IP: 192.168.1.42
Requested flash count: 3
Sending request to Ollama (attempt 1)…
HTTP status: 200
Ollama reply: {"flash":3}

Initially I found that the LLM was returning the same number of flashes, so I needed to adjust the prompting to get some variation. The good news is that I got it all working and the resultant code is above.
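
Since the device acts directly on what the model returns, it is worth being defensive about the reply. Below is a minimal sketch that pulls the number out of a reply shaped like the `{"flash":3}` line in the log above and rejects anything outside 1 – 4. The hand-rolled parsing, the fallback value and the name `parseFlashCount` are assumptions for illustration; llm-flash.ino may well use a JSON library instead.

```cpp
#include <cctype>
#include <string>

// Extract the integer that follows the "flash" key. Returns `fallback` when
// the key is missing, the value is not a plain number, or it is outside 1-4.
int parseFlashCount(const std::string& reply, int fallback = 1) {
    const std::string key = "\"flash\"";
    std::size_t pos = reply.find(key);
    if (pos == std::string::npos) return fallback;

    pos = reply.find(':', pos + key.size());
    if (pos == std::string::npos) return fallback;
    ++pos;

    // Skip any whitespace between the colon and the value.
    while (pos < reply.size() &&
           std::isspace(static_cast<unsigned char>(reply[pos]))) {
        ++pos;
    }

    int value = 0;
    bool sawDigit = false;
    while (pos < reply.size() &&
           std::isdigit(static_cast<unsigned char>(reply[pos]))) {
        value = value * 10 + (reply[pos] - '0');
        sawDigit = true;
        if (value > 4) return fallback;  // out of range; also avoids overflow
        ++pos;
    }
    if (!sawDigit || value < 1) return fallback;
    return value;
}
```

If the model replies with chatty text instead of the expected structure, the LED simply flashes the fallback count rather than doing something undefined.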

So now I have successfully gotten a device talking to a local LLM. I’ll be expanding on this in upcoming articles, but I’m very happy that I have this basic configuration all working now.

The local AI system so far

You can describe it like this:

The user connects via browser to the host machine
The request hits Open WebUI (running in Docker)
Open WebUI calls the Ollama API on port 11434
Ollama processes the request using local models
The response flows back to the browser

Simple mental model

Browser → WebUI → Ollama → Model → Response

Set up Summary

Getting Ollama running on Windows

– How to install Linux on Windows with WSL

Docker and Open WebUI deployment on Windows

Why Docker was needed

Why Open WebUI is required

Why Open WebUI is required for local AI

I am creating a local AI machine that runs Ollama. Steps completed so far:

Getting Ollama running on Windows

Docker and Open WebUI deployment on Windows

Why Docker was needed

Open WebUI is a critical design point in what you built, and understanding it properly will change how you think about deploying AI systems.

I’ll explain it clearly in the context of your setup — not generic theory.

The short answer

You needed to run Open WebUI on the same machine as Ollama because:

The machine running Ollama is the one doing the actual AI processing, and Open WebUI must run close to it to reliably act as the interface.

Everything else should just connect to that machine via a browser, not try to run AI locally.

What is actually happening in your setup

You now have three roles:

1. The host machine (Ollama PC)

This is your AI server

It runs:

Ollama → the AI engine (models, GPU/CPU work)
Open WebUI → the web interface
Docker → runtime for WebUI

2. The guest machines

These are clients only

They:

do not run models
do not run WebUI
simply connect via browser

3. The browser

The browser is the user interface, not the AI itself

The critical dependency chain

Your system works like this:

Browser → WebUI → Ollama → Model → Response

Why WebUI MUST be on the Ollama machine

1. WebUI is a frontend, not an AI engine

Open WebUI does not run models.

It:

sends prompts
receives responses
renders a chat interface

If you run WebUI on another machine:

it has no local models
it must connect over the network to Ollama
you introduce unnecessary complexity

2. Local communication is simple and reliable

When WebUI runs on the same machine:

Connection to Ollama is:
```
http://localhost:11434
```
(internal, fast, reliable)

If WebUI runs on another machine:

You must use:
```
http://<host-ip>:11434
```
Requires:
- firewall rules
- correct binding
- network stability
- security considerations

Why this matters

You already saw issues like:

connection failures
unhealthy containers
network access problems

Running WebUI locally eliminates most of those.

3. Performance (this is often overlooked)

AI workloads are:

CPU/GPU intensive
heavy in memory usage
sensitive to latency

If WebUI is remote:

every prompt travels across the network
every token response comes back across the network

While not huge, it adds:

latency
potential instability

Local WebUI gives:

near-zero latency between UI and model
predictable performance
fewer moving parts

4. You centralise compute instead of duplicating it

Your goal was:

“Run AI from another machine”

That does NOT mean:

“Run AI on every machine”

Correct design:

One machine → does compute
Many machines → access it

If you ran WebUI remotely

You would either:

duplicate the WebUI install everywhere
or still point it back to the same Ollama server

Which gives you:

more complexity
no real benefit

5. Security and control

When everything runs on the Ollama machine:

you control:
- firewall
- ports
- access
you know exactly where the AI is running

If WebUI is spread across multiple machines:

more endpoints
more risk
harder to manage access

6. Docker networking works naturally this way

This was a key issue you hit:

You had to use:

host.docker.internal

This only works cleanly because:

WebUI container = local to host
Ollama = local to host

If WebUI was on another machine:

Docker networking helps you less
you rely purely on LAN configuration
more points of failure

7. This is how real systems are designed

What you built mirrors real-world architecture:

Client → Web App → API → Compute

In your case:

Browser → WebUI → Ollama → Model

This is the same pattern used by:

ChatGPT (frontend separate from engine)
Azure AI services
enterprise AI platforms

8. Scaling (this is the big future reason)

Running WebUI on the Ollama host allows you to:

add components easily:
- vector databases
- RAG pipelines
- APIs
keep all services close together

If WebUI is remote:

scaling becomes messy
service orchestration becomes harder

What would happen if you didn’t do this

Let’s be very direct.

If you ran WebUI on a different machine:

You would need to:

configure Ollama network access properly
open firewall ports correctly
deal with API connectivity issues
troubleshoot latency and failures
manage multiple installation environments

And still end up with:

no performance gain
more complexity
harder troubleshooting

The real reason (simplified)

You ran WebUI on the host machine because:

The AI models must run locally on one machine, and WebUI must sit as close as possible to that engine to provide a reliable interface.

The correct mental model

Do not think:

“I am running WebUI on a machine”

Think:

“I am exposing an AI service from one machine to the rest of my network”

Final takeaway

Running WebUI on the Ollama machine:

keeps the system simple
reduces network complexity
improves reliability
centralises compute
aligns with real architecture patterns

One practical recommendation

Once you’re comfortable, the next step is:

add authentication back into WebUI
restrict access to trusted devices

This turns your setup from a lab into a proper internal AI service.

Ollama + Open WebUI deployment

I am creating a local AI machine that runs Ollama. Steps completed so far:

Getting Ollama running on Windows

Docker and Open WebUI deployment on Windows

In this case, Docker wasn’t just “nice to have”, it solved several very specific problems in your setup.

WHY DOCKER WAS NEEDED IN THIS SPECIFIC SETUP

1. Open WebUI is not designed as a simple Windows app

Open WebUI is built as a:

Python web application
With multiple backend dependencies
Designed primarily for Linux-style environments

Without Docker, you would have needed to:

Install Python (correct version)
Install pip dependencies
Handle compatibility issues
Configure environment variables manually
Run a web server process

This is fragile and error-prone on Windows.

What Docker did here

When you ran:

docker run ghcr.io/open-webui/open-webui:main

Docker:

Pulled a prebuilt environment
Contained everything (Python, libraries, configs)
Started the service automatically

So instead of building the environment, you consumed a known-good one

2. You needed a clean separation between components

Your architecture now looks like:

Ollama (host)
Open WebUI (container)
Browser (client)

Without Docker:

WebUI and Ollama would run on the same OS
Dependencies could conflict
Debugging becomes harder

What Docker did here

Docker created an isolated runtime for WebUI:

WebUI does not interfere with Windows
WebUI does not interfere with Ollama
You can remove/rebuild it safely

This is especially important as you expand (RAG, APIs, agents)

3. You needed predictable networking

One of the key issues you hit was:

WebUI couldn’t talk to Ollama
Needed host.docker.internal

Why Docker matters here

Docker introduces a controlled networking model

Instead of:

Random ports
OS-level binding issues

You get:

Defined port mapping: 3000 → 8080
Clear host access: host.docker.internal

This makes multi-service communication predictable

4. You needed rapid rebuild and recovery

You hit several issues:

Wrong container name
Auth issues
Unhealthy container
Image typo

Without Docker:

You would have had to:

uninstall software
clean configuration
reinstall dependencies

What Docker enabled

You fixed everything with:

docker rm -f open-webui

docker run …

Full rebuild in seconds, with zero cleanup

This is one of Docker’s biggest practical advantages.

5. You needed a deployable, repeatable system

Right now, what you built can be:

Recreated on another machine
Documented
Given to another tech
Deployed to a client

Without Docker

You would need:

A long install guide
Environment matching
Manual steps that can fail

With Docker

You only need:

docker run …

That’s a deployment standard, not just a setup

6. You needed multi-device access

Your requirement was:

“Access Ollama from other machines”

Docker enabled:

Running WebUI as a central web service
Exposing it safely via a single port
Allowing browser-based access

Without Docker

You would:

struggle to host a local web service reliably
deal with Windows service management
manually run servers

7. You needed extensibility (this is the big one)

Right now you have:

Ollama
Open WebUI

But next steps are:

RAG systems
Vector databases
APIs
Agent frameworks

Docker becomes critical here

Because it allows you to:

add services without breaking others
isolate each component
scale architecture cleanly

Example future stack:

WebUI → Ollama → Vector DB → API → Tools

This only scales cleanly with containerisation

8. Docker solved Windows-specific limitations

This part is often missed.

You are running on Windows.

Many modern AI tools assume:

Linux environment
Unix-style dependencies
specific runtime behaviour

Docker (via WSL2) solved this

Docker gave you:

a Linux-compatible runtime
consistent behaviour
no need to rewrite apps for Windows

Without Docker + WSL:

many tools simply would not run properly

THE REAL REASON DOCKER WAS REQUIRED HERE

In your exact setup, Docker was needed because:

You were running a modern, Linux-native, dependency-heavy application (Open WebUI) on a Windows machine and needed it to connect reliably to another local service (Ollama) while being accessible across the network.

Docker solved:

environment compatibility
dependency management
service isolation
networking consistency
deployment repeatability

WHAT WOULD HAVE HAPPENED WITHOUT DOCKER

Let’s be blunt:

You likely would have spent hours:

installing dependencies
fixing Python errors
resolving compatibility issues
debugging network bindings

And still ended up with:

a fragile setup
difficult rebuild process
non-repeatable deployment

PRACTICAL SUMMARY

Docker was not used here because it is “modern”.

It was used because it enabled:

Running Open WebUI without manual setup
Isolating it from your system
Connecting it reliably to Ollama
Recovering quickly when things break
Making the system repeatable and deployable
Supporting future expansion

FINAL TAKEAWAY

In this scenario:

Docker transformed a complex, fragile, multi-component system into a predictable, repeatable, and easily recoverable deployment.

That is exactly why it exists.

Ollama + Docker + Open WebUI Network Deployment Guide

Overview

This guide provides a complete step-by-step process to configure Ollama, Docker, WSL2, and Open WebUI for network access across multiple devices.

Part 1 – Host Preparation

Step 1 – Confirm Virtualisation

Open Task Manager → Performance → CPU

Confirm: Virtualisation = Enabled

Step 2 – Install WSL2

wsl --install
wsl --set-default-version 2

Reboot the machine.

Step 3 – Verify WSL

wsl -l -v

Expected (the docker-desktop entries only appear once Docker Desktop has been installed in Part 2):

docker-desktop       Running   Version 2
docker-desktop-data  Running   Version 2

Part 2 – Install Docker

Step 4 – Install Docker

winget install -e --id Docker.DockerDesktop

Step 5 – Start Docker

Open Docker Desktop and wait for “Docker is running”.

Step 6 – Test Docker

docker run hello-world

Part 3 – Configure Ollama for Network Access

Step 7 – Configure Listener

Add environment variable:

OLLAMA_HOST=0.0.0.0:11434

Restart Ollama.

Step 8 – Test Local Access

curl http://localhost:11434/api/tags

Step 9 – Test LAN Access

ipconfig
curl http://YOUR-IP:11434/api/tags

Part 4 – Deploy Open WebUI

Step 10 – Run Container

docker run -d ^
  -p 3000:8080 ^
  -e OLLAMA_BASE_URL=http://host.docker.internal:11434 ^
  -e WEBUI_AUTH=False ^
  --name open-webui ^
  ghcr.io/open-webui/open-webui:main

Step 11 – Verify Container

docker ps

Step 12 – Access WebUI

http://localhost:3000

Part 5 – Configure Firewall

Step 13 – Open Ports

New-NetFirewallRule -DisplayName "Open WebUI" -Direction Inbound -Protocol TCP -LocalPort 3000 -Action Allow
New-NetFirewallRule -DisplayName "Ollama API" -Direction Inbound -Protocol TCP -LocalPort 11434 -Action Allow

Step 14 – Test from Another Device

http://OLLAMA-PC-IP:3000

Part 6 – Load Models

Step 15 – Pull Model

ollama pull llama3

Step 16 – Test Chat

Use WebUI to send a test prompt.

Part 7 – Enable Persistence

Step 17 – Create docker-compose.yml

version: '3.8'

services:
  open-webui:
    image: ghcr.io/open-webui/open-webui:main
    container_name: open-webui
    ports:
      - "3000:8080"
    environment:
      - OLLAMA_BASE_URL=http://host.docker.internal:11434
      - WEBUI_AUTH=False
    volumes:
      - open-webui-data:/app/backend/data
    restart: always

volumes:
  open-webui-data:

Step 18 – Deploy

docker compose up -d

Troubleshooting

UI Not Loading

docker ps

Container Unhealthy

wsl --shutdown

Auth Blocking Access

WEBUI_AUTH=False

Cannot Access from Network

Open firewall ports 3000 and 11434

Use correct IP address