AI connector

Local LLM

Local LLM bridges a self-hosted model runtime (Ollama, llama.cpp or vLLM) into llm:// routes, so flows and agents can request completions and list models with the same URI contract — and project them to MCP tools / A2A skills, keeping inference on-device.

planned ✓ verified

What it does

  • Summarise a flow step's output with a local model.
  • List installed models from an agent.
  • Keep inference on the LAN with no external API.

URI schemes

llm://

Requirements

  • a local LLM runtime (Ollama / llama.cpp / vLLM)