The Runtime-First Approach to AI: Why Architecture Order Matters

Andriy @ Oyyo · Mar 31, 2026

There are two ways to add AI to enterprise software.

The first way — the way most companies do it — is to take existing software and bolt AI onto it. You have a claims system, so you add an "AI extraction" module. You have a policy admin system, so you add a "smart search" feature. The AI lives outside the core application, calling it through APIs, processing data in separate pipelines, returning results to a different interface.

The second way is to build the runtime first.

What we mean by "runtime"

Before we wrote a single line of AI code, we built a platform where complex business applications could be deployed in seconds. Not configured over weeks. Not implemented over months. Deployed — fully functional, ready for real users — in seconds.

A user describes what they need: "I need a cyber liability intake form with clearance rules." The runtime generates the application — the forms, the data schemas, the validation rules, the email routing, the file organization — and it's live. Real brokers can email real submissions to a real address, and the data flows into a real workspace.

This sounds like a product feature. It's actually an architectural decision that changes everything about how AI works inside the system.

Why agents need a runtime

Here's the insight that shaped Oyyo: if a platform is fluid enough to deploy applications for human workers in seconds, it's the perfect environment for AI agents.

Think about what an AI agent needs to do useful work in insurance:

It needs to read documents (emails, PDFs, spreadsheets)
It needs to write structured data into forms
It needs to trigger workflows (clearance checks, premium calculations)
It needs to communicate with external parties (brokers, carriers)
It needs to file and organize its outputs

In a bolt-on architecture, each of these capabilities requires a separate API integration. The AI calls the document service, then calls the form service, then calls the workflow engine, then calls the email service. Each integration is a point of failure. Each handoff loses context.

In a runtime architecture, the agent doesn't call external services. It operates directly inside the same environment the human uses. It reads the email because the email arrived in the runtime. It fills the form because the form is a runtime object. It triggers the workflow because the workflow is a runtime process. No APIs. No integrations. No context loss.

What this means in practice

When an AI agent at Oyyo extracts data from a broker submission, it's not returning a JSON blob to an external system. It's filling in the same form fields that the underwriter sees. The underwriter can watch the fields populate in real-time. They can correct a field while the agent is still processing. The agent sees the correction and adapts — immediately, in the same session, not after a retraining cycle.

This is only possible because the agent and the human share the same runtime. They're both first-class citizens of the same platform. The agent has its own UI surface (showing confidence scores, source attribution, reasoning). The human has their UI surface (showing the business-friendly form, the Today board, the journal). But underneath, they're operating on the same data, in the same workspace, at the same time.

The 30-second deployment

People often ask how Oyyo can deploy a new AI-powered insurance product in 30 seconds. The answer isn't "we have a really fast AI model." The answer is that the runtime was built for instant deployment from day one. When you tell the AI Studio "build me a cyber liability rater," the runtime generates the application — and the agents that operate it — simultaneously. There's no separate "AI configuration" step because the AI was never separate from the runtime.

Compare this to a bolt-on approach: you'd need to configure the application (weeks), then configure the AI extraction rules (more weeks), then build the integration between them (more weeks), then test the integration (more weeks). By the time it's deployed, the business need has moved on.

Runtime-first, not AI-first

We deliberately chose to build the runtime before the AI. Not because AI wasn't important — it was always the vision. But because we understood that AI without a proper runtime is just a faster way to produce outputs that still need to be manually moved between disconnected systems.

The runtime is the operating system. The agents are the applications. And together, they create something that neither could achieve alone: a workspace where humans and AI collaborate naturally, without integrations, without handoffs, and without the constant context-switching that makes enterprise software feel like enterprise software.

That's the architecture behind the 8-minute quote. Not a faster model. A fundamentally different place for the model to live.

See Oyyo in action

Build your first AI-powered insurance app in under 5 minutes.

Start building