Anyone can build a demo
Running it is the job

I help growing companies ship software that holds up in production, and keep it running. AI included, when it earns its place

What I do

I own the technical decisions from first use case to production: what to build, what to buy, what it costs to run, and who operates it after launch. Strategy when you need it, engineering when you need that too.

  • Production AI
  • Agent reliability
  • Full-stack engineering
  • Fractional CTO / CAIO

Find the right use case

Not every problem needs AI, and not every AI needs an agent. I help you pick the few use cases that pay for themselves, and rule out the ones that won't.

Build the system

Agents, RAG, automation, and the systems around them. Built for reliability, observability, and a cost you can predict. Hosted, or on open models you control.

Build and operate, not just advise

Code, architecture reviews, production debugging, and staying on to keep it running. I work in your repository, not from the outside.

From signal
to system.

From a demo that works to a system you can leave running.

  1. Tune in

    Understand the real problem, the data, and the constraints before writing code or reaching for a model. Most failures are framing failures, not technical ones.

  2. Compose the architecture

    Choose the right stack and shape the system: services, data, and where AI actually earns its place. Design for reliability, cost, and the team that inherits it.

  3. Ship and harden

    Get it live, then make it dependable: tests, monitoring, guardrails, evals where AI is involved, and clear ownership.

  4. Adopt responsibly

    Governance sized to your company: security, data protection, EU AI Act readiness where it applies, decisions you can explain.

Readiness scorecard

A typical first engagement, scored across the four dimensions where production systems usually break. Most teams arrive strong on framing and weak on production readiness.

Problem framing 35%
7 / 10
Stack & architecture 30%
6 / 10
Production readiness 25%
4 / 10
Governance 10%
3 / 10

The environments change
The standard for shipping doesn't

Diagnose
Build
Run
Lead

A focused audit, one system shipped, ongoing reliability, or embedded leadership. Pick the depth you need.

  1. Readiness & Production Audit

    A focused review of your product, data, and stack: where AI fits, where it doesn't, and what it takes to run in production. You leave with a prioritized use-case shortlist, an honest build/buy/skip call, and a cost model.

    • Use-case mapping
    • Agent feasibility
    • Build vs buy
    • Risk & EU AI Act check
    • Cost model
    Strategy Fixed scope · 1-2 weeks
  2. Build Sprint

    One system shipped to production: the app, the infrastructure, and the AI inside it. Hosted or on open models you control. Not a prototype that stops at the demo.

    • Frontend to infra
    • Python & TypeScript
    • Agents & RAG
    • Open or hosted models
    • Monitoring & guardrails
    Full-Stack Project · 6-10 weeks
  3. Reliability Retainer

    The keep-it-running part, as a standing engagement: evals, monitoring, guardrails, drift checks, incident response, and EU AI Act re-assessment. For teams who already shipped and should not be operating it alone.

    • Evals & monitoring
    • Guardrails
    • Drift & cost control
    • Incident response
    Operations Monthly · ongoing
  4. Fractional CTO / CAIO

    Senior technical leadership without the full-time hire: architecture, delivery, code review, and AI strategy. Technical, not purely advisory.

    • Architecture & roadmap
    • Team & vendor selection
    • Delivery ownership
    • Board-level reporting
    Leadership Monthly retainer

Who I've worked with

About

I'm an engineer and technical lead based in Berlin. I care most about systems that have to keep working.

I've been building software for about 25 years. I started as a developer, writing apps and the real-time GPU and audio code where a dropped frame is a bug you can hear. A lot of it was for the stage and custom software clients. For years I worked as a creative developer for theater and performance, including with artists like Philip Glass.
In startups, that work grew into more than writing code. I took on architecture, started leading small teams, and over time ran engineering as a CTO, building plenty of infrastructure and custom systems along the way. These days I help companies get AI into production and keep it running. Usually the unglamorous parts: the infra, the plumbing, the custom pieces that don't come in a box. Often agents, often on open models they host themselves.

  • Fractional CTO / CAIO
  • AI Systems & Machine Learning
  • Agents, RAG & Automation
  • AI Reliability & Evaluation
  • Real-Time Graphics & GPU
  • Audio Engineering & DSP
  • Product Architecture
Stack, among others
Languages
Python, TypeScript, JavaScript, C, C++, C#, Rust, GLSL
AI / ML
PyTorch, LangChain, LlamaIndex, OpenAI & Anthropic APIs, open-weight models (Llama, Mistral, Qwen), vLLM, RAG pipelines, evals & guardrails, Hugging Face
Infrastructure
Docker, GitHub Actions, PostgreSQL, Redis, pgvector / Pinecone, Supabase, Hetzner
Scale & Ops
Nginx, Kubernetes, Terraform, Ansible, Cloudflare, Kafka, AWS, GCP, Grafana
Web & Real-time
React, Next.js, Astro, Three.js, WebGL2, WebAssembly, Web Audio API
Creative Tech
TouchDesigner, vvvv, Max/MSP, Unity, JUCE, OpenFrameworks
Location
Berlin, DE
52°31′ N · 13°23′ E
Open to
AI readiness & production audits
AI & agent build sprints
Reliability retainers
Fractional CTO / CAIO

Questions,
answered.

What does a fractional CTO / CAIO actually do?

Senior technical leadership on a part-time or project basis: architecture, roadmap, delivery ownership, team and vendor selection, and AI strategy. The role stays technical, not purely advisory. In practice that means code reviews, architecture decisions, and production debugging alongside the team, without the cost of a full-time hire.

What do you mean by "agent reliability"?

A demo agent and a production agent are different problems. In a demo it answers once, watched by a human. In production it acts on its own, repeatedly, against changing data. The failure modes are looping, hallucinated actions, runaway cost, and silent regressions. Reliability is the work that makes that safe: evals you can trust, guardrails, bounded autonomy, monitoring, and a clear rollback path. It is the part most teams skip.

Do you work with open models, or only hosted APIs like OpenAI?

Both. Hosted APIs are often the fastest start. But when data has to stay in your infrastructure, when per-token cost does not scale, or when you want to avoid vendor lock-in, I build on open models like Llama, Mistral, or Qwen, self-hosted on infrastructure you control. I run open models in production today, so this is not theoretical.

Do you only advise, or do you also write the code?

The engagement includes code. I work inside your repository, not from the outside, and I stay on to operate what we ship. Depending on the engagement, that spans implementation, architecture reviews, production debugging, and ongoing reliability.

How do you decide whether a company actually needs AI?

Not every problem needs AI, and not every AI needs an agent. I start with the use case: what problem are you solving, and does AI change the outcome enough to justify running it? Most AI failures are framing failures. The technology is rarely the hard part. The Readiness & Production Audit answers this before any build work begins.

What is the "EU AI Act check" in the readiness audit?

It is part of the Readiness & Production Audit. I assess whether a proposed AI system falls under the Act's scope, which risk category it lands in, and what compliance obligations that creates. It also covers data protection and governance, scaled to your size and intended use. For live systems, the Reliability Retainer re-assesses this as the rules and your usage change.

Which engagement should I start with?

Unsure whether AI is the right move? Start with the Readiness & Production Audit: fixed scope, a use-case shortlist, an honest build/buy/skip call, and a cost model. Already know what to build? A Build Sprint ships one system to production. Already shipped? The Reliability Retainer keeps it running.

Where are you based, and do you work remotely?

I am based in Berlin and have lived here since 2007. I work with startups and growing companies regardless of location. Email me at contact@larsullrich.de. I read every message personally and usually reply within a day.

Let's
Talk.

Tell me what you're building and where it's stuck. If I'm the right person to help, I'll say so. If I'm not, I'll tell you that too.

Currently open to AI readiness audits, agent and production builds, reliability retainers, and fractional CTO/CAIO engagements.