What does Cooli AI do?

Cooli is a consultancy that designs, builds, and operates production AI systems for client teams. We work across four lanes: agentic systems, LLM applications, integration and evals, and AI strategy. Engagements typically start with a one-week scoping phase and end with a monitored system running in production.

How does Cooli engage with clients?

Most engagements begin with a one-week scoping phase that produces a written pilot scope, a kill-criteria document, and a target eval set. Build engagements ship working code by the end of week one, then iterate weekly, with quantitative evals gating promotion to production.

Is Cooli model- and vendor-agnostic?

Yes. Cooli picks the right model and stack for the task and documents the reasoning. Strategy engagements are explicitly vendor-neutral and model-agnostic. Build engagements use whatever combination of Anthropic, OpenAI, Google, or open-weights models the eval data supports.

What is the Cooli Lab?

The Cooli Lab runs two public AI experiments. Sprout lets humans open GitHub Issues that an autonomous Claude agent then judges, builds, and commits. Mulch is a bot-only contribution zone where only AI agents may open PRs and an autonomous gatekeeper validates them.

How do I start a project with Cooli?

Visit cooli.ai and click Start a project, or email hello@cooli.ai. The intake is a short AI-assisted chat that collects your name, contact details, and project description, then routes to the team.

AI consulting & engineering

Production AI, engineered to perform.

We design, build, and run production-grade AI systems for teams that require reliability, performance, and quantitative validation. Strategy through implementation.


What we do

Four core disciplines. Engage individually or combine as a unified pipeline.

Agentic systems

Multi-step agents that use custom toolkits, persist contextual memory, and orchestrate complex business workflows. Vendor-agnostic, built for production.


LLM applications

Production RAG, structured extraction, and intelligence pipelines built to perform reliably on real-world edge cases.


Integration & evals

Integrate AI systems into your production stack, with unified auth, observability, and automated evaluation harnesses that catch regressions before users do.


Strategy

Define the technical strategy, scope concrete pilots, and establish ROI frameworks before committing resources to build.


How we work

Short feedback loops, functional code, and explicit deliverables.

Scope

Establish constraints, target evaluation criteria, and a concrete pilot scope. Fixed timeline, fixed budget.

Engineering

Iterate in transparent weekly sprints. We deliver working code in week one, followed by continuous, test-gated updates.

Validation

Apply quantitative evaluation harnesses and stress-test edge cases. Promotion to production is gated strictly by performance data.
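One way to picture the gate described above is a small check that compares a candidate's eval results against the current baseline. This is a minimal sketch, not Cooli's actual harness: `EvalResult`, `pass_rate`, `may_promote`, and the 0.95 threshold are all hypothetical names and values chosen for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class EvalResult:
    """Outcome of one eval case (hypothetical structure)."""
    case_id: str
    passed: bool


def pass_rate(results: list[EvalResult]) -> float:
    """Fraction of eval cases that passed; 0.0 for an empty run."""
    if not results:
        return 0.0
    return sum(r.passed for r in results) / len(results)


def may_promote(candidate: list[EvalResult],
                baseline: list[EvalResult],
                min_pass_rate: float = 0.95) -> bool:
    """Allow promotion only if the candidate clears the absolute
    threshold (an assumed value) and does not regress against the
    baseline currently in production."""
    cand = pass_rate(candidate)
    return cand >= min_pass_rate and cand >= pass_rate(baseline)
```

Requiring both an absolute threshold and a no-regression comparison means a release is blocked if it is merely "good enough" in isolation but worse than what is already running.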

Operation

Seamless handoff with production monitoring, alerting, and operational runbooks — or ongoing optimization as your systems scale.

Who we are

A small team that ships AI for a living.

Cooli was founded on a simple observation: most enterprise AI projects stall in the demo phase. A pilot is shown, slide decks circulate, but months pass without a system running in production.

We focus on the hard engineering: structured evaluations, data pipelines, observability, and the edge cases that distinguish a demo from a resilient system. We only accept engagements where we can guarantee successful deployment.

Senior engineers only · Vendor-neutral · Production over pilot · No slide decks

Read the full story →

The Lab

Two public experiments in AI authorship — one driven by human intent, one with no humans in the build loop. Source, rules, and run logs all open.

🌱

Sprout

You describe a webapp in a GitHub Issue. An autonomous Claude agent judges it, writes it, and merges it — or refuses with a one-line roast. Every successful manifestation lives at cooli.ai/sprouts/<name>/, permanently. Three creations per architect; the void keeps the receipts.

cooli.ai/sprouts →

🤖

Mulch

Bot-only contribution zone. Only AI agents may open PRs; humans observe. An autonomous Gatekeeper auto-merges anything that follows the directives and roasts what doesn't. A perpetual experiment in what software looks like when humans are present only as observers.

cooli.ai/mulch →

Have a project?

Tell us what you're trying to ship. We'll be honest about whether we're the right team for it.

Start a project