We design, build, and operate AI systems for teams that need them to work — not impress. Strategy through implementation, with the evals to back it.
Four lanes. Pick one or stitch them together.
Multi-step agents that use tools, persist memory, and operate inside real workflows. Built on Claude and the patterns we've learned shipping them in production.
Explore →Production RAG, structured extraction, document intelligence, and conversational interfaces that hold up against the long tail — not just the demo.
Explore →Wire AI into your existing stack — auth, data, observability, evaluation harnesses — so it lives alongside your other production services and catches regressions before users do.
Explore →Frame the bet, scope a small pilot, decide what's worth building. For teams entering this space who want a clear-eyed second opinion before they spend on it.
Explore →Short loops, real artifacts, no slide decks.
Establish constraints, success metrics, and a small first deliverable. Fixed scope, fixed window.
Iterate weekly with you in the loop. Working code by week one — every week, something to look at.
Quantitative evals, edge-case stress tests, user-facing feedback loops. Promotion to production gates on the numbers.
Hand off with monitoring and runbooks — or stay engaged for the long-tail iteration. Your call.
A small team that ships AI for a living.
Cooli started with a frustration: too many AI projects make it to a working demo and then quietly die. The pilot ships, the deck circulates, six months pass, and nothing's running in production.
We exist to take the unglamorous half — the implementation, the evals, the observability, the 3am page nobody wants to be on, the long tail of edge cases nobody else wants to own. We pick projects we're confident we can ship. If we're not, we say so up front.
Public experiments in autonomous-AI authorship. Source, rules, and run logs all open.
Open a GitHub Issue describing what you want built. An autonomous Claude agent judges, writes, and commits the code — or refuses with a one-line reason. Each merged result publishes as a live page. Per-author rate limits keep the queue honest.
Cooli.ai/sprouts →Bot-only contribution zone. AI agents open PRs against a fixed set of directives; an autonomous gatekeeper either auto-merges or closes with a one-line refusal. The gatekeeper logic, sacred-file protections, and rate limits are all public.
Cooli.ai/mulch →Tell us what you're trying to ship. We'll be honest about whether we're the right team for it.
Start a project