← all guides
00Pillar guide

Process automation with AI: frameworks, tools, and what to ship first

How to identify the operations in your business worth automating, the modern stack (n8n, Make, custom code, LLMs), and the decision rules that determine whether AI belongs in the workflow or whether deterministic code would do the job better and cheaper.

May 24, 2026~11 min read2,900 wordsby Amine Hn

The most expensive operations in most businesses are the invisible ones. Not the work that has its own dashboard and its own team — that work is at least visible enough to be optimized. The expensive work is the workflow nobody owns, the steady drag of small repetitive tasks that get done because they have to be done, by whoever's available, in whatever way they figure out.

Process automation, done well, is the art of finding that work and replacing it with something the team doesn't have to think about. In 2026 the toolset for doing this is mature; the engineering is largely solved. What's not solved is the discipline: figuring out which work is worth automating, in what order, with what design rules. This guide is about that discipline.

What's changed (and what hasn't) in 2026

The technical stack for process automation has converged. The credible options:

What's new in 2026 is the routine inclusion of LLM steps inside these workflows. Five years ago "automation" meant moving structured data; now it means handling unstructured input — emails, documents, transcripts, images — alongside the structured parts. Every credible workflow tool has native LLM nodes; the orchestration tools have caught up to where they should already be.

What hasn't changed: the audit discipline that determines whether an automation project succeeds. The tools are easier; the choices are still the choices. Most failed automation projects in 2026 fail for the same reasons they failed in 2019.

The audit before the build

Before you write a workflow, audit the operation. The audit answers four questions:

Question one: what is the operation, named precisely? Not "customer onboarding" — "the daily process by which new self-serve trial signups get enriched, scored, and routed to either a rep or a nurture sequence." The precision matters because it determines what you measure later.

Question two: what are the inputs and outputs? Inputs: every trigger that starts the workflow, every system the workflow reads from. Outputs: every system the workflow writes to, every action taken in the real world. Draw this on one page. If you can't fit it on one page, the scope is too big; break it into smaller workflows.

Question three: what are the edge cases? What about the trial signup from a free-mail domain. What about the duplicate signup. What about the signup that comes in at 3 AM on a holiday. What about the signup where the company name has special characters that break a downstream API. Write down the top ten edge cases. If you don't, the workflow will discover them in production and you'll burn three weeks fixing them retroactively.

Question four: what's the cost of being wrong? A misrouted lead is recoverable. An incorrectly tagged invoice is recoverable. A patient whose appointment confirmation went to the wrong number is annoying. A wire transfer sent to the wrong account is catastrophic. The cost of being wrong determines how much guardrail you need around the workflow, how much human-in-the-loop, and what the kill criteria are.

This audit takes 1-2 days for a single workflow. Skipping it is the most common reason process automation projects produce technically-functioning workflows that don't deliver ROI.

When AI belongs in the workflow

A simple decision rule:

Use deterministic code when:

Use AI (an LLM step in the workflow) when:

A practical example. An invoice-processing workflow:

Most production workflows are this kind of hybrid. Pure-AI workflows are usually slower, more expensive, and harder to debug than necessary; pure-deterministic workflows can't handle the unstructured middle.

The handoff to humans

Almost every production workflow has a review step somewhere. The interesting design question is where.

Approve before send. The workflow produces a draft (email reply, outbound message, generated invoice); a human approves with one click before it goes out. Right for high-stakes outputs (customer-facing communications, anything irreversible). Costs a small amount of human time per case, saves most of the value of the rest of the workflow.

Flag low-confidence cases. The workflow runs end-to-end most of the time but routes cases the AI is unsure about to a human queue. Right when the workflow handles a high volume of mostly-easy cases plus a tail of judgment calls. Most support deflection workflows look like this.

Audit periodically. The workflow runs autonomously; a human samples N% of outputs weekly and reviews them. Right when the cost of being wrong on any single case is low but you want quality drift detection.

Override at any point. The workflow runs but there's a one-click "stop / re-do / override" path the operator can use whenever they notice something wrong. Right for monitoring-heavy workflows where the operator has eyes on the dashboard anyway.

Workflows with no review step at all are uncommon and should stay that way until you have months of eval data justifying full autonomy.

A modern reference architecture

The way we'd build a non-trivial process automation in 2026:

  1. Trigger layer: webhooks, schedules, polling, manual runs. Whatever fires the workflow.
  2. Orchestration: n8n or Make hosting the workflow definition. Each step is a node — API call, transformation, condition, LLM call.
  3. Deterministic transforms: where the data is structured, just code it. Most of the workflow.
  4. LLM steps: dropped in surgically where unstructured input meets a judgment call. Use the smallest model that wins your eval (cost matters at workflow scale).
  5. Memory / state: where the workflow needs to remember prior runs — Postgres, KV store, or the orchestration tool's own data store.
  6. Integrations: APIs to your systems — CRM, billing, ticketing, calendar, whatever. Native integrations in n8n/Make for the common ones; HTTP nodes for the rest.
  7. Approval / review: Slack message, email, dashboard — wherever the human looks. Inline buttons for approve/edit/reject.
  8. Observability: every run logged with inputs, outputs, model responses, errors. A simple dashboard you'd actually check.
  9. Alerts: workflow failures go somewhere a human will see them within an hour, not when the customer complains.

This isn't novel. It's the boring composition that wins. The teams that try to be clever about the architecture usually burn budget on something the boring composition would have handled.

What gets automated first, by industry

A non-exhaustive list of workflows we've shipped or quoted that pencil reliably:

The pattern across all of these: high-volume, structurally similar work where a competent operator could describe their decision rules. Pick from this list; ship one end-to-end; expand.

What we'd do differently than most vendors

A few opinions we'd put in writing:

We default to the hybrid stack. Workflow tool plus LLM step where needed, plus deterministic code where needed. Almost no production workflow we ship is pure-AI or pure-deterministic. Vendors that pitch a single tool that does everything are pitching a tool, not a solution.

We default to over-instrumenting before launch, removing later. Every step of every workflow logs in detail for the first 30 days. After that, we strip the verbose logging down to what's useful for ongoing ops. This is cheap and saves you the next "why did this fail in week eight?" mystery.

We default to small LLMs first. Claude Haiku, GPT-4o-mini, the smaller models. They're cheaper, faster, and good enough for most classification and extraction tasks. Drop in the bigger model only for the steps where the small one fails your eval.

We won't ship workflows without kill criteria. If you can't tell us what would make you stop using it, we shouldn't build it. Kill criteria force the audit to happen.

If you've got a workflow you want to scope

Most process automation engagements we run start with the team handing us a list of operations and us coming back with a written priority order plus quotes. The audit step is the most valuable thing we do — often the recommendation is to redesign the upstream process before automating anything downstream. Send us the operations list and we'll respond with a written scope inside a week.

·FAQ

Frequently asked.

What is process automation, and where does AI fit?
Process automation is the practice of replacing manual operational work with software workflows. Historically (and still mostly) this means deterministic automation: scheduled jobs, webhooks, API integrations — data flows from one system to another with no human involvement. AI fits where the operation requires judgment on unstructured inputs: reading an email and deciding what to do with it, summarizing a call into a CRM note, categorizing a document. Modern process automation is a hybrid — deterministic where you can, AI where you must.
n8n vs Make vs Zapier — which should we use?
Different tools for different volumes. Zapier wins on ease-of-use for small teams running 100-2000 task runs per month with simple integrations. Make.com (formerly Integromat) wins on cost and complexity — it handles branching workflows, has better cost-per-run economics, and the visual builder is more capable. n8n wins when you need self-hosting (compliance, data residency), custom code nodes, or scale beyond what cloud tiers price reasonably. We default to n8n for engagements above 50,000 task runs per month or where there's a data residency requirement; Make for everything else under that ceiling.
When is AI the right answer vs deterministic code?
AI is right when the input is unstructured (text, voice, image) and the decision requires judgment a regex can't capture. AI is wrong when the input is structured and a deterministic rule would work fine. A surprising number of 'AI automation' projects could ship 70% of their value with deterministic logic plus one well-placed LLM call. Use the model where it earns its keep; let regular code carry the parts of the workflow that don't need a model's judgment.
What's the first process to automate?
Pick an operation where (1) you can name the operator by title, (2) you can count how many hours per week they spend on it, and (3) you can describe what 'done' looks like in one sentence. Best candidates: inbox triage, document intake, invoice categorization, appointment confirmation, lead enrichment. Worst candidates: anything where the cost of being wrong is high or the failure mode is invisible.
How much does process automation cost to build?
A single deterministic workflow on Make or n8n: $1,500-$5,000 in build cost plus $50-$200/month platform. A workflow with AI steps: $5,000-$25,000 setup plus inference costs and platform. Multi-workflow rollouts with monitoring and integration depth: $30,000+. The cost scales with integration count and edge case coverage, not with technical complexity of the AI step.
What's the biggest mistake in process automation projects?
Automating the wrong process. Most projects optimize a workflow that should have been redesigned first. If the underlying process has been broken for three years, automating it just lets you reach broken-at-scale faster. Audit the operation as it exists; ask whether it should exist in that shape; redesign before you build. About 1 in 5 audits we run conclude that the right move is to fix the upstream process, not automate the downstream symptom.
Do we need a workflow tool, or should we just write code?
Use a workflow tool (n8n, Make) for 80% of cases. The visual builder, the prebuilt integrations, and the operational dashboards earn their keep. Drop to custom code only for the parts of the workflow that (1) require logic the workflow tool can't express cleanly, (2) need performance the tool can't deliver, or (3) handle sensitive data the tool's runtime can't be trusted with. Hybrid is the right answer for almost every non-trivial project.
How long does process automation take to ship?
A single workflow: 1-3 weeks from kickoff to live. Multi-workflow rollouts with monitoring: 3-8 weeks. The bottleneck after the first week is integration access — getting credentials, getting permission, getting the right person to flip the right switch in someone else's system. The technical work is fast; the org work is what stretches timelines.
06The discovery offer

Send us your most expensive operation.
We'll have an audit on your desk in five days.

One PDF. No deck. No obligation. We'll tell you whether AI is the right answer for it — and if it is, we'll quote the build the same week.