JW · Josh Weir
← The Enterprise Architect
Spoke · The Enterprise Architect

Integration tax vs redesign cost: which calculation matters when

Almost every operator-grade stack reaches a moment where a structural problem in the architecture is producing recurring friction. The default response is to build integrations around the problem. The integrations work, the friction is reduced, and the architecture survives unchanged. The accumulated cost of all the integrations — the maintenance, the failure modes, the cognitive overhead — is the integration tax. It is paid every month, indefinitely, and it is not on any invoice.

The alternative is to redesign the architecture so that the problem is removed structurally. The redesign cost is large and visible. The integration tax is small and invisible. The decision between them is one of the most consequential framings in operator architecture, and getting it wrong in either direction is expensive. This piece is the version of the framing we run.

What the integration tax actually is

The integration tax has several components, none of which appear on a bill.

  • Maintenance. Each integration requires periodic attention as the underlying systems change. The cumulative time across many integrations is real engineering capacity that could otherwise be spent on more durable work.
  • Failure modes. Each integration introduces a new way for the system to fail. The aggregate reliability degrades roughly with the count of integrations, not their individual quality.
  • Cognitive overhead. Operators have to hold the integration topology in their heads when reasoning about any change. The complexity is paid every time a decision is made about the system, not just when the integrations are running.
  • Drift. Each integration is an opportunity for the documented architecture and the actual architecture to diverge. The drift accumulates, and after a few years the architecture as documented is meaningfully different from the architecture as run.

Each of these is small per integration. The cumulative effect across a stack with many integrations is large enough to dominate the long-term operating economics.

What the redesign cost is

The redesign cost is more visible and more concentrated. It includes the engineering time to build the new architecture, the migration time to move data and workflows from the old to the new, the operational risk during the transition, the political capital spent renegotiating with stakeholders who depended on the old system, and the period of degraded productivity while operators adjust to the new architecture.

The redesign cost is felt in a defined window — typically a quarter to a year, depending on the scope. After the window, the new architecture is in place and the operating cost falls back to a baseline that, if the redesign is good, is structurally lower than the old one.

The temptation to underinvest in redesigns is structural. The integration tax is invisible; the redesign cost is on a project plan. Operators routinely choose to pay the larger long-term cost because the smaller short-term cost is what shows up in the budget conversation. The framing has to be deliberate to overcome this bias.

The framing that produces the right call

The framing we use is to estimate both costs at honest scale, on the same time horizon.

For the integration tax, we estimate the cumulative engineering time per year across all the integrations the architectural problem is producing, the expected aggregate downtime per year from the failure modes the integrations introduce, and the cognitive overhead expressed as a percentage drag on every architectural decision the operator makes. We project these out five years.

For the redesign cost, we estimate the engineering time, the migration time, the operational risk, and the post-migration baseline. We project the same five years, with the redesign happening at the start.

The two trajectories are then comparable. In our experience, redesigns that look expensive on a one-year horizon often look obviously correct on a three-year horizon, and even more obviously correct on a five-year horizon. The temptation to delay the redesign is partly a temptation to keep paying a tax whose magnitude has not been honestly estimated.

When integrations are the right answer anyway

The framing also identifies the cases where integration is the correct call. Three patterns recur.

The architectural problem is downstream of a vendor decision the operation cannot reverse. If the underlying limitation is in a third-party system the operation has chosen to use, redesigning around it requires either replacing the vendor or rebuilding the surface. Both are heavier than the integration. The integration is the right answer until the vendor relationship is itself due for revisiting.

The redesign would touch capabilities that are themselves on a short replacement horizon. A redesign whose investment is amortised against capabilities that will be replaced within eighteen months is not a real redesign; it is a more expensive integration. The integration is correct until the underlying capabilities have stabilised.

The architectural problem is small in absolute terms. Some problems are genuinely worth integrating around because the integration tax is small enough to be tolerable indefinitely. The framing has to honestly estimate the tax; if it is small, integrate and move on.

The discipline is to apply the framing rather than to default to either response.

Sequencing the redesigns

For operations that have accumulated multiple architectural problems worth redesigning, the sequencing matters as much as the decisions themselves. The principle is to redesign the architectural primitives that the rest of the stack depends on, before redesigning the stack that sits on top.

If the underlying data layer is wrong, redesign that first. The dependent integrations and workflows can be migrated onto the new layer in subsequent steps. The reverse sequence — redesigning the workflow layer while leaving the data layer in place — produces work that is undone when the data layer is eventually redesigned.

Similarly, if the orchestration layer is wrong, redesign that before reworking individual workflows. If the identity and credential management is wrong, redesign that before reworking access control across services. The architectural primitives are the foundation, and the sequence respects the dependency.

The takeaway

Integration tax versus redesign cost is the most consequential framing in operator architecture and one of the most under-done. Operators routinely accumulate integration tax indefinitely because the visible cost is smaller, even though the long-term cost is much larger. The framing — estimating both costs honestly, on the same time horizon — is the work that produces the right call.

For operations evaluating whether to ship the next integration or to revisit the architecture that is producing the friction, the framing is the conversation worth having. Most operations come out of the conversation with a redesign decision that they should have made earlier, and a few come out with a confirmed integration decision that they can now stop second-guessing. Either outcome is better than continuing to pay an unmeasured tax.

Working on this?

For operators evaluating sovereign-infrastructure architecture for a business of meaningful scale, we run a quarterly cohort of stack-design engagements.

Get in touch

Search terms this article addresses

architecture redesign vs integrationtechnical debt prioritisationoperator-grade architecture decisionsintegration tax costinfrastructure redesign costarchitectural decision frameworktechnical debt economicssmall ops architecture strategy

Related under The Enterprise Architect