Why Integration Tests Won't Save Your Microservices

Kevin Brown on Apr 7, 2024

6 min read

Consumer and provider services separated by two-way mirror with glowing contract showing mutual expectations between them

A User Service team ships what they consider a minor change: renaming userId to user_id in their response payload to match their new coding standards. They updated their OpenAPI spec. They ran their own tests. Everything passed.

Three services broke in production that Friday.

The Order Service, Shipping Service, and Analytics Service all consumed that field. Nobody had checked with them. The deployment happened on a Friday afternoon. The on-call engineer spent the weekend coordinating rollbacks and emergency patches.

This scenario plays out constantly in microservice architectures. The instinctive response is to add more integration tests. If Service A calling Service B fails in production, write a test that spins up both services and verifies the call works.

That instinct is wrong. Integration tests won’t save you here. Consumer-driven contracts will.

The Integration Test Trap

Integration tests require all services to be running simultaneously. For a simple three-service chain, that means coordinating databases, message queues, and network connectivity across all three. Add a fourth service, and the coordination overhead grows. By the time you have 20 services, the “integration test environment” has become a full-time job for someone on the platform team.

The problems compound quickly. Shared test databases accumulate garbage data from previous runs, causing tests to fail for reasons unrelated to the code change. Port conflicts appear when two developers run tests simultaneously. Network timeouts introduce flakiness that erodes trust in the test suite. Developers stop running integration tests locally because they take too long, pushing the feedback loop to CI where it’s even slower.

When an integration test fails, debugging becomes archaeology. A failure in Service X might be caused by a change in Service Y, but the stack trace only shows where the exception was thrown, not where the problem originated. Reproducing the failure locally requires spinning up the entire environment, which often behaves differently than CI.

Comparison of testing approaches for API compatibility.
#	Testing Approach	Speed	Reliability	Isolation	Catches Breaking Changes
1	Unit tests	Fast	High	Complete	No (mocks hide reality)
2	Integration tests	Slow	Low	None	Yes (in staging/prod)
3	E2E tests	Very slow	Very low	None	Yes (in production-like)
4	Contract tests	Fast	High	Complete	Yes (at PR time)

Comparison of testing approaches for API compatibility.

The fundamental problem is that integration tests conflate two concerns: verifying that your code works correctly, and verifying that your code is compatible with its dependencies . Unit tests verify correctness. Contract tests verify compatibility. Neither requires spinning up the entire world.

Teams often try to solve the compatibility problem with documentation instead. “We’ll maintain an OpenAPI spec, and consumers will code against it.” This works for about three months. The spec was written when the API was first built. Six months later, someone refactored the response format. They updated the code, ran the tests, and shipped. The OpenAPI spec stayed frozen because updating it was “on the backlog.”

Documentation without enforcement is fiction. Schemas describe intent; contract tests verify reality.

How Consumer-Driven Contracts Work

Traditional API testing puts the provider in charge. The provider defines a schema, publishes documentation, and consumers build against it. If the provider changes the API, consumers find out when their code breaks — often in production.

Consumer-driven contracts invert this model. The consumer defines what it needs from the provider and encodes those expectations in a contract. The provider then verifies it can satisfy that contract. Both sides test against the same artifact, so compatibility is verified before either side deploys.

Pact is the most widely-used contract testing framework, with libraries for JavaScript, Java, Python, Go, and more. The Pact Broker is a separate service that stores contracts and tracks verification results across all your services.

Contract testing workflow showing consumer-first approach description

Sequence diagram showing the consumer writing a test, generating a contract, and publishing it to the Pact Broker, followed by the provider fetching the contract, replaying it against the real API, publishing verification results, and the broker tracking compatibility across service versions.

Here’s what this looks like in practice. A consumer test defines the expected request and response for an API call:

import { PactV3, MatchersV3 } from '@pact-foundation/pact';
tags: ["apis-and-gateways", "docker", "kubernetes", "typescript", "python", "go", "ruby", "dotnet"]
// like() matches type/structure, not exact values — keeps contracts flexible

const provider = new PactV3({
  consumer: 'OrderService',
  provider: 'UserService',
});

describe('User API Contract', () => {
  it('returns user details for valid ID', async () => {
    await provider
      .given('user 123 exists')
      .uponReceiving('a request for user 123')
      .withRequest({
        method: 'GET',
        path: '/users/123',
      })
      .willRespondWith({
        status: 200,
        body: {
          userId: like('123'),
          email: like('user@example.com'),
        },
      })
      // Pact spins up a mock server; consumer tests run against it
      .executeTest(async (mockServer) => {
        const client = new UserApiClient(mockServer.url);
        const user = await client.getUser('123');
        expect(user.userId).toBeDefined();
      });
  });
});

Consumer contract test defining expected API behavior.

Notice the like() matcher wrapping values. The contract tests structure, not exact values. The consumer expects a userId field containing a string, but doesn’t care if it’s “123” or “abc-456”. This loose matching keeps contracts maintainable — they verify the shape of responses without becoming brittle assertions on test data.

When this test runs, Pact spins up a mock server that returns the expected response. The consumer’s actual API client code runs against this mock, validating that the client correctly handles the response format. Pact then generates a JSON contract file capturing the interaction.

That contract gets published to a Pact Broker, where the provider fetches it and replays the requests against its real implementation. If the provider’s response doesn’t match the expected structure, the provider’s build fails — before deployment, before it reaches any consumer.

The contract becomes a living artifact that both sides test against. Consumers can’t expect something they haven’t declared. Providers can’t break something they’ve verified they support.

newsletter.subscribe

The Deployment Safety Net

Contracts alone don’t prevent bad deployments. The real power comes from tracking which versions are compatible and blocking deployments that would break compatibility.

The Pact Broker maintains a matrix of every consumer version, every provider version, and whether they’ve been verified as compatible. The can-i-deploy command queries this matrix before any deployment:

# Before deploying UserService v2.3.1 to production
pact-broker can-i-deploy \
  --pacticipant UserService \
  --version 2.3.1 \
  --to-environment production

This command answers: “If I deploy UserService v2.3.1 to production, will it break any consumer that’s currently deployed there?” Not “probably,” not “I think so”—a definitive yes or no based on actual verification results.

When a provider makes an incompatible change, they discover it immediately. The provider’s CI runs verification against all consumer contracts, and any contract violation fails the build with a clear message identifying which consumer would break and why.

Contract violation detected at PR time, not production description

Flowchart showing a provider code change triggering CI to fetch consumer contracts, then branching on verification success: successful verification leads to a successful can-i-deploy check and safe deployment, while failure breaks the build, reports that OrderService expects userId, and sends the developer to fix the change or coordinate with consumers.

This is the fundamental shift: from discovering API incompatibilities in staging (or worse, production) to catching them at PR time. The provider team doesn’t need to manually coordinate with every consumer team. They don’t need to check a wiki or ask in Slack. The contracts encode what consumers actually use, and CI enforces compatibility automatically.

The deployment safety isn’t theoretical. When the UserService team tries to rename userId to user_id, their build fails immediately. The error message tells them exactly which services would break: OrderService, ShippingService, and AnalyticsService all expect userId. They can make an informed decision — coordinate with those teams, version the API, or reconsider the change entirely — before any code reaches production.

The can-i-deploy command moves deployment decisions from “hope and pray” to “verified compatibility.” Your CI pipeline becomes the enforcement mechanism, not late-night Slack messages.

Where to Go From Here

Contract testing fills the gap between fast unit tests and slow integration tests. It catches the specific category of bugs that matter most in distributed systems: API incompatibilities between services that only manifest when deployed together.

Free PDF Guide

Download the Contract Testing Guide

Get the complete implementation playbook for consumer-driven contracts, provider verification, and safe multi-service deployments.

What you'll get:

Pact workflow setup checklist
Matcher design best practices
Provider state management guide
Can-i-deploy policy runbook

Free resource

Instant access

Download Now

Learn More

No credit card required.

The approach requires some organizational shift. Consumer teams must write contracts that capture their actual dependencies. Provider teams must run verification as part of their CI pipeline. Both sides must publish results to a shared broker. But the payoff is substantial: confident deployments without coordinating across every team, without maintaining heavyweight test environments, and without discovering breaks in production.

The userId incident that opened this article? With contract testing in place, it becomes a failed PR instead of a weekend incident. The build breaks, the developer sees which consumers would be affected, and they make an informed choice about how to proceed. That’s the difference between integration tests and contracts: one tells you something broke, the other tells you before you break it.

Enjoyed the read? Share it with your network.

Table of Contents

Download the Contract Testing Guide

Your Rate Limiter Is Your Biggest Outage Risk

Why Your Traces Are Unreadable: Span Design

Terraform Module Defaults That Won't Break Your Consumers

Why Your E2E Tests Are Flaky (And How to Fix Them)

How We Cut Preview Environment Costs by 60 Percent

Table of Contents

The Integration Test Trap

How Consumer-Driven Contracts Work

The Deployment Safety Net

Where to Go From Here

Download the Contract Testing Guide

Share this article

Your Rate Limiter Is Your Biggest Outage Risk

Why Your Traces Are Unreadable: Span Design

Terraform Module Defaults That Won't Break Your Consumers

Why Your E2E Tests Are Flaky (And How to Fix Them)

How We Cut Preview Environment Costs by 60 Percent