What is Integration Testing?

Integration testing verifies that a group of components work together correctly — a service with its database, an API endpoint with its downstream calls, a microservice with its message queue. Integration tests exercise real interactions and catch the class of bug that unit tests cannot: contract mismatches at component boundaries, which is where the majority of production bugs actually live.

Integration tests vs unit tests

Unit tests isolate a single function or class and mock every dependency. They run in milliseconds and catch logic bugs in the code you wrote. Integration tests exercise real dependencies — a real Postgres instance, a real Redis, a real downstream HTTP service — and catch bugs in the contract between your code and those dependencies. Both are necessary, for different reasons.

Dimension	Unit test	Integration test
Scope	One function or class	Multiple components
Dependencies	Mocked	Real (or recorded)
Speed	Milliseconds	Seconds
Catches	Logic bugs	Contract and boundary bugs
Count in a typical service	Hundreds to thousands	Dozens to hundreds

Integration testing tools

Three generations of tools cover different trade-offs:

Framework-native. pytest fixtures, JUnit, Mocha with test containers — integration tests live alongside unit tests in the same codebase. Easy to start, manual assertion writing for every test.
Containerization. Testcontainers spins up real Postgres, Redis, Kafka per test run so tests hit the real thing without sharing state. Realistic but slow and resource-intensive.
Traffic capture. Keploy, WireMock record-playback, Hoverfly record real API interactions and replay them as deterministic tests with auto-generated mocks for downstream dependencies. Fastest to set up and fastest to run at steady state.

The non-determinism problem

The hardest problem with integration testing is non-determinism. Tests that hit real databases, real timestamps, and real distributed tracing contexts produce different results on every run — failing assertions when nothing is actually broken. Teams that hit this problem either (a) adopt aggressive mocking, which defeats the point of integration testing, or (b) quarantine flaky tests until the whole suite loses credibility.

Keploy solves this with a deterministic replay engine that normalizes non-deterministic fields (timestamps, UUIDs, session tokens, pagination cursors) at comparison time. A test passes as long as the replayed response has the same structural shape and the same deterministic fields as the original capture. See Noise & Secret Management for the full mechanism.

FAQs

Integration testing is the practice of testing a group of components together to verify they work correctly as a unit — a service with its database, an API endpoint with its downstream calls, a microservice with the message queue it publishes to. Unlike unit tests that isolate a single function, integration tests exercise real interactions and catch bugs in the boundaries between components, which is where the majority of production bugs live.

Unit tests isolate a single function or class by mocking every dependency — they run in milliseconds and catch logic bugs in your code. Integration tests exercise real dependencies (databases, queues, downstream services) and catch bugs in the contract between your code and the things it talks to. The usual guidance: many unit tests at the bottom of the testing pyramid for fast feedback on logic, and a smaller number of integration tests for the critical service boundaries where real-world failures happen.

Three generations. Framework-native tools (pytest with fixtures, JUnit, Mocha with test containers) let you write integration tests alongside unit tests. Containerization tools (Testcontainers, Dockerize) spin up real databases and queues per test run so tests hit the real thing. Traffic-capture tools (Keploy, WireMock record-playback) record real API interactions and replay them as deterministic tests with auto-generated mocks for downstream dependencies.

Non-determinism. Integration tests that hit real databases, real timestamps, and real distributed tracing contexts produce different results on every run — failing assertions when nothing is actually broken. Fixing this requires either aggressive mocking (which defeats the point of integration testing) or deterministic replay engines that normalize non-deterministic fields at comparison time. See /noise-secret-management for how Keploy handles the deterministic-replay problem.

Keploy captures real HTTP, gRPC, WebSocket, and database traffic flowing through your running service using eBPF at the Linux kernel level. Every captured request becomes a YAML test case with auto-generated mocks for every downstream dependency — databases, message queues, external APIs. There is no SDK, no proxy, and no application code changes. A 10-30 minute capture session in staging produces a complete integration-test suite covering whatever endpoints the traffic exercised, typically reaching 90% coverage.

Yes, ideally. The point of integration testing is catching boundary bugs before they reach production. Running the suite only on main means regressions ship to main before anyone notices. The common counter-argument is that integration tests are slow — which is true for suites that spin up containers for every test. Keploy addresses this by replaying captured traffic against the running service with auto-mocked dependencies, which runs in seconds, not minutes.

Test with Keploy AI

Get the power of AI to your Testing Pipelines!

Join our GlobalCommunity

Connect with developers worldwide. Follow updates, ask questions, share feedback, and ship faster with other Keploy builders.

1.2M+Installs

18.4K+GitHub

100K+Devs

300M+Mocks

1K+Contributors

#1OSS Trending

4.9★★★★★from 500+ reviews on