Fault tolerance

Fault tolerance image

What is fault tolerance?

Fault tolerance is a system's ability to keep running when something goes wrong. Think of it as a safety net that prevents failures in one part of the system from crashing the whole thing. When components fail, a fault-tolerant system adapts and maintains its core functions, often without users even noticing the problem. 

Do you have any examples of fault tolerance? 

In your daily testing work, you'll encounter fault tolerance in many forms. When you're testing a distributed database, you might intentionally shut down one of the database servers and verify that the application still retrieves data seamlessly from the backup servers. 

Another common example is load balancing in web applications—if one web server fails, traffic automatically routes to healthy servers. Modern cloud platforms like AWS and Azure build in fault tolerance through availability zones, so if an entire data center goes down, your application can keep running from another location. 

Why is fault tolerance important? 

For testers, fault tolerance is crucial because it directly impacts the reliability of the systems we validate. When a production system goes down, companies can lose millions in revenue, damage their reputation, or even put users at risk. 

What are the challenges with fault tolerance? 

Testing fault-tolerant systems is complex because you need to verify both normal operations and failure scenarios. You'll face challenges like setting up realistic test environments that can simulate various types of failures, coordinating tests across multiple redundant systems, and verifying that failover mechanisms work correctly under load. 
The membership for software testing professionals image
Join thousands of other software testers in levelling up their careers
Explore MoT
TestBash Brighton 2025 image
Wed, 1 Oct
On the 1st & 2nd of October, 2025 we'll be back to Brighton for another TestBash: the largest software testing conference in the UK
MoT Foundation Certificate in Test Automation
Unlock the essential skills to transition into Test Automation through interactive, community-driven learning, backed by industry expertise
This Week in Testing
Debrief the week in Testing via a community radio show hosted by Simon Tomes and members of the community
Subscribe to our newsletter
We'll keep you up to date on all the testing trends.