03-03 Fault Tolerance
A Fault Domain is a section of a network that is vulnerable to damage if a critical device or system fails. The purpose of a fault domain is that if a failure occurs it will not cascade outside that domain, limiting the damage if possible. ^1275f6
You can have fault domains nested fault domains.
A fault level is a collection of fault domains. ^d4ef9c
The scope of a fault domain could be:
- specific servers in a rack
- an entire rack in a datacenter
- the entire data center building
Its up to the CSP to define the boundaries of a domain.
E.g.
an
AWS Region would be a
Fault Level.
an
Availability Zone would be a
Fault Domain.