High availability (HA) means designing systems to remain operational despite failures — through redundancy, multi-AZ deployment, auto-recovery, and eliminating single points of failure. It's a fundamental goal for production systems and a key area of AWS architecture.
Core HA principles
✓ ELIMINATE SINGLE POINTS OF FAILURE — no single component whose failure takes down
the system → redundancy everywhere (multiple instances, AZs, etc.)
✓ Deploy across MULTIPLE AVAILABILITY ZONES — survive an AZ (data center) failure
✓ AUTOMATIC RECOVERY — detect failures and recover/replace automatically (no manual fix)
✓ DECOUPLE components — failures isolated; one component's failure doesn't cascade
