Auto Scaling automatically adjusts the number of running resources (e.g. EC2 instances) based on demand — adding capacity when load increases and removing it when load decreases. It's key to building applications that are both reliable (handle traffic spikes) and cost-efficient (don't over-provision).
What auto scaling does
Auto Scaling adjusts capacity AUTOMATICALLY based on demand:
→ load INCREASES (traffic spike) → add instances (scale OUT) → handle the load
→ load DECREASES → remove instances (scale IN) → save money
→ Match capacity to demand automatically: reliable under load, cheap when idle.
