Scaling Policies and Decision Strategies
Auto Scaling behavior is defined by scaling policies.
There are three main strategies:
1. Target Tracking Scaling
You define a metric target (e.g., average CPU utilization at 50%). AWS automatically adjusts capacity to maintain that target.
This is the recommended approach for most systems.
Architecturally, this creates a self-balancing system that maintains steady performance.
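As a concrete sketch, a target-tracking policy is just a metric specification plus a target value. The dictionary below mirrors the shape accepted by the EC2 Auto Scaling PutScalingPolicy API (for example via boto3's autoscaling client); the group and policy names are placeholders.

```python
# Illustrative target-tracking policy definition. The group and policy
# names ("web-asg", "cpu-at-50") are placeholders, not real resources.
target_tracking_policy = {
    "AutoScalingGroupName": "web-asg",
    "PolicyName": "cpu-at-50",
    "PolicyType": "TargetTrackingScaling",
    "TargetTrackingConfiguration": {
        # Track the group's average CPU and hold it near 50%.
        "PredefinedMetricSpecification": {
            "PredefinedMetricType": "ASGAverageCPUUtilization"
        },
        "TargetValue": 50.0,
    },
}
```

Note that you declare only the target; AWS derives the scale-out and scale-in actions needed to hold it.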
2. Step Scaling
Scaling actions occur in steps, sized according to how far the metric breaches its threshold.
Example:
- CPU > 60% → add 1 instance
- CPU > 80% → add 3 instances
This gives fine-grained control but requires careful tuning.
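The decision logic of the step policy above can be sketched as a small function: steps are evaluated from the largest breach down, so only one step fires per evaluation.

```python
def step_scaling_delta(cpu_percent: float) -> int:
    """Return how many instances to add for the step policy in the text.

    Steps (illustrative):
      CPU > 80%  -> add 3 instances
      CPU > 60%  -> add 1 instance
      otherwise  -> no change
    """
    if cpu_percent > 80:
        return 3
    if cpu_percent > 60:
        return 1
    return 0
```

For example, a CPU reading of 85% adds 3 instances, 65% adds 1, and 40% triggers no action. Tuning means choosing thresholds and step sizes so adjacent steps do not fight each other.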
3. Scheduled Scaling
Used for predictable traffic patterns.
Example:
- Scale to 10 instances at 8 AM
- Scale to 3 instances at midnight
Useful for business-hour workloads.
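The schedule above amounts to a simple mapping from time of day to desired capacity, sketched here as a function (the hours and capacities are the illustrative values from the text):

```python
def scheduled_capacity(hour: int) -> int:
    """Desired capacity for a business-hours schedule (illustrative).

    Scale to 10 instances at 8 AM; scale back to 3 at midnight.
    `hour` is on a 24-hour clock (0-23).
    """
    return 10 if 8 <= hour <= 23 else 3
```

In practice, you would register these as scheduled actions on the Auto Scaling group rather than compute them yourself, but the mapping is the same.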
Cooldown and Warmup
Scaling is not instant. New instances need:
- Boot time
- Application startup
- Health check validation
If scaling decisions ignore warmup time, you risk oscillation (rapid scaling in and out).
Production insight: Always configure instance warmup and use load balancer health checks.
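One way to see why warmup matters: a cooldown gate suppresses further scaling decisions until the last action has had time to take effect. A minimal sketch, assuming a 300-second cooldown:

```python
from datetime import datetime, timedelta


def may_scale(last_action: datetime, now: datetime,
              cooldown: timedelta = timedelta(seconds=300)) -> bool:
    """Cooldown gate (illustrative): allow a new scaling action only
    after `cooldown` has elapsed since the last one. Without this gate,
    metrics measured before new instances finish warming up can trigger
    a second, unnecessary action and cause oscillation."""
    return now - last_action >= cooldown
```

For example, 120 seconds after a scale-out the gate still blocks; at 300 seconds it opens again.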
Architectural Implications
Improper scaling policies cause:
- Thrashing (repeated scale-out/scale-in cycles)
- Increased costs
- Latency spikes
Well-designed policies:
- Protect user experience
- Maintain steady state
- Optimize cost
Scaling policy design is part of system reliability engineering.