Build Auto Scaling target tracking, step scaling, and predictive scaling policies.
Last verified: May 2026
Build Auto Scaling target tracking, step scaling, and predictive scaling policies.
Required Fields
AutoScalingGroupNamePolicyNamePolicyTypeOutput will appear here...Auto Scaling policies determine how and when EC2 Auto Scaling groups add or remove instances in response to demand. AWS offers three policy types: target tracking (maintain a metric at a target value), step scaling (scale in discrete steps based on alarm thresholds), and predictive scaling (use machine learning to forecast demand and pre-scale). Choosing the right policy type and configuring the scaling parameters correctly is critical for balancing performance with cost. The Auto Scaling Policy Builder helps you configure all three policy types with proper metric selections, cooldown periods, and scaling adjustments.
Your team's 30-instance Auto Scaling group has been oscillating: scale-out to 60 instances during morning peak, then thrashing as instances finish boot just as load drops. The builder helps configure: target tracking on average CPU at 60% target, instance warm-up of 180 seconds (your Java app's real ready time), plus predictive scaling for the known 9am peak. After deploy, scaling becomes smooth — no more 60-instance overshoots, no more stampedes. Average instance count drops from 38 to 24, saving ~$3K/month while improving response time consistency.
Target tracking is the right default for 95% of workloads. The other policy types feel more powerful but add complexity that's rarely justified. Step scaling is genuinely useful for asymmetric scaling (aggressive scale-out, gradual scale-in to avoid hammering downstream); predictive scaling needs at least 14 days of consistent traffic patterns to work well.
Instance warm-up time MUST match your application's actual time to start serving traffic at full capacity (not just `systemctl start`). For Java apps with JIT compilation, that's often 90-180 seconds. Setting it too low triggers cascading scale-outs; too high causes capacity gaps during real spikes.
Predictive scaling pre-warms capacity 15 minutes BEFORE a forecasted demand peak. For traffic that ramps up sharply at a known time (9am login storm, daily report runs), this eliminates the 'always 5 minutes behind' pain of reactive scaling. Combine with target tracking as a safety net for unforecasted spikes.
The builder generates Auto Scaling policies for each type: target tracking (specify metric type, target value, optional disable scale-in), step scaling (alarm thresholds with scaling adjustments), or predictive scaling (metric specification, mode: ForecastAndScale or ForecastOnly, scheduling buffer time). Output is generated as aws autoscaling put-scaling-policy commands and Terraform aws_autoscaling_policy resources, plus the underlying CloudWatch alarms for step scaling.
Was this tool helpful?
Disclaimer: This tool runs entirely in your browser. No data is sent to our servers. Always verify outputs before using them in production. AWS, Azure, and GCP are trademarks of their respective owners.