Answer D.
for the scaler to kick in, it needs to be 15 minutes of average use above x % that is set. so once the load starts, it will wait 15 minutes until that rule is met, and a scale up will happen.
once it scales up at the 15 minute mark, the 5 minute cooldown starts.
at 20 minute mark, the scaler will LOOK BACK at the PAST 15 minutes to see if during that time the usage was still higher than x% and scale up again.
then it will wait another 5 minutes, check the PAST 15 minutes again, scale up again, etc. etc.
the previous performance data doesnt get thrown out after each scale, it keeps on sliding the dataset its checking forward.
imagine if this was a real app and you get hit by huge traffic surge. like your company release a new product. what this would do in your method is scale up one instance every 20 minutes, so it might mean multiple hours of degraded performance while it catches up to demand, while the reality and my explanation, it would kick in after 15 minutes and then keep scaling every 5 minutes which is a much faster and reasonable solution.