Professional Cloud Architect Exam - Question 23

Question

Your solution is producing performance bugs in production that you did not see in staging and test environments. You want to adjust your test and deployment procedures to avoid this problem in the future.

What should you do?

Examice · Accepted Answer

To address performance bugs that appear in production but not in staging and test environments, it is crucial to make the test and staging environments as similar to production as possible. Increasing the load on test and staging environments helps simulate the real-world conditions your application will face, revealing performance issues that may not surface under lighter or less varied loads. This approach targets the root cause of the issue by ensuring your tests are as robust and realistic as the production environment. Deploying smaller changes or fewer changes to production does not directly address the testing environment discrepancies. While canary deployments can help catch some issues with a smaller user base, they may not fully replicate the performance stress seen in full production, making them insufficient for this specific problem.

ghitesh · Answer

Question Statement: You want to adjust your test and deployment procedures to avoid this problem in the future

So based on this, I think the option "C" is correct, since it is the only one talking about doing changes in the test environment.

Eroc · Answer

A wouldn't prevent the bugs, it would just avoid them. B would help with root-cause analysis because it'd be a smaller change to review. C would test the performance of the system at its peak processing rates, so this assumes the bugs in production only occur because of usage. D would allow you to test the new code against smaller user sets to see if it occurs then, and if it still does you know it is not because of more user responses. So it's a tossup between C and D, D would be the cheaper/quicker answer so I'd choose D first then C if it's because of usage.

MahAli · Answer

Canary deployment is perfect to test new feature but to do stress testing, I do development for 25 years, when we want to resolve performance and scalability issues we do stress and load testing in pre prod environment, something you can't do by exposing the new feature to subset of users.

jrisl1991 · Answer

I'm going for D. According to ChatGPT:

D. The best approach to avoid performance bugs in production that weren't detected in staging and test environments is to gradually roll out changes to a small subset of users before deploying them to production. This way, you can identify and address any issues that may arise in a controlled environment before affecting all users.

Why not C: Increasing the load on your test and staging environments, as suggested in option C, can be a valuable strategy for detecting certain types of performance issues related to scalability and load handling. However, it may not address all types of performance bugs or issues that are specific to the production environment.

A and B seem a bit obviously wrong. A is incorrect because it would be incompatible with CI/CD, and deploying fewer changes may still introduce bugs regardless of the low frequency. B would help root causing, but may also introduce new bugs that are exclusive to production.

Haigk · Answer

Without overthinking the wording, canary (and similar) deployment methodologies are often recommended in Google documentation, whereas increasing load in dev environments aren't. (My $0.02...)

Johnny_K35 · Answer

It says bugs "that you did not see in staging and test environments" so should be D

Gregwaw · Answer

From the question we don't know anything about reasons of performance problems in production environemnt. We also don't know anything about tests that were performed in test in staging environments. There is no reason to believe that load in test environment is not sufficient. It is always possible that some performance problems occurs only in production environment. It is also not economically reasonable to reproduce full production load in test environemnt. Taking all of this into account I am incling to answer D.

Patrick2708 · Answer

C looks good to me

DA95 · Answer

Answer is D
https://www.devopsdigest.com/bugs-in-production-how-to-avoid-unpleasant-surprises

rescolar · Answer

Bard says it's C so everyone saying D back off inmediately.

duzapo · Answer

C if you test with a subset group of users you cant test the performance properly

yilexar · Answer

It is really about Canary deployment

Prakzz · Answer

It talks about test and deployment Procedures NOT Environment. Answer is D

MarekPL · Answer

as for C: synthetic load may not cover all scenarios. For D obviously we need to have monitoring in place to see if e.g. system load or response times increased after canary deployment

santoshchauhan · Answer

D. Deploy changes to a small subset of users before rolling out to production.

This approach, known as canary releasing or canary deployment, involves rolling out changes to a small group of users before deploying them to the entire user base. It is a very effective way to catch performance issues that might not have been apparent during testing.
C. Increase the load on your test and staging environments: This is definitely a good practice, as it can help simulate production-like conditions more closely. However, it may still not capture all real-world scenarios and user behaviors that can lead to performance issues.

Robert0 · Answer

Although all answers can be good practices, I think only option C address the problem described.

a2le · Answer

Me too! I can't see how a "performance bug" might be mitigated via a canary deployment.
However, I see that C doesn't cover the "deployment" part of the question, then I deduce that the question is ambiguously formulated.

Professional Cloud Architect Exam - Question 23

Discussion