
Professional Data Engineer Exam - Question 249


Your team is building a data lake platform on Google Cloud. As a part of the data foundation design, you are planning to store all the raw data in Cloud Storage. You are expecting to ingest approximately 25 GB of data a day and your billing department is worried about the increasing cost of storing old data. The current business requirements are:

• The old data can be deleted anytime.

• There is no predefined access pattern of the old data.

• The old data should be available instantly when accessed.

• There should not be any charges for data retrieval.

What should you do to optimize for cost?

Correct Answer: A

To optimize for cost while meeting all the business requirements, create the bucket with the Autoclass feature enabled. Autoclass automatically moves each object between storage classes based on how that object is accessed, without affecting performance or availability, and it does not charge retrieval fees. Old data therefore ends up in the most cost-effective storage class without any manual lifecycle management, while remaining instantly available and free of retrieval charges, which matches the stated requirements.
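As a rough sketch of what enabling Autoclass looks like in practice, the snippet below assembles the kind of request body the Cloud Storage JSON API accepts when a bucket is created with Autoclass turned on. The bucket name, the location, and the choice of ARCHIVE as the terminal storage class are assumptions for illustration and do not come from the question. The helper `autoclass_bucket_body` is hypothetical: it only builds the JSON and does not call the API.

```python
import json


def autoclass_bucket_body(name, location="US", terminal_class="ARCHIVE"):
    """Build a bucket-creation request body with Autoclass enabled.

    Hypothetical helper: it assembles JSON only and sends nothing.
    """
    # Autoclass accepts NEARLINE or ARCHIVE as the terminal storage class.
    if terminal_class not in ("NEARLINE", "ARCHIVE"):
        raise ValueError("terminal_class must be NEARLINE or ARCHIVE")
    return {
        "name": name,
        "location": location,
        "autoclass": {
            "enabled": True,
            "terminalStorageClass": terminal_class,
        },
    }


# "example-raw-data-lake" is a placeholder bucket name.
body = autoclass_bucket_body("example-raw-data-lake")
print(json.dumps(body, indent=2))
```

No lifecycle rules appear in the body: with Autoclass the storage class transitions are managed by the service rather than by a policy you write.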

Discussion

7 comments
Smakyel79 (Option: A)
Jan 7, 2024

https://cloud.google.com/storage/docs/autoclass

raaad (Option: A)
Jan 5, 2024

- Autoclass automatically moves objects between storage classes without impacting performance or availability, nor incurring retrieval costs.
- It continuously optimizes storage costs based on access patterns without the need to set specific lifecycle management policies.
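The access-based behaviour described in this comment can be illustrated with a small model. This is only a sketch: the 30, 90, and 365 day thresholds and the default terminal class of Nearline are assumptions based on the Autoclass documentation linked above and should be checked there, and the function name is hypothetical.

```python
# Assumed thresholds: days without access before an object moves to a class.
AUTOCLASS_STEPS = [(365, "ARCHIVE"), (90, "COLDLINE"), (30, "NEARLINE")]


def autoclass_storage_class(days_since_last_access, terminal_class="NEARLINE"):
    """Return the class an object is expected to sit in under Autoclass."""
    if terminal_class == "NEARLINE":
        # With Nearline as the terminal class, objects never go colder.
        return "NEARLINE" if days_since_last_access >= 30 else "STANDARD"
    for threshold, storage_class in AUTOCLASS_STEPS:
        if days_since_last_access >= threshold:
            return storage_class
    return "STANDARD"


for days in (0, 45, 100, 400):
    print(days, autoclass_storage_class(days, terminal_class="ARCHIVE"))
```

In this model an access resets `days_since_last_access` to zero, so the object is back in Standard storage; the transition depends on access, not on the object's age.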

Sofiia98 (Option: A)
Jan 9, 2024

For sure A; read the documentation.

therealsohail (Option: B)
Jan 6, 2024

Create an Object Lifecycle Management policy that changes the storage class to Nearline for data older than 30 days, to Coldline for data older than 90 days, and to Archive for data older than 365 days. Delete old data as needed.
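For comparison, the policy described in this comment could be written as a lifecycle configuration along the following lines. This is a minimal sketch that only generates the JSON; the 30, 90, and 365 day ages come from the comment, while the helper name `lifecycle_config` is hypothetical.

```python
import json

# (age in days, target storage class), as stated in the comment above.
TRANSITIONS = [(30, "NEARLINE"), (90, "COLDLINE"), (365, "ARCHIVE")]


def lifecycle_config(transitions):
    """Return a lifecycle configuration with one SetStorageClass rule per step."""
    rules = [
        {
            "action": {"type": "SetStorageClass", "storageClass": storage_class},
            "condition": {"age": age_days},
        }
        for age_days, storage_class in transitions
    ]
    return {"lifecycle": {"rule": rules}}


config = lifecycle_config(TRANSITIONS)
print(json.dumps(config, indent=2))
```

These rules act on object age only, not on access. Objects moved this way to Nearline, Coldline, or Archive are billed retrieval fees when read, which is the point of tension with the fourth requirement.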

GCP001 (Option: A)
Jan 9, 2024

Autoclass is the correct way to handle all the business cases.

CGS22 (Option: B)
Apr 8, 2024

Why B is the best choice:

Cost Optimization: This option leverages Cloud Storage's different storage classes to significantly reduce costs for storing older data. Nearline, coldline, and archive storage classes are progressively cheaper than the standard storage class, with trade-offs in availability and retrieval times.

Meets Requirements:

- Old data deletion: You can manually delete old data whenever needed, fulfilling the first requirement.
- No predefined access pattern: The policy automatically transitions data to cheaper storage classes based on age, regardless of access patterns.
- Instant availability: Nearline storage provides immediate access to data, meeting the third requirement.
- No retrieval charges: While there are retrieval charges for coldline and archive storage, nearline storage has no retrieval fees, satisfying the fourth requirement.

hussain.sain (Option: B)
Jul 15, 2024

The question clearly specifies that there should not be any retrieval charges, so enabling Autoclass is not recommended, because we have to pay a one-time fee when retrieving the data. Also, soft delete is usually enabled.