Certified Data Engineer Associate Exam - Question 7


A data engineer has realized that they made a mistake when making a daily update to a table. They need to use Delta time travel to restore the table to a version that is 3 days old. However, when the data engineer attempts to time travel to the older version, they are unable to restore the data because the data files have been deleted.

Which of the following explains why the data files are no longer present?

Correct Answer: A

The VACUUM command in Delta Lake removes data files that are no longer referenced by the current version of the table and are older than the retention threshold (7 days by default). Time travel to an older version depends on those files, so once VACUUM deletes them, that version can no longer be restored. A version that is only 3 days old falls inside the default 7-day window, so losing its files implies that VACUUM was run on the table with a shorter retention period, either passed to the command or configured on the table.
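The retention rule can be illustrated with a small toy model. This is only a sketch in plain Python, not Delta Lake's implementation: the `DataFile` class, the `vacuum` function, the file names, and the dates are all hypothetical, and the model assumes a file becomes eligible for deletion based solely on when the current table version stopped referencing it.

```python
from dataclasses import dataclass
from datetime import datetime, timedelta
from typing import List, Optional, Tuple

# Delta Lake's documented default retention threshold for VACUUM is 7 days.
DEFAULT_RETENTION = timedelta(days=7)


@dataclass(frozen=True)
class DataFile:
    """Hypothetical stand-in for one data file of a table."""
    path: str
    # When a later commit stopped referencing this file.
    # None means the current table version still uses it.
    removed_at: Optional[datetime]


def vacuum(
    files: List[DataFile],
    now: datetime,
    retention: timedelta = DEFAULT_RETENTION,
) -> Tuple[List[DataFile], List[DataFile]]:
    """Split files into (kept, deleted) under a simplified retention rule."""
    cutoff = now - retention
    kept, deleted = [], []
    for f in files:
        # Only files that are unreferenced AND older than the cutoff are deleted.
        if f.removed_at is not None and f.removed_at < cutoff:
            deleted.append(f)
        else:
            kept.append(f)
    return kept, deleted


now = datetime(2024, 1, 10)
files = [
    # Overwritten 2 days ago; the 3-day-old version still needs it.
    DataFile("part-0001.parquet", removed_at=now - timedelta(days=2)),
    # Belongs to the current version.
    DataFile("part-0002.parquet", removed_at=None),
]

# With the default 7-day retention, nothing is deleted.
kept_default, deleted_default = vacuum(files, now)

# With a 24-hour retention, the file needed for time travel is deleted.
kept_short, deleted_short = vacuum(files, now, retention=timedelta(hours=24))
```

In this model the 3-day-old version stays restorable under the default retention and is lost only when the retention period is shortened, which matches the scenario in the question.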

Discussion

17 comments
Feroz_Raza (Option: A)
Nov 16, 2023

There is no DELETE HISTORY command in Databricks. The VACUUM command can remove the old data files that table history depends on, and you can specify the retention period with the VACUUM command. The default retention period is 7 days. To run VACUUM with a retention period shorter than the default, you first have to disable the safety check: SET spark.databricks.delta.retentionDurationCheck.enabled = false;
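As a rough illustration of how that safety check behaves, here is a toy model in plain Python. It is a sketch under stated assumptions, not Delta Lake code: `validate_vacuum_retention` and `RetentionCheckError` are hypothetical names, and the model assumes the check simply compares the requested retention against the configured threshold (168 hours by default).

```python
# 7 days, the documented default retention threshold for VACUUM.
DEFAULT_RETENTION_HOURS = 168


class RetentionCheckError(ValueError):
    """Hypothetical error type used only in this sketch."""


def validate_vacuum_retention(
    retain_hours: float,
    check_enabled: bool = True,
    configured_hours: float = DEFAULT_RETENTION_HOURS,
) -> float:
    """Return retain_hours if this simplified safety check allows it."""
    if retain_hours < 0:
        raise ValueError("retention must not be negative")
    # With the check enabled, a retention shorter than the configured
    # threshold is rejected instead of silently deleting recent files.
    if check_enabled and retain_hours < configured_hours:
        raise RetentionCheckError(
            f"retention of {retain_hours}h is below the {configured_hours}h threshold"
        )
    return retain_hours


# Default behaviour: a 24-hour retention is rejected.
try:
    validate_vacuum_retention(24)
    rejected = False
except RetentionCheckError:
    rejected = True

# With the check disabled, the same request goes through.
allowed_hours = validate_vacuum_retention(24, check_enabled=False)
```

This is why a 3-day-old version normally survives VACUUM: someone has to opt out of the check, or lower the table's configured retention, before such recent files can be removed.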

vctrhugo (Option: A)
Sep 3, 2023

A. The VACUUM command was run on the table. The VACUUM command in Delta Lake is used to clean up and remove unnecessary data files that are no longer needed for time travel or query purposes. When you run VACUUM with certain retention settings, it can delete older data files, which might include versions of data that are older than the specified retention period. If the data engineer is unable to restore the table to a version that is 3 days old because the data files have been deleted, it's likely because the VACUUM command was run on the table, removing the older data files as part of data cleanup.

KalavathiP (Option: A)
Sep 26, 2023

A is correct

Huroye (Option: A)
Nov 17, 2023

I agree with the first post. A is the correct answer. There is no such thing as a DELETE HISTORY command.

cpalmier (Option: A)
Aug 23, 2023

A is correct! Does a DELETE HISTORY command even exist?

Sriramiyer92 (Option: A)
Oct 1, 2023

Reading Material: https://learn.microsoft.com/en-us/azure/databricks/delta/vacuum#example-syntax-for-vacuum

VijayKula (Option: A)
Oct 9, 2023

The answer is A: VACUUM.

vivekrrr (Option: A)
Oct 30, 2023

But by default VACUUM only removes files that are older than 7 days, right?

vivekrrr (Option: B)
Oct 30, 2023

I think B is the answer. Please let me know if that is not correct.

awofalus (Option: A)
Nov 7, 2023

The right answer is A.

SerGrey (Option: A)
Dec 24, 2023

A is correct

Itmma (Option: A)
Mar 19, 2024

A is correct

bettermakeme (Option: A)
Mar 25, 2024

The answer is A. I just finished the exam and got 100%. All questions came from the Databricks Certified Data Engineer Associate practice exams: https://www.udemy.com/share/10aEFa3@9M_uT6vrKbnl68tOK96kfy-YWitjwzLTlVCrzPs-0hGUu8fyX8V4Tn_x_y65bwLm/

mascarenhaslucas (Option: A)
Jun 9, 2024

The answer is A!

potaryxkug (Option: A)
Jun 16, 2024

A is the right answer

3fbc31b (Option: A)
Jul 8, 2024

There is no DELETE HISTORY command anywhere in Databricks. The VACUUM command removes unreferenced data files older than the retention period that is set. The default value is 7 days.

ranjan24 (Option: A)
Jul 17, 2024

A is the correct answer