Certified Data Engineer Professional Exam QuestionsBrowse all questions from this exam

Certified Data Engineer Professional Exam - Question 142


The data engineering team maintains the following code:

Assuming that this code produces logically correct results and the data in the source tables has been de-duplicated and validated, which statement describes what will occur when this code is executed?

Show Answer
Correct Answer: B

The code uses the .write.mode('overwrite') method when writing the final DataFrame to the table 'enriched_itemized_orders_by_account'. This method will overwrite the existing table with the new data, thus ensuring the table is completely updated with the latest data from the three source tables ('accounts', 'orders', 'items'). Therefore, the enriched_itemized_orders_by_account table will be overwritten using the current valid version of data in each of the three tables referenced in the join logic.

Discussion

1 comment
Sign in to comment
hpkrOption: B
Jun 12, 2024

B is correct