Exam Certified Data Engineer Professional All QuestionsBrowse all questions from this exam
Question 78

The data engineering team maintains the following code:

Assuming that this code produces logically correct results and the data in the source table has been de-duplicated and validated, which statement describes what will occur when this code is executed?

    Correct Answer: C

    When this code executes, it performs a batch aggregation operation on the 'silver_customer_sales' table grouped by 'customer_id'. It calculates the first and last transaction dates, the average sales, the total number of distinct orders, and the lifetime value of sales for each customer. The .mode('overwrite') operation specifies that the output table 'gold_customer_lifetime_sales_summary' should be overwritten with the result of this aggregation. This means that every time this code runs, it replaces the existing 'gold_customer_lifetime_sales_summary' table with a new version that reflects the aggregate calculations of the 'silver_customer_sales' table.

Discussion
sturcuOption: C

it does overwrite, so no incremental load

aragorn_bregoOption: C

The code is performing a batch aggregation operation on the "silver_customer_sales" table grouped by "customer_id". It calculates the first and last transaction dates, the average sales, the total number of distinct orders, and the lifetime value of sales for each customer. The .mode("overwrite") operation specifies that the output table "gold_customer_lifetime_sales_summary" should be overwritten with the result of this aggregation. This means that every time this code runs, it will replace the existing "gold_customer_lifetime_sales_summary" table with a new version that reflects the current state of the "silver_customer_sales" table.

mouad_attaqiOption: C

Correct Answer is C, it is an overwrite mode

hal2401meOption: C

C. there's nowhere implicating streaming.

DileepvikramOption: C

C is the answer