Exam Certified Data Engineer Professional All QuestionsBrowse all questions from this exam
Question 39

Which of the following is true of Delta Lake and the Lakehouse?

    Correct Answer: B

    Delta Lake automatically collects statistics on the first 32 columns of each table which are leveraged in data skipping based on query filters. This helps to optimize query performance by allowing the system to quickly determine which parts of the dataset do not need to be read based on the query conditions.

Discussion
guillesdOption: B

B is correct

PatitoOption: B

B is correct since statistics are collected for the first 32 columns and stored in the transaction log.

PrashantTiwariOption: B

B is correct

spaceexplorerOption: B

B is correct

Crocjun

Can anyone explain why D is not correct?

cryptoflam

Because Primary & Foreign Key information is not enforced. "Primary and foreign keys are informational only and are not enforced" from: https://docs.databricks.com/en/tables/constraints.html#declare-primary-key-and-foreign-key-relationships

ervinshangOption: B

B is correct, C is error, con't have new cache in view

f728f7fOption: C

C is correct

chokthewaOption: B

B is correct. https://docs.delta.io/2.0.0/table-properties.html