
Certified Data Engineer Professional Exam - Question 39


Which of the following is true of Delta Lake and the Lakehouse?

Correct Answer: B

By default, Delta Lake automatically collects statistics on the first 32 columns of each table and uses them for data skipping based on query filters. This improves query performance by letting the engine quickly determine, from the query conditions, which files of the dataset do not need to be read.
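
As a rough illustration of how min/max statistics allow files to be skipped, here is a minimal pure-Python sketch. It is not Delta Lake's implementation: the file names, the `ADD_ACTIONS` list, and the `files_to_read` helper are hypothetical, and the `stats` layout (`numRecords`, `minValues`, `maxValues`) is only loosely modeled on the per-file statistics Delta records.

```python
import json

# Hypothetical per-file entries, each carrying a JSON "stats" string with
# min/max values for the columns that have statistics (assumed layout).
ADD_ACTIONS = [
    {"path": "part-0000.parquet",
     "stats": json.dumps({"numRecords": 100,
                          "minValues": {"id": 1}, "maxValues": {"id": 100}})},
    {"path": "part-0001.parquet",
     "stats": json.dumps({"numRecords": 100,
                          "minValues": {"id": 101}, "maxValues": {"id": 200}})},
    {"path": "part-0002.parquet",
     "stats": json.dumps({"numRecords": 100,
                          "minValues": {"id": 201}, "maxValues": {"id": 300}})},
]


def files_to_read(actions, column, value):
    """Return paths of files whose [min, max] range for `column` may contain `value`."""
    selected = []
    for action in actions:
        stats = json.loads(action["stats"])
        lo = stats["minValues"].get(column)
        hi = stats["maxValues"].get(column)
        # A file without statistics for the column can never be skipped safely.
        if lo is None or hi is None or lo <= value <= hi:
            selected.append(action["path"])
    return selected


# A filter like `id = 150` only needs the one file whose range covers 150.
print(files_to_read(ADD_ACTIONS, "id", 150))
# A column with no collected statistics forces a read of every file.
print(files_to_read(ADD_ACTIONS, "note", "x"))
```

The second call shows why the 32-column default matters: a filter on a column without statistics cannot exclude any file.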

Discussion

8 comments
Patito (Option: B)
Dec 29, 2023

B is correct since statistics are collected for the first 32 columns and stored in the transaction log.

guillesd (Option: B)
Feb 5, 2024

B is correct

chokthewa (Option: B)
Oct 23, 2023

B is correct. https://docs.delta.io/2.0.0/table-properties.html

f728f7f (Option: C)
Dec 20, 2023

C is correct

ervinshang (Option: B)
Dec 27, 2023

B is correct. C is wrong: a view doesn't keep a fresh cache.

Crocjun
Jan 4, 2024

Can anyone explain why D is not correct?

cryptoflam
Jan 5, 2024

Because primary and foreign key information is not enforced. "Primary and foreign keys are informational only and are not enforced", from: https://docs.databricks.com/en/tables/constraints.html#declare-primary-key-and-foreign-key-relationships

spaceexplorer (Option: B)
Jan 24, 2024

B is correct

PrashantTiwari (Option: B)
Feb 9, 2024

B is correct