Certified Data Engineer Associate Exam Questions

Certified Data Engineer Associate Exam - Question 52


In which of the following file formats is data from Delta Lake tables primarily stored?

Correct Answer: C

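Data from Delta Lake tables is primarily stored in the Parquet file format. Parquet is a columnar format that compresses well and lets query engines read only the columns a query needs and skip row groups using file-level statistics, which generally makes it much faster to query than row-oriented text formats like CSV. Delta Lake keeps a transaction log alongside these Parquet files to track commits to the table.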

Discussion

7 comments
kishore1980 Option: C
Oct 30, 2023

Parquet format is correct

meow_akk Option: C
Oct 22, 2023

I think the data in Delta Lake tables is stored in Parquet format, while the table format itself is called Delta, which is confusing. Some notes from the Delta Lake FAQ: "What format does Delta Lake use to store data? Delta Lake uses versioned Parquet files to store your data in your cloud storage. Apart from the versions, Delta Lake also stores a transaction log to keep track of all the commits made to the table or blob store directory to provide ACID transactions." https://docs.delta.io/latest/delta-faq.html
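
To make the split between data files and the transaction log concrete, here is a minimal sketch that walks a table directory on a local filesystem and counts files by role. It assumes the usual layout, with Parquet data files in the table directory and JSON commit files under `_delta_log`. The function name `summarize_delta_dir` and the category labels are made up for illustration and are not part of any Delta Lake API.

```python
import os
from collections import Counter


def summarize_delta_dir(table_path):
    """Count files under a Delta table directory by role.

    Hypothetical helper: roles are inferred only from location and
    file extension, not by reading the transaction log.
    """
    counts = Counter()
    for root, _dirs, files in os.walk(table_path):
        # Path components of this directory relative to the table root.
        rel_parts = os.path.relpath(root, table_path).split(os.sep)
        in_log = "_delta_log" in rel_parts
        for name in files:
            ext = os.path.splitext(name)[1].lower()
            if in_log and ext == ".json":
                counts["log_json"] += 1      # commit files in the transaction log
            elif in_log:
                counts["log_other"] += 1     # e.g. checkpoints or checksums
            elif ext == ".parquet":
                counts["data_parquet"] += 1  # table data files
            else:
                counts["other"] += 1
    return dict(counts)
```

Calling `summarize_delta_dir("/path/to/table")` on a typical table should report mostly `data_parquet` entries plus some `log_json` entries, which matches the FAQ's description of Parquet files plus a transaction log.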

SD5713 Option: B
Oct 24, 2023

parquet format

kishanu
Oct 28, 2023

Buddy, it should be Parquet, hence C

kishanu Option: C
Oct 28, 2023

Parquet it is

nedlo Option: C
Dec 5, 2023

Parquet format, because it's a columnar format and a much faster alternative to CSV; for example, it supports partition pruning. There is no such file format as "Delta".

azure_bimonster Option: C
Jan 20, 2024

Parquet

benni_ale Option: C
Apr 29, 2024

Parquet for data and JSON for metadata