Certified Data Engineer Associate Exam QuestionsBrowse all questions from this exam

Certified Data Engineer Associate Exam - Question 5


Which of the following describes the storage organization of a Delta table?

Show Answer
Correct Answer: C

Delta tables are stored in a collection of files that contain data, history, metadata, and other attributes. This organization allows Delta tables to maintain a structured manner of storing data using formats like Parquet files for the actual data, while also keeping track of metadata and transaction logs in separate directories. This structure supports features such as versioning, transaction management, and metadata tracking, which are essential for Delta tables to provide ACID transactions and other advanced functionalities.

Discussion

14 comments
Sign in to comment
Data_4everOption: C
Apr 7, 2023

C is the right option

prasiosoOption: C
May 12, 2023

First selected D as I assumed the data to be stored in the Delta lake and the transaction log to be stored separately. However, documentation states when a user creates a Delta Lake table, that table’s transaction log is automatically created in the _delta_log subdirectory. The deltalog contains multiple files hence a collection of files. Answer C.

vctrhugoOption: C
Sep 3, 2023

C. Delta tables are stored in a collection of files that contain data, history, metadata, and other attributes. Delta tables store data in a structured manner using Parquet files, and they also maintain metadata and transaction logs in separate directories. This organization allows for versioning, transactional capabilities, and metadata tracking in Delta Lake. Thank you for pointing out the error, and I appreciate your understanding.

benni_aleOption: C
Apr 3, 2024

GPT4: Delta tables in Databricks use: Parquet format files for data storage. A _delta_log folder for JSON log files that track transactions. Scheme enforcement in metadata to ensure consistency. Checkpoint files to speed up the rebuilding of the table state.

XiltroXOption: C
Apr 1, 2023

C is correct answer https://docs.delta.io/latest/delta-faq.html#:~:text=Delta%20Lake%20uses%20versioned%20Parquet,directory%20to%20provide%20ACID%20transactions.

AtnafuOption: C
Jul 8, 2023

C Delta tables in Databricks Delta Lake are stored in a collection of files organized in a directory structure. This directory structure includes data files, transaction log files, and metadata files. These files are stored in a specified location, typically in a distributed file system such as Hadoop Distributed File System (HDFS) or Amazon S3.

andie123Option: C
Aug 16, 2023

C is the right answer

Sriramiyer92
Oct 1, 2023

Reading Material: 5 reasons to choose Delta format (on Databricks) https://medium.com/datalex/5-reasons-to-use-delta-lake-format-on-databricks-d9e76cf3e77d

kniveszOption: C
Apr 3, 2023

C , respuesta correcta

KalavathiPOption: C
Sep 26, 2023

Correct ans C

VijayKulaOption: C
Oct 9, 2023

Answer is C

SerGreyOption: C
Dec 24, 2023

C is correct

ItmmaOption: C
Mar 19, 2024

C is correct

mascarenhaslucasOption: C
Jun 9, 2024

The answer is C!