Certified Data Engineer Associate Exam - Question 29


Which of the following describes the relationship between Bronze tables and raw data?

Correct Answer: E

Bronze tables contain raw data with a schema applied. Unlike raw data, which typically lacks structure, Bronze tables in a data lakehouse architecture provide a structured format by applying a schema to the raw data. This schema helps in organizing the data, making it more accessible and easier to work with for downstream processing.
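
As a rough illustration of what "raw data with a schema applied" can mean, the sketch below uses plain Python rather than any Databricks or Spark API. The sample records, the column names, and the `to_bronze` helper are all hypothetical. The point is only that the values stay untouched while each record gains named columns.

```python
import csv
import io

# Hypothetical raw file content: delimited text with no header row.
RAW_TEXT = "1001,2023-04-01,19.99\n1002,2023-04-02,5.00\n"

# Hypothetical Bronze schema: column names only, in source order.
BRONZE_COLUMNS = ["order_id", "order_date", "amount"]


def to_bronze(raw_text, columns):
    """Attach column names to each raw record without changing its values."""
    reader = csv.reader(io.StringIO(raw_text))
    return [dict(zip(columns, record)) for record in reader]


bronze_rows = to_bronze(RAW_TEXT, BRONZE_COLUMNS)
for row in bronze_rows:
    print(row)
```

Each printed row holds the same values as the raw line it came from, now addressable by column name, which is what makes the data easier to query downstream.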

Discussion

11 comments
XiltroX - Option: E
Apr 1, 2023

Bronze tables are basically raw ingested data, often with a schema borrowed from the original data source or table. The correct answer is E.

surrabhi_4 - Option: E
Apr 3, 2023

Option E

akk_1289 - Option: E
Jul 21, 2023

Ans: E. The Bronze layer is where we land all the data from external source systems. The table structures in this layer correspond to the source system table structures "as-is," along with any additional metadata columns that capture the load date/time, process ID, etc. The focus in this layer is quick Change Data Capture and the ability to provide an historical archive of source (cold storage), data lineage, auditability, reprocessing if needed without rereading the data from the source system.
Source: https://www.databricks.com/glossary/medallion-architecture#:~:text=Bronze%20layer%20%28raw%20data%29
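
The "as-is plus metadata columns" idea in the quote above could be sketched roughly as follows, in plain Python with no Databricks API involved. The `_load_ts` and `_process_id` column names, the `add_load_metadata` helper, and the sample rows are assumptions made for illustration only.

```python
from datetime import datetime, timezone


def add_load_metadata(source_rows, process_id, load_time=None):
    """Return copies of the source rows with audit columns appended."""
    load_time = load_time or datetime.now(timezone.utc)
    stamp = load_time.isoformat()
    return [
        # Source columns are copied unchanged; only audit columns are added.
        {**row, "_load_ts": stamp, "_process_id": process_id}
        for row in source_rows
    ]


# Hypothetical rows as they arrive from a source system table.
source_rows = [
    {"customer_id": "42", "name": "Ada"},
    {"customer_id": "43", "name": "Grace"},
]

# A fixed timestamp keeps the example reproducible.
fixed_time = datetime(2023, 7, 21, 12, 0, tzinfo=timezone.utc)
landed = add_load_metadata(source_rows, process_id="job-001", load_time=fixed_time)
for row in landed:
    print(row)
```

Because the source columns are left alone, the landed rows can be reprocessed later without going back to the source system, and the audit columns record when and by which run each row arrived.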

rafahb - Option: E
Apr 4, 2023

E option

vctrhugo - Option: E
Sep 3, 2023

E. Bronze tables contain raw data with a schema applied. In a typical data processing pipeline following a "Bronze-Silver-Gold" data lakehouse architecture, Bronze tables are the initial stage where raw data is ingested and transformed into a structured format with a schema applied. The schema provides structure and meaning to the raw data, making it more usable and accessible for downstream processing. Therefore, Bronze tables contain the raw data but in a structured and schema-enforced format, which makes them distinct from the unprocessed, unstructured raw data files.

DavidRou - Option: E
Oct 31, 2023

E is the right answer. Bronze data are simply a more structured (in terms of schema) version of raw data to be found in the "landing area".

SerGrey - Option: E
Jan 4, 2024

Correct is E

Atnafu - Option: E
Jul 8, 2023

E. Bronze tables are the foundation of the medallion architecture on Delta Lake. They are created from raw data files and carry a schema that describes the data, which makes them easy to query and analyze. Raw data files, on the other hand, do not have a schema applied, so they can be difficult to query and analyze.
Option A: Bronze tables typically contain more data than raw data files, because they include the schema.
Option B: There is no indication that Bronze tables contain more truthful data than raw data.
Option C: Bronze tables can contain aggregates, but they do not have to.
Option D: Bronze tables typically contain a more refined view of data than raw data, because they include the schema.

Atnafu
Jul 8, 2023

Sorry this is meant to be on question #30

Atnafu
Jul 8, 2023

never mind :)

akk_1289 - Option: E
Jul 21, 2023

Ans: E https://www.databricks.com/glossary/medallion-architecture#:~:text=Bronze%20layer%20%28raw%20data%29

awofalus - Option: E
Nov 7, 2023

E is correct

benni_ale - Option: E
Apr 28, 2024

I am still not sure about the schema, as I thought that correct types are usually defined in Silver, while in Bronze all columns are strings.
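
One possible way to reconcile the two views: a Bronze table whose columns are all strings still has a schema (named columns), and the typed schema can be applied in a later step. The sketch below is plain Python under that assumption, not a Databricks API. The column names, the `SILVER_CASTS` mapping, and the `to_silver` helper are hypothetical.

```python
from datetime import date
from decimal import Decimal

# Hypothetical Bronze row: named columns, every value kept as the source string.
bronze_row = {"order_id": "1001", "order_date": "2023-04-01", "amount": "19.99"}

# Hypothetical Silver casts: one conversion function per column.
SILVER_CASTS = {
    "order_id": int,
    "order_date": date.fromisoformat,
    "amount": Decimal,
}


def to_silver(row, casts):
    """Apply the per-column cast to each string value of a Bronze row."""
    return {column: casts[column](value) for column, value in row.items()}


silver_row = to_silver(bronze_row, SILVER_CASTS)
print(bronze_row)
print(silver_row)
```

Under this reading, option E still holds for an all-string Bronze table: the schema there names the columns, and the stricter types arrive in Silver.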