Certified Data Engineer Professional Exam QuestionsBrowse all questions from this exam

Certified Data Engineer Professional Exam - Question 16


A table is registered with the following code:

Both users and orders are Delta Lake tables. Which statement describes the results of querying recent_orders?

Show Answer
Correct Answer: BD

When using a CREATE TABLE AS SELECT (CTAS) statement in Delta Lake, the logic executes at the time the table is defined. The result of the join operation is stored in the Databricks File System (DBFS). This means that the data resulting from the join operation at the time the table was created is stored, and this stored data will be returned whenever the table is queried. The CTAS statement does not recompute the join at query time. Instead, it queries the already stored result set.

Discussion

16 comments
Sign in to comment
asmayassinegOption: B
Aug 2, 2023

Correct answer is B. table is created and data of join will be stored on DBFS and it will be returned on query time

BrianNguyen95Option: B
Aug 17, 2023

correct answer is B

robson90Option: D
Aug 23, 2023

Aa ok, I missed "logic will execute at query time" ignore my previous comment

lucasasterioOption: B
Sep 10, 2023

correct is B

BIKRAM063Option: B
Nov 2, 2023

Correct is B . CTAS command

robson90Option: D
Aug 23, 2023

Why not D? Table does not need to be stored on DBFS if using Unity Catalog. At least that's my understanding https://docs.databricks.com/en/dbfs/unity-catalog.html

SantitoxicOption: D
Sep 22, 2023

Based on typical Delta Lake behavior, option D is the most accurate description. Delta Lake queries generally execute at query time and retur n results based on the state of the source tables at the time the query began. Delta Lake provides features for managing data versions and transactions, but it doesn't precompute and store results like option B or cache results like option C.

Starvosxant
Oct 9, 2023

No. Simple no. When you execute a create table even with a Join you DONT see the results imediatly unless you query the table. So correct answer is B. The create table statement by default creates a Managed table, which is stored in DBFS.

Naveenkm
Nov 27, 2023

Its create separate delta lake transaction logs for this new table. So B is the correct the answer.

sturcuOption: B
Oct 11, 2023

Creating a table will not display results. You need to make a select alter it is created.

aragorn_bregoOption: B
Nov 21, 2023

The correct answer is: B. All logic will execute when the table is defined and store the result of joining tables to the DBFS; this stored data will be returned when the table is queried. When the CREATE TABLE AS statement is executed, it runs the enclosed SELECT statement immediately to pull the current data from the users and orders tables where the order_date is within the last 7 days. This result is then stored as a new table called recent_orders in the Delta Lake on the DBFS (Databricks File System). Subsequent queries against recent_orders will return this stored data, and not recompute the join unless the table is updated or refreshed.

cryptoflamOption: B
Jan 5, 2024

Key here is that option D says "returned". The CTAS statement does not return results, thus option B is correct.

IWantCertsOption: B
Jan 9, 2024

B is correct. Views compute when query is executed, not when defined. And vice versa for tables.

kz_dataOption: B
Jan 10, 2024

I think B is the correct answer

PrashantTiwariOption: B
Feb 9, 2024

B is correct

coercionOption: B
May 19, 2024

"Create Table" is an action so "B"

imatheushenriqueOption: B
Jun 1, 2024

B. All logic will execute when the table is defined and store the result of joining tables to the DBFS; this stored data will be returned when the table is queried.

Isio05Option: B
Jun 8, 2024

CTAS statements persist it results, so B