Certified Data Engineer Professional Exam - Question 1

Question

An upstream system has been configured to pass the date for a given batch of data to the Databricks Jobs API as a parameter. The notebook to be scheduled will use this parameter to load data with the following code: df = spark.read.format("parquet").load(f"/mnt/source/(date)")

Which code block should be used to create the date Python variable used in the above code block?

Examice · Accepted Answer

To create the date variable for use in the notebook, the code block should interact with the Databricks Jobs API to retrieve the parameter that has been passed by the upstream system. The appropriate method for this purpose is to use dbutils.notebooks.getParam, which is designed to retrieve parameters passed to the notebook from external sources like the Databricks Jobs API. This ensures that the value of 'date' is accurately obtained from the API, aligning with the requirements of the given scenario.

BrianNguyen95 · Answer

E is correct answer

hal2401me · Answer

dbutils.widget.
Just passed the exam with score >80%.
examtopics covers about 90% of questions. there were 5 questions I didn't see here in examtopics.
But friends, you need to look at the discussions, and do test yourself.
many answers provided here, even most voted answer, does NOT exists anymore in the exam - not the question, but the answer.
Wish you all good luck, friends!

HappyPrince · Answer

Correct

ThunderToes18 · Answer

Few questions on the exam... 9/29, wait for an update or more iterations...

kz_data · Answer

E is correct

RafaelCFC · Answer

In https://docs.databricks.com/en/notebooks/notebook-workflows.html#dbutilsnotebook-api the "run Example" is an equivalent use-case as E.

Jay_98_11 · Answer

E is correct

lokvamsi · Answer

Correct. Ans: E

chokthewa · Answer

I think D is correct answer, refer to https://docs.databricks.com/en/notebooks/notebook-workflows.html#dbutilsnotebook-api

AziLa · Answer

Correct Ans is E

hal2401me · Answer

vote for E
dbutils.widget

Sosicha · Answer

Are you reading the question? It asks about an upstream system that has been configured to pass the date for a given batch of data to the Databricks Jobs API as a parameter. Upstream system usually don't use widgets. Widgets they are made for humans. Only C and D are correct but D is better so D.

AziLa · Answer

correct ans is E

imatheushenrique · Answer

E.
E. dbutils.widgets.text("date", "null")
date = dbutils.widgets.get("date")

Isio05 · Answer

Widgets are used to create parameters in notebook that can be then utilized by e.g. jobs

Shailly · Answer

Answer is E.
Even though the value is passed from a upstream system, you can create parameters using widgets inside notebook and use the value as an input from the databricks jobs API.

Certified Data Engineer Professional Exam - Question 1

Discussion