Certified Data Engineer Associate Exam - Question 37

Question

A data engineer has a single-task Job that runs each morning before they begin working. After identifying an upstream data issue, they need to set up another task to run a new notebook prior to the original task.

Which of the following approaches can the data engineer use to set up the new task?

Examice · Accepted Answer

To run the new notebook before the original task, the data engineer can create a new task in the existing Job and then configure the original task to depend on it. The original task then starts only after the new task has completed, so the new notebook always runs first.
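As a rough illustration of that configuration, here is a sketch of what a two-task job definition might look like as a Python dictionary. The field names (`task_key`, `depends_on`, `notebook_task`, `notebook_path`) are assumed to follow the Databricks Jobs API task format; the job name, task keys, notebook paths, and the `root_tasks` helper are hypothetical and only there for the example.

```python
# Sketch of a two-task job definition; names and paths are hypothetical.
job_settings = {
    "name": "morning_job",
    "tasks": [
        {
            "task_key": "new_task",
            "notebook_task": {"notebook_path": "/Repos/example/new_notebook"},
        },
        {
            "task_key": "original_task",
            "notebook_task": {"notebook_path": "/Repos/example/original_notebook"},
            # The original task waits for new_task to finish.
            "depends_on": [{"task_key": "new_task"}],
        },
    ],
}


def root_tasks(settings):
    """Return keys of tasks with no upstream dependency (they start first)."""
    return [t["task_key"] for t in settings["tasks"] if not t.get("depends_on")]


print(root_tasks(job_settings))
```

Only the task without a `depends_on` entry can start immediately, which in this sketch is the new task.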

Data_4ever · Answer

B is the right answer.

Redwings538 · Answer

It seems there is some confusion about what dependency means in this case. Option B is correct because adding the new task as a dependency of the original task means that the new task will run BEFORE the original task, which is the goal defined in the question.

Nika12 · Answer

Just got 100% on the test. B was correct.

ObeOne · Answer

"A data engineer has a single-task Job that runs each morning before they begin working. After identifying an upstream data issue, they need to set up another task to run a new notebook prior to the original task."

In the tasks UI of the Job:
1. Create a *new task*
2. Select *original task*
3. In *original task*, for "depends on" enter *new task*, as *new task* needs to run prior to *original task*, i.e., original task has a dependency on new task

From step 1: create new task. From step 3: original task has a dependency on new task.
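The three UI steps above can be mimicked on a plain list of task dictionaries. This is only a sketch: `add_upstream_task` is a hypothetical helper, not a Databricks function, and the `task_key` / `depends_on` field names are assumed to mirror the Jobs task format.

```python
import copy


def add_upstream_task(tasks, new_task, downstream_key):
    """Return a new task list where `downstream_key` depends on `new_task`."""
    updated = copy.deepcopy(tasks)
    # Step 1: create the new task.
    updated.append(copy.deepcopy(new_task))
    # Step 2: select the original task.
    original = next(t for t in updated if t["task_key"] == downstream_key)
    # Step 3: record the new task under the original task's "depends on".
    original.setdefault("depends_on", []).append({"task_key": new_task["task_key"]})
    return updated


single_task_job = [{"task_key": "original_task"}]
two_task_job = add_upstream_task(single_task_job, {"task_key": "new_task"}, "original_task")
for task in two_task_job:
    upstream = [d["task_key"] for d in task.get("depends_on", [])]
    print(task["task_key"], "depends on", upstream)
```

The dependency is stored on the original task, not on the new one, which matches step 3.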

Answer is C ... They can *create a new task* in the existing Job and then add the *original task as a dependency of the new task*.

Tinendra · Answer

Answer is C

vctrhugo · Answer

C. They can create a new task in the existing Job and then add the original task as a dependency of the new task.

To set up a new task that runs a new notebook prior to the original task in an existing Job, you can create a new task within the same Job and then set the original task as a dependency for the new task. This way, the new task will execute before the original task when the Job is triggered.

Garyn · Answer

The data engineer can create a new task in the existing Job and then add the original task as a dependency of the new task (Option C). This way, the new task will run first, and once it’s completed, the original task will run. Here are the steps to do this:

1. Click Workflows in the sidebar and click New and select Job. The Tasks tab appears with the create task dialog.
2. Replace Add a name for your job… with your job name.
3. Enter a name for the task in the Task name field.
4. In the Type drop-down menu, select the type of task to run.
5. Configure the cluster where the task runs.
6. To add dependent libraries, click + Add next to Dependent libraries.
7. You can pass parameters for your task.

Please note that the exact process may vary depending on the specific configurations and permissions set up in your workspace. It’s always a good idea to consult with your organization’s IT or data governance team to ensure the correct procedures are followed.

[Removed] · Answer

B is the right answer

ObeOne · Answer

C is correct

awofalus · Answer

C is correct because the original task will run after the new one and therefore depends on it.

Mircuz · Answer

C, because the new task has to run prior to the original one.

kokosz · Answer

B is the right answer.

kbaba101 · Answer

This is a grammar issue, not a Databricks issue:
Adding A as a dependency of B means A must run before B.
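The statement above can be checked with a small topological sort. This sketch uses Python's standard `graphlib` module (Python 3.9+), where each key maps to the set of tasks it depends on; `run_order` is a hypothetical helper and A and B are just the placeholder names from the post.

```python
from graphlib import TopologicalSorter


def run_order(depends_on):
    """Take a mapping of task -> tasks it depends on; return a valid run order."""
    return list(TopologicalSorter(depends_on).static_order())


# "Add A as a dependency of B": B depends on A, so A runs first.
print(run_order({"B": {"A"}}))  # ['A', 'B']

# The reverse wiring: A depends on B, so B runs first.
print(run_order({"A": {"B"}}))  # ['B', 'A']
```

Whichever task is listed as the dependency runs first, so the two wordings debated in this thread produce opposite run orders.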

DavidRou · Answer

Right answer: B
We need to add the new task as a dependency of the original one because the question says that it needs to be run before the original task.

nedlo · Answer

I am pretty sure it's B: "they need to set up another task to run a new notebook prior to the original task." So the NEW task needs to run BEFORE the ORIGINAL task, which means the NEW TASK should be a DEPENDENCY of the ORIGINAL TASK (in other words, the original task is dependent on the new task).

Shaxxie · Answer

This has become more of an English grammar test, as the word "dependency" is confusing people. When the original task has a dependency on the new task, this means the original task needs to depend on the new task. So it's Option C.

benni_ale · Answer

original depends on new
