Certified Data Engineer Associate Exam - Question 37


A data engineer has a single-task Job that runs each morning before they begin working. After identifying an upstream data issue, they need to set up another task to run a new notebook prior to the original task.

Which of the following approaches can the data engineer use to set up the new task?

Correct Answer: CE

Because of the upstream data issue, the new notebook must run before the original task. The data engineer can create a new task in the existing Job and then configure the original task to depend on it. The original task then runs only once the new task has completed, so the new notebook always executes first.
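The dependency described above can be sketched as a job definition. The field names `task_key`, `depends_on`, and `notebook_task` follow the shape of the Databricks Jobs API 2.1 task settings; the job name, task keys, and notebook paths are hypothetical, and `run_order` is an illustrative helper that only resolves the order locally, not a Databricks call.

```python
from graphlib import TopologicalSorter

# Hypothetical job name, task keys, and notebook paths, for illustration only.
job_settings = {
    "name": "morning_job",
    "tasks": [
        {
            "task_key": "new_task",
            "notebook_task": {"notebook_path": "/Workspace/example/new_notebook"},
        },
        {
            "task_key": "original_task",
            "notebook_task": {"notebook_path": "/Workspace/example/original_notebook"},
            # The original task lists the new task as its dependency,
            # so it starts only after new_task has finished.
            "depends_on": [{"task_key": "new_task"}],
        },
    ],
}


def run_order(settings):
    """Return task keys in an order that respects every depends_on entry."""
    graph = {
        task["task_key"]: {dep["task_key"] for dep in task.get("depends_on", [])}
        for task in settings["tasks"]
    }
    return list(TopologicalSorter(graph).static_order())


print(run_order(job_settings))  # ['new_task', 'original_task']
```

The `depends_on` entry sits on the task that has to wait, which is why the original task carries it rather than the new one.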

Discussion

17 comments
Data_4ever (Option: B)
Apr 7, 2023

B is the right answer.

Redwings538 (Option: B)
Apr 14, 2023

It seems there is some confusion on what dependency means in this case. Option B is correct because adding the new task as a dependency of the original task means that the new task will run BEFORE the original task, which is the goal defined in the question.

Nika12 (Option: B)
Jan 27, 2024

Just got 100% on the test. B was correct.

ObeOne (Option: C)
Nov 8, 2023

"A data engineer has a single-task Job that runs each morning before they begin working. After identifying an upstream data issue, they need to set up another task to run a new notebook prior to the original task."

In the tasks UI of the Job:
1. Create a *new task*.
2. Select the *original task*.
3. In the *original task*, for "depends on", enter the *new task*, as the *new task* needs to run prior to the *original task*, i.e., the original task has a dependency on the new task.

From 1: create new task. From 3: original task has a dependency on new task.

Answer is C: they can *create a new task* in the existing Job and then add the *original task as a dependency of the new task*.

Tinendra (Option: C)
Dec 27, 2023

Answer is C

vctrhugo (Option: C)
Sep 4, 2023

C. They can create a new task in the existing Job and then add the original task as a dependency of the new task. To set up a new task that runs a new notebook prior to the original task in an existing Job, you can create a new task within the same Job and then set the original task as a dependency for the new task. This way, the new task will execute before the original task when the Job is triggered.

AndreFR
Dec 20, 2023

I disagree. "the original task as a dependency of the new task" means that the original task needs to run first.

Garyn (Option: C)
Dec 30, 2023

The data engineer can create a new task in the existing Job and then add the original task as a dependency of the new task (Option C). This way, the new task will run first, and once it’s completed, the original task will run. Here are the steps to do this:

1. Click Workflows in the sidebar, click New, and select Job. The Tasks tab appears with the create task dialog.
2. Replace "Add a name for your job…" with your job name.
3. Enter a name for the task in the Task name field.
4. In the Type drop-down menu, select the type of task to run.
5. Configure the cluster where the task runs.
6. To add dependent libraries, click + Add next to Dependent libraries.
7. You can pass parameters for your task.

Please note that the exact process may vary depending on the specific configurations and permissions set up in your workspace. It’s always a good idea to consult with your organization’s IT or data governance team to ensure the correct procedures are followed.

[Removed] (Option: B)
Aug 29, 2023

B is the right answer

ObeOne (Option: C)
Oct 31, 2023

C is correct

awofalus (Option: C)
Nov 7, 2023

C is correct because the original task will run after the new one and therefore depends on it.

AndreFR
Dec 20, 2023

I disagree. "the original task as a dependency of the new task" means that the original task needs to run first.

Mircuz (Option: C)
Mar 5, 2024

C, because the new task has to run prior to the original one.

kokosz (Option: B)
May 23, 2024

B is the right answer.

kbaba101 (Option: C)
Oct 20, 2023

This is a grammar issue, not a Databricks issue: "add A as a dependency of B" means A must run before B.

DavidRou (Option: B)
Oct 31, 2023

Right answer: B. We need to add the new task as a dependency of the original one because the question says that it needs to run before the original task.

nedlo (Option: B)
Dec 12, 2023

I am pretty sure it's B: "they need to set up another task to run a new notebook prior to the original task." So the NEW task needs to run BEFORE the ORIGINAL task, which means the NEW TASK should be a DEPENDENCY of the ORIGINAL TASK (or in other words: the original task is dependent on the new task).
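The two readings of "dependency" debated in this thread can be compared with a small sketch. It assumes the usual convention that a task's dependencies must finish before the task itself starts; the task names and the `order` helper are hypothetical, and the ordering is resolved with Python's standard `graphlib`, not with Databricks.

```python
from graphlib import TopologicalSorter


def order(depends_on):
    """depends_on maps each task to the set of tasks that must finish before it."""
    return list(TopologicalSorter(depends_on).static_order())


# Reading 1: the new task is a dependency of the original task.
new_is_dependency_of_original = {"original": {"new"}, "new": set()}

# Reading 2: the original task is a dependency of the new task.
original_is_dependency_of_new = {"new": {"original"}, "original": set()}

print(order(new_is_dependency_of_original))  # ['new', 'original']
print(order(original_is_dependency_of_new))  # ['original', 'new']
```

Only the first reading runs the new task before the original one, which is the order the question asks for.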

Shaxxie (Option: C)
Jan 18, 2024

This has become more of an English grammar test, as the word "dependency" is confusing people. When the original task has a dependency on the new task, this means the original task needs to depend on the new task. So it's Option C.

benni_ale (Option: B)
Apr 29, 2024

original depends on new