Exam: Certified Data Engineer Associate
Question 37

A data engineer has a single-task Job that runs each morning before they begin working. After identifying an upstream data issue, they need to set up another task to run a new notebook prior to the original task.

Which of the following approaches can the data engineer use to set up the new task?

    Correct Answer: C

    To run the new notebook before the original task, the data engineer can create a new task in the existing Job and then configure the original task to depend on the new task. The original task then starts only once the new task has completed, which ensures that the new notebook is executed first.
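As a rough illustration of the explanation above, the sketch below models the two tasks as plain Python dictionaries shaped like Jobs API 2.1 task settings (`task_key`, `notebook_task`, `depends_on`). The task keys and notebook paths are hypothetical, and `run_order` is a helper written only for this example, not part of any Databricks library. It simply derives an execution order from the `depends_on` entries.

```python
# Hypothetical task keys and notebook paths, for illustration only.
# The field names mirror the shape of Jobs API 2.1 task settings.
tasks = [
    {
        "task_key": "new_task",
        "notebook_task": {"notebook_path": "/Shared/fix_upstream_data"},
    },
    {
        "task_key": "original_task",
        "notebook_task": {"notebook_path": "/Shared/morning_report"},
        # The original task lists the new task as its dependency,
        # so it starts only after new_task has completed.
        "depends_on": [{"task_key": "new_task"}],
    },
]


def run_order(tasks):
    """Return task keys in an order that respects every depends_on entry."""
    remaining = {
        t["task_key"]: {d["task_key"] for d in t.get("depends_on", [])}
        for t in tasks
    }
    order = []
    while remaining:
        # A task is ready once all of its dependencies have already run.
        ready = sorted(k for k, deps in remaining.items() if deps <= set(order))
        if not ready:
            raise ValueError("dependency cycle detected")
        order.extend(ready)
        for key in ready:
            del remaining[key]
    return order


print(run_order(tasks))
```

The point of the sketch is the direction of the link: the `depends_on` entry sits on the task that must wait, and names the task that must run first.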

Discussion
Data_4ever (Option: B)

B is the right answer.

Redwings538 (Option: B)

It seems there is some confusion on what dependency means in this case. Option B is correct because adding the new task as a dependency of the original task means that the new task will run BEFORE the original task, which is the goal defined in the question.

Nika12 (Option: B)

Just got 100% on the test. B was correct.

Tinendra (Option: C)

Answer is C

ObeOne (Option: C)

"A data engineer has a single-task Job that runs each morning before they begin working. After identifying an upstream data issue, they need to set up another task to run a new notebook prior to the original task."

In the tasks UI of the Job:
1. Create a *new task*.
2. Select the *original task*.
3. In the *original task*, for "Depends on", enter the *new task*, as the *new task* needs to run prior to the *original task*, i.e., the original task has a dependency on the new task.

From 1: create new task. From 3: original task has a dependency on new task.

Answer is C: They can *create a new task* in the existing Job and then add the *original task as a dependency of the new task*.

Garyn (Option: C)

The data engineer can create a new task in the existing Job and then add the original task as a dependency of the new task (Option C). This way, the new task will run first, and once it’s completed, the original task will run. Here are the steps to do this:

1. Click Workflows in the sidebar, click New, and select Job. The Tasks tab appears with the create task dialog.
2. Replace "Add a name for your job…" with your job name.
3. Enter a name for the task in the Task name field.
4. In the Type drop-down menu, select the type of task to run.
5. Configure the cluster where the task runs.
6. To add dependent libraries, click + Add next to Dependent libraries.
7. You can pass parameters for your task.

Please note that the exact process may vary depending on the specific configurations and permissions set up in your workspace. It’s always a good idea to consult with your organization’s IT or data governance team to ensure the correct procedures are followed.

vctrhugo (Option: C)

C. They can create a new task in the existing Job and then add the original task as a dependency of the new task. To set up a new task that runs a new notebook prior to the original task in an existing Job, you can create a new task within the same Job and then set the original task as a dependency for the new task. This way, the new task will execute before the original task when the Job is triggered.

AndreFR

I disagree. "the original task as a dependency of the new task" means that the original task needs to run first.

kokosz (Option: B)

B is the right answer.

Mircuz (Option: C)

C, because the new task has to run prior to the original one.

awofalus (Option: C)

C is correct because the original task will run after the new one and therefore depends on it.

AndreFR

I disagree. "the original task as a dependency of the new task" means that the original task needs to run first.

ObeOne (Option: C)

C is correct

[Removed] (Option: B)

B is the right answer

benni_ale (Option: B)

original depends on new

Shaxxie (Option: C)

This has become more of an English grammar test, as the word "dependency" is confusing people. When the original task has a dependency on the new task, this means the original task needs to depend on the new task. So it's Option C.

nedlo (Option: B)

I am pretty sure it's B: "they need to set up another task to run a new notebook prior to the original task." So the NEW task needs to run BEFORE the ORIGINAL task. So the NEW TASK should be a DEPENDENCY of the ORIGINAL TASK (or in other words: the original task is dependent on the new task).

DavidRou (Option: B)

Right answer: B. We need to add the new task as a dependency of the original one because the question says that it needs to run before the original task.

kbaba101 (Option: C)

This is a grammar issue, not a Databricks issue: "add A as a dependency of B" means A must run before B.
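For anyone still unsure about the wording, here is a minimal sketch of the "A is a dependency of B" reading, using Python's standard-library `graphlib.TopologicalSorter` (Python 3.9+). The names `A` and `B` are just the placeholders from the comment above. This is generic dependency ordering, not Databricks code.

```python
from graphlib import TopologicalSorter

sorter = TopologicalSorter()
# add(node, *predecessors): B has A as a dependency,
# so A must be ordered before B.
sorter.add("B", "A")

order = list(sorter.static_order())
print(order)
```

Read this way, the task named as the dependency is always the one that runs first, and the task that "has" the dependency is the one that waits.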