Certified Data Engineer Professional Exam QuestionsBrowse all questions from this exam

Certified Data Engineer Professional Exam - Question 3


When scheduling Structured Streaming jobs for production, which configuration automatically recovers from query failures and keeps costs low?

Show Answer
Correct Answer: D

To schedule Structured Streaming jobs for production with an automatic recovery from query failures while keeping costs low, you should use a new job cluster to ensure an isolated and streamlined environment, set retries to unlimited to handle and recover from any failures automatically, and limit maximum concurrent runs to 1 to avoid resource contention and ensure only one instance of the query runs at a time. This configuration efficiently manages resources and ensures the job's reliability and cost-effectiveness.

Discussion

8 comments
Sign in to comment
8605246Option: D
Aug 5, 2023

the answer given is correct: Maximum concurrent runs: Set to 1. There must be only one instance of each query concurrently active. Retries: Set to Unlimited. https://docs.databricks.com/en/structured-streaming/query-recovery.html

sturcuOption: D
Oct 16, 2023

D is correct

kz_dataOption: D
Dec 21, 2023

D is correct

Jay_98_11Option: D
Jan 13, 2024

D is correct

AziLaOption: D
Jan 21, 2024

Correct Ans is D

juliom6Option: D
Apr 8, 2024

D is correct https://docs.databricks.com/en/structured-streaming/query-recovery.html

imatheushenriqueOption: D
Jun 1, 2024

D. Cluster: New Job Cluster; Retries: Unlimited; Maximum Concurrent Runs: 1

imatheushenriqueOption: D
Jun 5, 2024

D. Cluster: New Job Cluster; Retries: Unlimited; Maximum Concurrent Runs: 1