Certified Data Engineer Professional Exam Questions

Certified Data Engineer Professional Exam - Question 23


Which statement characterizes the general programming model used by Spark Structured Streaming?

Correct Answer: D

Structured Streaming models new data arriving in a data stream as new rows appended to an unbounded table. Because the input is treated as an append-only table, a streaming computation can be written like a batch query over a static table, and Spark runs it incrementally as new rows arrive.
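The unbounded-table idea can be illustrated with a toy model in plain Python. This is only a sketch of the concept, not the Spark API: the `UnboundedTable` class, `append_batch`, and `batch_count` are hypothetical names made up for illustration. It shows that updating a running result from only the newly appended rows gives the same answer as re-running a batch query over every row received so far.

```python
from collections import Counter
from typing import Iterable, List


class UnboundedTable:
    """Toy stand-in for the unbounded input table (hypothetical, not Spark).

    Rows are only ever appended, never updated or deleted.
    """

    def __init__(self) -> None:
        self.rows: List[str] = []
        self.counts: Counter = Counter()  # running result of the "query"

    def append_batch(self, new_rows: Iterable[str]) -> None:
        new_rows = list(new_rows)
        self.rows.extend(new_rows)    # new stream data = new rows appended
        self.counts.update(new_rows)  # incremental step: touch only new rows


def batch_count(rows: Iterable[str]) -> Counter:
    """The same query written batch-style over the whole table."""
    return Counter(rows)


table = UnboundedTable()
for micro_batch in (["a", "b"], ["a"], ["c", "a"]):
    table.append_batch(micro_batch)
    # After every append, the incremental result matches a full batch query.
    assert table.counts == batch_count(table.rows)
```

In Spark itself the engine handles the incremental bookkeeping; the user only writes the batch-style query.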

Discussion

5 comments
sturcu (Option: D)
Oct 11, 2023

Correct. Structured Streaming should be thought of as a table to which new rows are continuously appended.

mardigras (Option: D)
Feb 28, 2024

Yes, the answer is D.

8605246 (Option: D)
Aug 6, 2023

Correct. The key idea in Structured Streaming is to treat a live data stream as a table that is being continuously appended. This leads to a new stream processing model that is very similar to a batch processing model. You express your streaming computation as a standard batch-like query, as on a static table, and Spark runs it as an incremental query on the unbounded input table. Source: https://spark.apache.org/docs/latest/structured-streaming-programming-guide.html

Jay_98_11 (Option: D)
Jan 13, 2024

vote for D

imatheushenrique (Option: D)
Jun 1, 2024

D. Structured Streaming models new data arriving in a data stream as new rows appended to an unbounded table.