Certified Machine Learning Professional Exam - Question 45

Question

A machine learning engineer is using the following code block as part of a batch deployment pipeline:

Which of the following changes needs to be made so this code block will work when the inference table is a stream source?

Examice · Accepted Answer

To work with a stream source in Apache Spark, you need to use the 'spark.readStream' method instead of 'spark.read'. This ensures that the DataFrame is being read as a streaming DataFrame, which is necessary for handling streaming data sources such as a Delta table that is continuously receiving new data.

trendy01 · Answer

C. Replace spark.read with spark.readStream

hugodscarvalho · Answer

This change ensures that the DataFrame is being read as a streaming DataFrame from the stream source.

mozuca · Answer

Agree with trendy01

Certified Machine Learning Professional Exam - Question 45

Discussion