Certified Data Engineer Professional Exam QuestionsBrowse all questions from this exam

Certified Data Engineer Professional Exam - Question 51


Where in the Spark UI can one diagnose a performance problem induced by not leveraging predicate push-down?

Show Answer
Correct Answer: BE

Predicate push-down optimization reduces the amount of data read from the data source by applying filters early in the query process. To diagnose a performance problem caused by not leveraging predicate push-down, one should look at the size of the data read. The Stage’s Detail screen in the Completed Stages table shows the size of data read from the Input column, which would reveal if too much data is being read because predicate push-down is not being used.

Discussion

1 comment
Sign in to comment
P1314Option: E
Feb 7, 2024

Query plan. Correct is E