Exam: Certified Associate Developer for Apache Spark
Question 148

Which of the following operations is least likely to result in a shuffle?

    Correct Answer: B

    DataFrame.filter() is the least likely to result in a shuffle because it is a narrow transformation: the condition is evaluated row by row within each partition, so no data has to be redistributed. Operations such as join, orderBy, distinct, and intersect are wide transformations, which typically require moving rows across partitions.
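
The difference can be illustrated with a toy model in plain Python. This is only a sketch, not Spark code: the list-of-lists "partitions", the helper names `narrow_filter` and `wide_distinct`, and the modulo partitioner are all assumptions made for illustration. In Spark itself, a shuffle shows up as an `Exchange` operator in the physical plan printed by `DataFrame.explain()`.

```python
from collections import defaultdict

# Toy model (hypothetical, not the Spark API): a dataset is a list of
# partitions, and each partition is a list of rows.
partitions = [[1, 2, 3, 4], [3, 4, 5, 6], [5, 6, 7, 8]]


def narrow_filter(parts, predicate):
    """Filter every partition independently; no row leaves its partition."""
    return [[row for row in part if predicate(row)] for part in parts]


def wide_distinct(parts, num_partitions):
    """Deduplicate by repartitioning so that equal rows land together.

    Returns the new partitions and the number of rows that had to move
    to a different partition (the "shuffle" in this toy model).
    """
    shuffled = defaultdict(set)
    moved = 0
    for src, part in enumerate(parts):
        for row in part:
            # Modulo stands in for a hash partitioner (an assumption).
            dest = row % num_partitions
            if dest != src:
                moved += 1
            shuffled[dest].add(row)
    return [sorted(shuffled[i]) for i in range(num_partitions)], moved


filtered = narrow_filter(partitions, lambda x: x % 2 == 0)
deduped, rows_moved = wide_distinct(partitions, num_partitions=3)

print(filtered)             # even rows, still in their original partitions
print(deduped, rows_moved)  # duplicates removed only after rows were moved
```

The filter never looks outside a single partition, while the deduplication cannot be correct unless copies of the same value that start in different partitions are brought together first.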

Discussion
Sowwy1 (Option: B)

B. DataFrame.filter()