Certified Associate Developer for Apache Spark Exam QuestionsBrowse all questions from this exam

Certified Associate Developer for Apache Spark Exam - Question 172


Which of the following operations will always return a new DataFrame with updated partitions from DataFrame storesDF by inducing a shuffle?

Show Answer
Correct Answer: C

The operation storesDF.repartition() will always return a new DataFrame with updated partitions from DataFrame storesDF by inducing a shuffle. This method is specifically designed to redistribute data across a different number of partitions, which requires shuffling the data.

Discussion

1 comment
Sign in to comment
Sowwy1Option: C
Apr 10, 2024

C. storesDF.repartition()