Certified Associate Developer for Apache Spark Exam QuestionsBrowse all questions from this exam

Certified Associate Developer for Apache Spark Exam - Question 10


Which of the following DataFrame operations is classified as a wide transformation?

Show Answer
Correct Answer: B

A wide transformation in the context of DataFrame operations involves shuffling or redistributing data across partitions, typically requiring data movement across the network. DataFrame.join() is classified as a wide transformation because it involves combining two DataFrames based on a common key column, which often necessitates shuffling and redistributing the data between partitions.

Discussion

3 comments
Sign in to comment
4be8126Option: B
May 3, 2023

B. DataFrame.join() is classified as a wide transformation, as it shuffles the data across the network to perform the join operation.

SonicBoom10C9Option: B
May 15, 2023

None of the other options are wide transformations, they are narrow (logically, they modify the length of a dataframe). Only a join can force shuffling of data between horizontally scaled partitions.

TmDataOption: B
Jun 17, 2023

The DataFrame operation classified as a wide transformation is: B. DataFrame.join() Explanation: In Spark, transformations are operations on DataFrames that create a new DataFrame from an existing one. Wide transformations involve shuffling or redistributing data across partitions and typically require data movement across the network. Among the options provided, DataFrame.join() is a wide transformation because it involves combining two DataFrames based on a common key column, which often requires shuffling and redistributing the data across partitions.