Which of the following operations is least likely to result in a shuffle?
Which of the following operations is least likely to result in a shuffle?
The operation DataFrame.filter() is least likely to result in a shuffle because it simply applies a condition to filter the rows within each partition. This operation doesn't require redistributing data across the partitions, unlike operations like join, orderBy, distinct, or intersect, which typically require data movement across partitions.
B. DataFrame.fliter()