Exam Certified Associate Developer for Apache Spark All QuestionsBrowse all questions from this exam
Question 126

Which of the following cluster configurations will induce the least network traffic during a shuffle operation?

Note: each configuration has roughly the same compute power using 100GB of RAM and 200 cores.

    Correct Answer: C

    Scenario 1 would induce the least network traffic during a shuffle operation because the computation is centralized within a single node. This avoids the need for data to be transferred across nodes, which typically happens during shuffle operations. Since all the data remains within one node, it eliminates network overhead, resulting in minimal network traffic.

Discussion
Sowwy1Option: C

Scenario #1 would likely induce the least network traffic during a shuffle operation because all the data would stay within the single node, avoiding any cross-node traffic. Therefore, the answer is: C. Scenario 1

SaiPavan10Option: C

C is the right choice