Certified Associate Developer for Apache Spark Exam QuestionsBrowse all questions from this exam

Certified Associate Developer for Apache Spark Exam - Question 126


Which of the following cluster configurations will induce the least network traffic during a shuffle operation?

Note: each configuration has roughly the same compute power using 100GB of RAM and 200 cores.

Show Answer
Correct Answer: C

Scenario 1 would induce the least network traffic during a shuffle operation because the computation is centralized within a single node. This avoids the need for data to be transferred across nodes, which typically happens during shuffle operations. Since all the data remains within one node, it eliminates network overhead, resulting in minimal network traffic.

Discussion

2 comments
Sign in to comment
SaiPavan10Option: C
Apr 4, 2024

C is the right choice

Sowwy1Option: C
Apr 9, 2024

Scenario #1 would likely induce the least network traffic during a shuffle operation because all the data would stay within the single node, avoiding any cross-node traffic. Therefore, the answer is: C. Scenario 1