Certified Associate Developer for Apache Spark Exam Questions

Certified Associate Developer for Apache Spark Exam - Question 61


Which of the following storage levels should be used to store as much data as possible in memory on two cluster nodes while storing any data that does not fit in memory on disk to be read in when needed?

Correct Answer: D

MEMORY_AND_DISK_2 is the correct storage level. It keeps as much data as possible in memory and writes any partitions that do not fit in memory to disk, where they are read back when needed. The _2 suffix means each partition is replicated on two cluster nodes, which is what distinguishes it from plain MEMORY_AND_DISK. The result is in-memory performance with a disk fallback, plus a second copy of the data on another node.
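
The reasoning above can be sketched as a small lookup: each storage level name encodes whether memory is used, whether disk is used, and how many nodes hold a copy. The `LevelModel` class, the `LEVELS` table, and `pick_level` below are hypothetical names made up for illustration; this is a simplified model of the naming scheme, not Spark's own `StorageLevel` class, and it ignores other properties such as serialization and off-heap storage.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class LevelModel:
    """Simplified, hypothetical stand-in for a Spark storage level."""
    use_memory: bool
    use_disk: bool
    replication: int  # number of cluster nodes holding each partition


# The keys are real Spark storage level names; the values are this sketch's
# simplified reading of what each name means.
LEVELS = {
    "MEMORY_ONLY": LevelModel(True, False, 1),
    "MEMORY_ONLY_2": LevelModel(True, False, 2),
    "MEMORY_AND_DISK": LevelModel(True, True, 1),
    "MEMORY_AND_DISK_2": LevelModel(True, True, 2),
    "DISK_ONLY": LevelModel(False, True, 1),
    "DISK_ONLY_2": LevelModel(False, True, 2),
}


def pick_level(use_memory: bool, use_disk: bool, replication: int) -> str:
    """Return the name of the level whose flags match the requirements."""
    wanted = LevelModel(use_memory, use_disk, replication)
    matches = [name for name, level in LEVELS.items() if level == wanted]
    if len(matches) != 1:
        raise ValueError(f"no unique level for {wanted}")
    return matches[0]


# The question asks for: memory first, disk fallback, two cluster nodes.
chosen = pick_level(use_memory=True, use_disk=True, replication=2)
print(chosen)
```

In a PySpark job the chosen level would typically be applied with something like `df.persist(StorageLevel.MEMORY_AND_DISK_2)` after `from pyspark import StorageLevel`, where `df` is an assumed existing DataFrame.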

Discussion

2 comments
juadaves (Option: B)
Oct 18, 2023

B https://stackoverflow.com/questions/30520428/what-is-the-difference-between-memory-only-and-memory-and-disk-caching-level-in

newusername
Nov 7, 2023

Not sure how you came to the conclusion of B after that article, but the question clearly states "on two cluster nodes", so the answer is D.

newusername (Option: D)
Nov 7, 2023

Correct is D