Exam Certified Associate Developer for Apache Spark All QuestionsBrowse all questions from this exam
Question 61

Which of the following storage levels should be used to store as much data as possible in memory on two cluster nodes while storing any data that does not fit in memory on disk to be read in when needed?

    Correct Answer: D

    MEMORY_AND_DISK_2 is the correct storage level to store as much data as possible in memory on two cluster nodes. This option ensures that data will be stored in memory across the two cluster nodes first. If the data cannot fit in memory, it will then be stored on disk to be read in when needed. This provides a balance of in-memory performance and disk storage fallback.

Discussion
newusernameOption: D

Correct is D

juadavesOption: B

B https://stackoverflow.com/questions/30520428/what-is-the-difference-between-memory-only-and-memory-and-disk-caching-level-in

newusername

not sure how you came to the conclusion of B after that article, but it states clearly in the question "on two cluster nodes" ! So, the answer is D