Certified Associate Developer for Apache Spark Exam - Question 171

Question

Which of the following code blocks attempts to cache the partitions of DataFrame storesDF only in Spark’s memory?

Examice · Accepted Answer

To cache the partitions of DataFrame storesDF only in Spark's memory, call the persist method with the StorageLevel.MEMORY_ONLY storage level. The correct code block is therefore storesDF.persist(StorageLevel.MEMORY_ONLY).count(). Because persist is lazy, the trailing count() is the action that actually computes the partitions and caches them. With MEMORY_ONLY, nothing is written to disk: partitions that do not fit in memory are simply not cached and are recomputed when needed.

Sowwy1 · Answer

D. storesDF.persist(StorageLevel.MEMORY_ONLY).count()

Discussion