Certified Data Engineer Professional Exam QuestionsBrowse all questions from this exam

Certified Data Engineer Professional Exam - Question 137


Which configuration parameter directly affects the size of a spark-partition upon ingestion of data into Spark?

Show Answer
Correct Answer: A

The configuration parameter 'spark.sql.files.maxPartitionBytes' directly affects the size of a spark-partition upon ingestion of data into Spark. This parameter specifies the maximum number of bytes to be packed into a single partition when reading files, thereby controlling the partition size directly.

Discussion

1 comment
Sign in to comment
vexor3Option: A
Jul 20, 2024

A is correct