Certified Data Engineer Associate Exam QuestionsBrowse all questions from this exam

Certified Data Engineer Associate Exam - Question 107


A data engineer has developed a data pipeline to ingest data from a JSON source using Auto Loader, but the engineer has not provided any type inference or schema hints in their pipeline. Upon reviewing the data, the data engineer has noticed that all of the columns in the target table are of the string type despite some of the fields only including float or boolean values.

Why has Auto Loader inferred all of the columns to be of the string type?

Show Answer
Correct Answer: B

JSON data is a text-based format in which data is stored as strings by default. Without explicit type inference or schema hints provided by the data engineer, Auto Loader defaults to treating all fields as strings because it cannot determine the correct data types just from text representation.

Discussion

1 comment
Sign in to comment
csrazdanOption: B
Jul 19, 2024

JSON file does not include datatype and all columns are defaulted as string