Exam Certified Data Engineer Associate All QuestionsBrowse all questions from this exam
Question 107

A data engineer has developed a data pipeline to ingest data from a JSON source using Auto Loader, but the engineer has not provided any type inference or schema hints in their pipeline. Upon reviewing the data, the data engineer has noticed that all of the columns in the target table are of the string type despite some of the fields only including float or boolean values.

Why has Auto Loader inferred all of the columns to be of the string type?

    Correct Answer: B

    JSON data is a text-based format in which data is stored as strings by default. Without explicit type inference or schema hints provided by the data engineer, Auto Loader defaults to treating all fields as strings because it cannot determine the correct data types just from text representation.

Discussion
csrazdanOption: B

JSON file does not include datatype and all columns are defaulted as string