Professional Data Engineer Exam QuestionsBrowse all questions from this exam

Professional Data Engineer Exam - Question 303


You are managing a Dataplex environment with raw and curated zones. A data engineering team is uploading JSON and CSV files to a bucket asset in the curated zone but the files are not being automatically discovered by Dataplex. What should you do to ensure that the files are discovered by Dataplex?

Show Answer
Correct Answer: BD

To ensure that files are automatically discovered by Dataplex, you need to enable the auto-discovery feature for the curated zone. This feature, when enabled, allows Dataplex to automatically discover and catalog data assets within a zone, including JSON and CSV files. Without enabling this feature, Dataplex will not be able to detect the presence of new files in the curated zone.

Discussion

14 comments
Sign in to comment
GCP001Option: A
Jan 17, 2024

Should be A. Curated zone need Parquet, Avro, ORC format not CSV or JSON. Check the ref - https://cloud.google.com/dataplex/docs/add-zone#curated-zones

raaadOption: B
Jan 6, 2024

- Auto-Discovery Feature: Dataplex has an auto-discovery feature that, when enabled, automatically discovers and catalogs data assets within a zone. - Appropriate for Both Raw and Curated Zones: This feature is applicable to both raw and curated zones, and it should be tailored to the specific data governance and cataloging needs of the organization.

Matt_108Option: B
Jan 13, 2024

I'd go for Option B, auto-discovery is enabled by default for any zone, including curated ones, so if a file is not automatically discovered it's due to the disabled auto-discovery

ML6
Feb 18, 2024

In this case, it would be because of invalid data format in curated zones (data not in Avro, Parquet, or ORC formats).

Sofiia98Option: A
Jan 11, 2024

I will go with A, check the ref. Curated zones only store Parquet, Avro, and ORC in CS, and well-defined schema and Hive-style partitions in the BigQuery: https://cloud.google.com/dataplex/docs/add-zone#curated-zones

dungctOption: B
Feb 12, 2024

Discovery raises the following administrator actions whenever data-related issues are detected during scans : Inconsistent data format in a table. For example, files of different formats exist with the same table prefix. Inconsistent data format in a table. For example, files of different formats exist with the same table prefix.

dungct
Feb 12, 2024

https://cloud.google.com/dataplex/docs/discover-data#invalid_data_format

daidai75Option: B
Feb 22, 2024

According to this URL: https://cloud.google.com/dataplex/docs/discover-data, the auto-discovery can support CSV and Json in both Raw-Zone and Curated-Zone. I also open a console the verify it, both Raw and Curated zone can set up csv&json auto-discovery.

Anudeep58Option: A
Jun 17, 2024

While none of the original options (A, B, C, or D) directly address the issue, the closest solution is: Move the JSON and CSV files to a raw zone. (This was previously marked as the most voted option, but it's not ideal due to data organization disruption) Here's why this approach might be necessary (but not ideal): Dataplex curated zones currently don't support native processing of JSON and CSV formats. They are designed for structured data formats like Parquet, Avro, or ORC.

scaenruyOption: A
Jan 4, 2024

A. Move the JSON and CSV files to the raw zone.

kck6ra4214wmOption: A
Feb 28, 2024

Dataplex does not allow users to create CSV files within a “curated zone”

Moss2011Option: A
Mar 1, 2024

The answer can be found reading a common config of Dataplex in this URL: https://medium.com/google-cloud/google-cloud-dataplex-part-1-lakes-zones-assets-and-discovery-5f288486cb2f

demoro86Option: A
Mar 5, 2024

GCP001 agree with him

joao_01Option: B
Apr 12, 2024

Its B guys, i encounter this in my job, and I had to do B to make it work

joao_01
Apr 12, 2024

Actually I did this in a Raw zone, not Curated.

joao_01
Apr 12, 2024

Its A :)

chrissamharrisOption: A
May 2, 2024

Option A https://cloud.google.com/dataplex/docs/add-zone#raw-zones Raw zones are the only zones that support CSV & JSON

hussain.sainOption: B
Jul 3, 2024

While JSON and CSV can technically be stored in curated zones, it is not a common practice due to the reasons mentioned above. no where in the mention link its mention that there is a restriction.