DP-201 Exam Questions

DP-201 Exam - Question 86


You are designing a solution that will copy Parquet files stored in an Azure Blob storage account to an Azure Data Lake Storage Gen2 account.

The data will be loaded daily to the data lake and will use a folder structure of {Year}/{Month}/{Day}/.

You need to design a daily Azure Data Factory data load to minimize the data transfer between the two accounts.

Which two configurations should you include in the design? Each correct answer presents part of the solution.

NOTE: Each correct selection is worth one point.

Correct Answer: BD

To minimize data transfer and adhere to the specified folder structure, you should filter by the last modified date of the source files and specify a file naming pattern for the destination. Filtering by the last modified date ensures that only new or updated files are copied each day, minimizing the amount of data transferred. Specifying a file naming pattern allows the data to be correctly placed into the folder structure {Year}/{Month}/{Day}/ in the destination Azure Data Lake Storage Gen2 account.
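
As a rough sketch of what the last-modified filter (option B) can look like in the copy activity's source settings: the property names below come from the ADF copy activity for blob-based sources, while the one-day window anchored to the trigger time is an assumption about the daily schedule.

"source": {
    "type": "ParquetSource",
    "storeSettings": {
        "type": "AzureBlobStorageReadSettings",
        "recursive": true,
        "modifiedDatetimeStart": "@addDays(pipeline().TriggerTime, -1)",
        "modifiedDatetimeEnd": "@pipeline().TriggerTime"
    }
}

With a window like this in place, each daily run copies only the blobs written since the previous run instead of re-copying the whole container.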

Discussion

phi618t
Jun 8, 2021

If you choose C (delete the source files after they are copied), why would you also choose B (filter by the last modified date of the source files)? I prefer BD.

Bhagya123456
Aug 19, 2021

How is a naming pattern going to minimize the data transfer? BC should be the correct answer.

Marcus1612
Sep 30, 2021

This is a basic question: copy data from one place to another. The requirements are: 1) minimize the transfer, and 2) adapt the data to the destination folder structure. Filtering on LastModifiedDate will copy everything that has changed since the last load while minimizing the data transfer. Specifying the file naming pattern allows the data to be copied to the right place in the destination Data Lake. The answer is BD.
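
To make the folder-structure half of this concrete: one way to land files under {Year}/{Month}/{Day}/ is a parameterized sink dataset whose folder path the pipeline computes from the trigger time. This is a sketch; the dataset name, linked service name, and filesystem name are placeholders.

{
    "name": "SinkDataLakeParquet",
    "properties": {
        "type": "Parquet",
        "linkedServiceName": { "referenceName": "AdlsGen2LinkedService", "type": "LinkedServiceReference" },
        "parameters": { "dayPath": { "type": "string" } },
        "typeProperties": {
            "location": {
                "type": "AzureBlobFSLocation",
                "fileSystem": "datalake",
                "folderPath": { "value": "@dataset().dayPath", "type": "Expression" }
            }
        }
    }
}

The copy activity then fills in the date path when it references the dataset:

"outputs": [{
    "referenceName": "SinkDataLakeParquet",
    "type": "DatasetReference",
    "parameters": { "dayPath": "@formatDateTime(pipeline().TriggerTime, 'yyyy/MM/dd')" }
}]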

Wendy_DK
May 13, 2021

Correct answer is BC. In the source options of the copy activity there are three choices: 1. No action 2. Delete source files 3. Move.
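
For comparison, the copy activity also exposes a delete-after-copy option on blob sources; a minimal sketch, assuming that behaviour is what option C refers to:

"storeSettings": {
    "type": "AzureBlobStorageReadSettings",
    "recursive": true,
    "deleteFilesAfterCompletion": true
}

Files are removed from the source only after they have been copied successfully.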

BigMF
Jun 11, 2021

A is obviously out, and you're not going to do both B and C, so D is in by default. Your only choice at that point is B or C to go along with D. In my experience, you cannot rely 100% on any job to run every single day (assuming this process is daily). Therefore, if the job does not run for one or more days and you chose B, you would only copy over the most recent files and there would be files left behind in the storage account. So my choice would be to not filter, load everything that is in the storage account, and then delete the files once they have been copied. C and D are my choices.

YLiu
Sep 14, 2021

B ensures minimized data transfer. If it copies everything every time, then data transfer is not minimized.

maciejt
Apr 8, 2021

There was no requirement about what to do with the original files, so why in the world is answer C - delete them???

BobFar
Jun 5, 2021

I guess to make sure you don't read the file again!

mter2007
Apr 21, 2021

I would like to choose CD.

Nik71
Mar 24, 2021

C seems not correct, as for deletion you can use lifecycle management in storage, so D should be the second answer.

AlexD332
Mar 12, 2021

I thought it was the only logical choice, but they said copy activity, not moving files.

H_S
Mar 16, 2021

I think it"s BD

Anonymous
Mar 20, 2021

Wildcard path: using a wildcard pattern will instruct ADF to loop through each matching folder and file in a single Source transformation. This is an effective way to process multiple files within a single flow. Add multiple wildcard matching patterns with the + sign that appears when hovering over your existing wildcard pattern. From your source container, choose a series of files that match a pattern. Only the container can be specified in the dataset; your wildcard path must therefore also include your folder path from the root folder.
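
As a sketch of the same idea in copy-activity terms (the folder and file patterns below are placeholders), the wildcard goes into the source store settings:

"source": {
    "type": "ParquetSource",
    "storeSettings": {
        "type": "AzureBlobStorageReadSettings",
        "recursive": true,
        "wildcardFolderPath": "landing/*/*/*",
        "wildcardFileName": "*.parquet"
    }
}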

Anonymous
Mar 20, 2021

Yes, BD... I think you are right.

maciejt
Apr 8, 2021

But this applies to finding the source files, and D was about the destination file naming pattern... for which there was no requirement to change the file name.

cadio30
May 26, 2021

Agree with answers B and D, as this kind of setup doesn't perform any deletion in either storage, which lessens the processing.
