DP-200 Exam Questions

DP-200 Exam - Question 106


DRAG DROP -

You have an Azure Data Lake Storage Gen2 account that contains JSON files for customers. The files contain two attributes named FirstName and LastName.

You need to copy the data from the JSON files to an Azure Synapse Analytics table by using Azure Databricks. A new column must be created that concatenates the FirstName and LastName values.

You create the following components:

✑ A destination table in Azure Synapse

✑ An Azure Blob storage container

✑ A service principal

Which five actions should you perform in sequence next in a Databricks notebook? To answer, move the appropriate actions from the list of actions to the answer area and arrange them in the correct order.

Select and Place:

Correct Answer:

Step 1: Read the file into a data frame.

You can load the JSON files as a data frame in Azure Databricks.
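
For illustration, a minimal PySpark sketch of this step, assuming the ADLS Gen2 files are already reachable from the notebook (for example through a mount or an OAuth configuration) and that /mnt/customers is a hypothetical path:

# Read the customer JSON files into a Spark data frame.
# "/mnt/customers" is a placeholder; an abfss:// URI to the ADLS Gen2 container works as well.
customer_df = spark.read.option("multiline", "true").json("/mnt/customers/")
customer_df.printSchema()  # expect the FirstName and LastName attributes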

Step 2: Perform transformations on the data frame.
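
For example, the required concatenation could look like this sketch (the column name FullName is an assumption; the question only requires a new concatenated column):

from pyspark.sql.functions import concat_ws

# Add a column that concatenates FirstName and LastName with a space in between.
transformed_df = customer_df.withColumn("FullName", concat_ws(" ", "FirstName", "LastName"))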

Step 3: Specify a temporary folder to stage the data.

Specify a temporary folder to use while moving data between Azure Databricks and Azure Synapse.
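
Following the pattern in the referenced tutorial, this could be sketched as below (the storage account, container, and secret names are placeholders):

# Grant Spark access to the Blob storage container used for staging.
blob_account = "<blob-storage-account-name>"   # placeholder
blob_container = "<blob-container-name>"       # placeholder
blob_key = dbutils.secrets.get(scope="<scope>", key="<blob-account-key>")  # placeholder secret

spark.conf.set("fs.azure.account.key." + blob_account + ".blob.core.windows.net", blob_key)

# Temporary folder used to stage data between Azure Databricks and Azure Synapse.
temp_dir = "wasbs://" + blob_container + "@" + blob_account + ".blob.core.windows.net/tempDirs"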

Step 4: Write the results to a table in Azure Synapse.

You upload the transformed data frame into Azure Synapse. You use the Azure Synapse connector for Azure Databricks to upload a data frame directly as a table in Azure Synapse.
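
With the Azure Synapse connector, the write could look roughly like this (the JDBC URL and destination table name are placeholders; temp_dir is the staging folder from the previous step):

jdbc_url = "jdbc:sqlserver://<server>.database.windows.net:1433;database=<dw-name>;user=<user>;password=<password>"  # placeholder

(transformed_df.write
    .format("com.databricks.spark.sqldw")
    .option("url", jdbc_url)
    .option("forwardSparkAzureStorageCredentials", "true")
    .option("dbTable", "dbo.Customers")   # placeholder destination table
    .option("tempDir", temp_dir)          # staging folder in Blob storage
    .mode("overwrite")
    .save())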

Step 5: Drop the data frame.

Clean up resources. You can also terminate the cluster: from the Azure Databricks workspace, select Clusters on the left, and under Actions, point to the ellipsis (...) and select the Terminate icon.

Reference:

https://docs.microsoft.com/en-us/azure/azure-databricks/databricks-extract-load-sql-data-warehouse

Discussion

cadio30
May 3, 2021

It requires mounting the ADLS Gen2 first, so the correct sequence is "FHEAB".

niwe
May 21, 2021

Can you explain what "FHEAB" is?

maciejt
May 23, 2021

The letter labels of the steps to choose, in order.

niwe
Jun 14, 2021

Thanks!

hoangton
Jun 12, 2021

Correct answer should be: Step 1: Mount the Data Lake Storage onto DBFS. Step 2: Read the file into a data frame. Step 3: Perform transformations on the data frame. Step 4: Specify a temporary folder to stage the data. Step 5: Write the results to a table in Azure Synapse.

Aragorn_2021
Apr 20, 2021

I would go for FHEAB. Mount the storage -> read the file into a data frame -> transform it further -> write the data to a temporary folder in storage -> load it into the DWH.

111222333
May 14, 2021

Agree. The service principal (which is given in the task) is used for mounting. Mount an Azure Data Lake Gen2 to the Databricks File System (DBFS) using a service principal: https://kyleake.medium.com/mount-an-adls-gen-2-to-databricks-file-system-using-a-service-principal-and-oauth-2-0-ep-5-73172dd0ddeb
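
For reference, a mount with a service principal typically looks like this sketch (the application ID, tenant ID, secret scope, and account/container names are placeholders):

configs = {
    "fs.azure.account.auth.type": "OAuth",
    "fs.azure.account.oauth.provider.type": "org.apache.hadoop.fs.azurebfs.oauth2.ClientCredsTokenProvider",
    "fs.azure.account.oauth2.client.id": "<application-id>",
    "fs.azure.account.oauth2.client.secret": dbutils.secrets.get(scope="<scope>", key="<client-secret>"),
    "fs.azure.account.oauth2.client.endpoint": "https://login.microsoftonline.com/<tenant-id>/oauth2/token",
}

# Mount the ADLS Gen2 container at a hypothetical mount point.
dbutils.fs.mount(
    source="abfss://<container>@<adls-account>.dfs.core.windows.net/",
    mount_point="/mnt/customers",
    extra_configs=configs)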

vrmei
Jun 6, 2021

Mount Data Lake Storage onto DBFS (service principal) -> read the file into a data frame -> perform transformations on the data frame -> specify the temp folder to stage data -> write results to the Synapse table. https://docs.microsoft.com/en-us/azure/databricks/scenarios/databricks-extract-load-sql-data-warehouse

vrmei
Jun 6, 2021

Small correction: I don't see the mount option in the ADLS account configuration in the given URL. I feel the given answer might be correct. The last one should be "Drop the data frame", which does the cleanup...

alf99
Apr 5, 2021

Wrong, should be F, H, E, A, B. The Data Lake storage has to be mounted onto DBFS before reading the file.

DongDuong
Apr 7, 2021

Based on the provided link, I think the keyword here is "mounted". The Data Lake storage is not mounted onto DBFS; instead, it is accessed by Databricks via the API. So the given answer is correct.

DongDuong
Apr 9, 2021

After revising, I think FHEAB makes more sense

tucho
Apr 11, 2021

I agree with HEAB, but I don't know which one is missing. I think there is no need to "drop the DF" or to "mount the DL storage"... :-( Does anybody know the right full answer?

unidigm
May 27, 2021

Do we really need to stage the data? We could directly write the dataframe to Synapse. https://docs.microsoft.com/en-us/azure/databricks/data/data-sources/azure/synapse-analytics

Rob77
May 27, 2021

Yes, we do. tempDir (that stages data) MUST be specified for Synapse write method.

Bhagya123456
Aug 7, 2021

The answer is perfect. Mounting is not required, and "Drop the data frame" should be there. The question never mentioned that you have to use the service principal. Had it been 6 steps, I would have added the mounting step. But considering only 5 steps, these 5 steps have higher priority than mounting (which is not essential).

satyamkishoresingh
Sep 27, 2021

Why drop the data frame? Cleaning up resources is about the cluster, not the DF.