DP-203 Exam QuestionsBrowse all questions from this exam

DP-203 Exam - Question 73


HOTSPOT

-

You have an Azure subscription that contains an Azure Synapse Analytics dedicated SQL pool.

You plan to deploy a solution that will analyze sales data and include the following:

• A table named Country that will contain 195 rows

• A table named Sales that will contain 100 million rows

• A query to identify total sales by country and customer from the past 30 days

You need to create the tables. The solution must maximize query performance.

How should you complete the script? To answer, select the appropriate options in the answer area.

NOTE: Each correct selection is worth one point.

Show Answer
Correct Answer:

Discussion

6 comments
Sign in to comment
FRANCIS_A_M
Apr 3, 2023

Correct! 1. Hash(CustomerID) 2. Replicate

zafnad
May 11, 2023

Could you please explain why 1. Hash([CustomerID]) is correct, and 2. Hash([OrderDate]) is incorrect.

ajhak
May 19, 2023

It is hash because it is a fact table (you can tell because there is the "total" column being created which is numerical). Rule of thumb, never hash on a date field, so in this case you would hash on 'CustomerID'. You want the hash to have as many unique values as possible.

vctrhugo
Jun 24, 2023

Never distribute on Date.

Spam_Account
Jul 2, 2023

Don't hash on date, only partition on date

nmnm22
Apr 9, 2023

correct

examtopicsofyannick
Aug 2, 2023

Correct. Hash on Sales Table(Fact) and Replicate on Country table(Dimension)

Deeksha1234
Aug 15, 2023

given answer is correct

kkk5566
Sep 4, 2023

1. Hash(CustomerID) 2. Replicate

AnhHoang
Jul 17, 2024

1. You would want hash distribution to improve query performance. You don't want to hash on Date column since it can cause bottlenecks if many people query on a same date (eg. getdate()), so hash on customerid is the way. 2. You would want to replicate small tables across all distributions, so it can pick up any distribution and still have full data.