Exam DP-203 All QuestionsBrowse all questions from this exam
Question 73

HOTSPOT

-

You have an Azure subscription that contains an Azure Synapse Analytics dedicated SQL pool.

You plan to deploy a solution that will analyze sales data and include the following:

• A table named Country that will contain 195 rows

• A table named Sales that will contain 100 million rows

• A query to identify total sales by country and customer from the past 30 days

You need to create the tables. The solution must maximize query performance.

How should you complete the script? To answer, select the appropriate options in the answer area.

NOTE: Each correct selection is worth one point.

    Correct Answer:

Discussion
FRANCIS_A_M

Correct! 1. Hash(CustomerID) 2. Replicate

zafnad

Could you please explain why 1. Hash([CustomerID]) is correct, and 2. Hash([OrderDate]) is incorrect.

ajhak

It is hash because it is a fact table (you can tell because there is the "total" column being created which is numerical). Rule of thumb, never hash on a date field, so in this case you would hash on 'CustomerID'. You want the hash to have as many unique values as possible.

vctrhugo

Never distribute on Date.

Spam_Account

Don't hash on date, only partition on date

nmnm22

correct

examtopicsofyannick

Correct. Hash on Sales Table(Fact) and Replicate on Country table(Dimension)

AnhHoang

1. You would want hash distribution to improve query performance. You don't want to hash on Date column since it can cause bottlenecks if many people query on a same date (eg. getdate()), so hash on customerid is the way. 2. You would want to replicate small tables across all distributions, so it can pick up any distribution and still have full data.

kkk5566

1. Hash(CustomerID) 2. Replicate

Deeksha1234

given answer is correct