You have an Azure Synapse Analytics dedicated SQL pool named Pool1. Pool1 contains a fact table named Table1.
You need to identify the extent of the data skew in Table1.
What should you do in Synapse Studio?
To identify the extent of data skew in the fact table Table1 within an Azure Synapse Analytics dedicated SQL pool, connect to Pool1 and run DBCC PDW_SHOWSPACEUSED. This DBCC command reports the row count and space used for each of the table's distributions, which shows whether the data is spread evenly or concentrated in a few distributions.
For the "Exam Topics" team: to begin with, your questions and answers are completely wrong, period. Check your answer for question #36 on this same page! Why are you misleading those of us who are preparing seriously for the exam? I need an immediate explanation of why questions #36 and #38, which follow the same pattern, have different answers. Seriously.
This is repeated way too many times.
Connect to Pool1 and run DBCC PDW_SHOWSPACEUSED. Azure Synapse Analytics dedicated SQL pool (formerly Azure SQL Data Warehouse) uses a massively parallel processing (MPP) architecture, and DBCC PDW_SHOWSPACEUSED is a DBCC command that reports how a table's rows and storage are spread across its distributions. By running this command on Pool1 and specifying the fact table Table1, you can identify the extent of data skew in Table1 and determine whether the data is evenly distributed or concentrated in particular distributions.
It's A; we need to connect to Pool1.
It's A
To identify the extent of data skew in Table1, you should connect to Pool1 and run DBCC PDW_SHOWSPACEUSED. DBCC PDW_SHOWSPACEUSED is a DBCC command, not a Dynamic Management View (DMV), and it reports the physical storage of data in the dedicated SQL pool. When run against a table, it returns the number of rows and the amount of space used in each distribution. Distributions that store a disproportionately large share of the data indicate data skew.
Agreed!
DBCC PDW_SHOWSPACEUSED is a command that can be executed in the context of the dedicated SQL pool (formerly SQL DW). It provides detailed information about how data is spread across distributions (similar to shards or segments) in the underlying storage of the dedicated SQL pool. By running this command against Pool1, you can see the number of rows per distribution, which helps in identifying data skew. B is not an option because the built-in pool is the serverless SQL pool; it does not hold Table1, so it cannot report on Table1's distributions.
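As a rough illustration of turning the per-distribution row counts into a skew figure: running something like `DBCC PDW_SHOWSPACEUSED('dbo.Table1');` while connected to Pool1 returns one row per distribution, including a ROWS column (the `dbo` schema is an assumption here). The sketch below takes those ROWS values as a plain list and reports how far the fullest distribution sits above the average. The function name, the sample numbers, and the choice of "max over average" as the metric are all hypothetical, not an official threshold or method.

```python
from statistics import mean


def skew_summary(rows_per_distribution):
    """Summarize how unevenly rows are spread across distributions.

    rows_per_distribution: the ROWS value for each distribution, copied
    from the DBCC PDW_SHOWSPACEUSED output (hypothetical input format).
    """
    if not rows_per_distribution:
        raise ValueError("need at least one distribution row count")

    avg = mean(rows_per_distribution)
    largest = max(rows_per_distribution)
    smallest = min(rows_per_distribution)

    # Percentage by which the fullest distribution exceeds the average.
    # An empty table has no skew to report, so return 0.0 in that case.
    over_avg_pct = 0.0 if avg == 0 else (largest - avg) / avg * 100

    return {
        "min": smallest,
        "max": largest,
        "avg": avg,
        "max_over_avg_pct": over_avg_pct,
    }


if __name__ == "__main__":
    # Made-up sample data for 60 distributions: one holds far more rows.
    sample = [1000] * 59 + [7000]
    print(skew_summary(sample))
```

A value near zero for `max_over_avg_pct` suggests an even spread, while a large value means a few distributions hold most of the rows. What counts as "too much" depends on the workload.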
You need to access the built-in pool to check the DMVs and to use DBCC commands.
A is correct
PDW_SHOWSPACEUSED
A for sure
Question 36 from the same topic asks the same question but gives D as the right answer. So what is the right answer here?
Looks like he is misleading us?