If a Snowflake user decides a table should be clustered, what should be used as the cluster key?
The columns most actively used in the select filters should be used as the cluster key. Clustering co-locates rows with similar key values in the same micro-partitions, so a query that filters on the key can skip micro-partitions that cannot contain matching rows. Choosing columns that appear frequently in filters therefore speeds up retrieval for exactly those queries.
D correct
Cardinality should be optimal, neither too high nor too low. This is in the documentation.
D is the correct answer. Snowflake recommends prioritizing keys in the following order. First, cluster on the columns that are most actively used in selective filters. For many fact tables involved in date-based queries (for example “WHERE invoice_date > x AND invoice_date <= y”), choosing the date column is a good idea. For event tables, event type might be a good choice if there are a large number of different event types. (If your table has only a small number of different event types, then see the comments on cardinality below before choosing an event column as a clustering key.) Second, if there is room for additional cluster keys, consider columns frequently used in join predicates, for example “FROM table1 JOIN table2 ON table2.column_A = table1.column_B”.
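As a rough illustration of that ordering, here is a minimal sketch in Snowflake SQL. The table `invoices` and its columns are hypothetical names made up for this example; it assumes `invoice_date` is the column most often used in selective filters and `customer_id` is a common join column.

```sql
-- Hypothetical table and column names, for illustration only.
-- Define the clustering key when the table is created:
-- the date column used in selective filters comes first.
CREATE TABLE invoices (
    invoice_id   NUMBER,
    invoice_date DATE,
    customer_id  NUMBER,
    amount       NUMBER(12, 2)
)
CLUSTER BY (invoice_date);

-- Or change the key on an existing table, adding a join column
-- after the filter column.
ALTER TABLE invoices CLUSTER BY (invoice_date, customer_id);

-- Inspect how well the table is clustered on a given key.
SELECT SYSTEM$CLUSTERING_INFORMATION('invoices', '(invoice_date)');
```

Whether the second column is worth adding depends on the actual query mix and on the cardinality of each column, so treat the key above as a starting point and not as a recommendation for any real table.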
correct