DP-200 Exam - Question 1


You are a data engineer implementing a lambda architecture on Microsoft Azure. You use an open-source big data solution to collect, process, and maintain data.

The analytical data store performs poorly.

You must implement a solution that meets the following requirements:

✑ Provide data warehousing

✑ Reduce ongoing management activities

✑ Deliver SQL query responses in less than one second

You need to create an HDInsight cluster to meet the requirements.

Which type of cluster should you create?

Correct Answer: D

Lambda Architecture with Azure:

Azure offers a combination of the following technologies to accelerate real-time big data analytics:

1. Azure Cosmos DB, a globally distributed and multi-model database service.

2. Apache Spark for Azure HDInsight, a processing framework that runs large-scale data analytics applications.

3. Azure Cosmos DB change feed, which streams new data to the batch layer for HDInsight to process.

4. The Spark to Azure Cosmos DB Connector

Note: Lambda architecture is a data-processing architecture designed to handle massive quantities of data by taking advantage of both batch processing and stream processing methods, and minimizing the latency involved in querying big data.
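
The batch/stream split described in the note can be illustrated with a minimal sketch in plain Python. This is only a toy model, not how HDInsight, Spark, or Cosmos DB implement it: the function names (`batch_view`, `speed_view`, `query`) and the event shape are hypothetical, and in-memory counters stand in for the real batch, speed, and serving layers.

```python
from collections import Counter


def batch_view(master_events):
    """Batch layer: recompute per-key counts from the full master dataset."""
    return Counter(event["key"] for event in master_events)


def speed_view(recent_events):
    """Speed layer: per-key counts for events the last batch run has not absorbed yet."""
    return Counter(event["key"] for event in recent_events)


def query(key, batch, speed):
    """Serving layer: answer a query by merging the batch view and the speed view."""
    return batch[key] + speed[key]


# Hypothetical sample data: three events already in the master dataset,
# one event that arrived after the last batch run.
master = [{"key": "a"}, {"key": "b"}, {"key": "a"}]
recent = [{"key": "a"}]

batch = batch_view(master)
speed = speed_view(recent)

print(query("a", batch, speed))  # batch count plus speed count for key "a"
```

The point of the design is that a query never scans the raw data: it reads a precomputed batch view and adds a small, recent delta, which is what keeps query latency low.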

References:

https://sqlwithmanoj.com/2018/02/16/what-is-lambda-architecture-and-what-azure-offers-with-its-new-cosmos-db/

Discussion

17 comments
dangal95
Apr 24, 2021

Could the right answer be D because Spark has: 1) interactive queries through spark-sql, 2) data warehousing capabilities through Delta Lake (spark-sql also creates in-memory tables), and 3) less management, because these are out-of-the-box features?

Amy007
May 11, 2021

It also mentions SQL queries, which are not Hive (Interactive Query).

jedi01
Aug 19, 2020

I think the answer should be A, Interactive Query. The lambda architecture here is implemented with an open-source technology, which could be Apache Spark and is already in use. The prevailing issue is that analytical processing is very slow; in other words, queries are slow. So I would create an HDInsight cluster of type "Interactive Query" to support analytical processing, fast query access, data warehousing, etc. We can use HiveQL on Interactive Query. Refer to https://docs.microsoft.com/en-us/azure/hdinsight/interactive-query/apache-interactive-query-get-started

J_i_L_L
Nov 30, 2020

The exam was updated on Nov 24, 2020. I didn't see too many questions from the test on ExamTopics, maybe 20-30% of the test questions. I suggest waiting a bit to take the test so that all the exam prep questions are updated. The exam definitely requires hands-on knowledge of the products. There were a lot of questions on Cosmos DB consistency settings, encryption/security, and monitoring/metrics.

wahwah
Dec 1, 2020

Did you pass the exam after Nov 24? Is there any difference between these questions and the updated ones? I mean, if I prepare for the exam with this version of the questions, what are my chances of passing it? Thank you.

dumpsm42
Dec 12, 2020

Hi all, the answer: https://azure.microsoft.com/pt-pt/blog/general-availability-of-hdinsight-interactive-query-blazing-fast-data-warehouse-style-queries-on-hyper-scale-data-2/ Sub-second! Summary: "This week at Ignite, we are pleased to announce general availability of Azure HDInsight Interactive Query. Backed by our enterprise-grade SLA, HDInsight Interactive Query brings sub-second speed to data warehouse style SQL queries to the hyper-scale data stored in commodity cloud storage." Regards

dfrp92
Jun 22, 2020

How does Spark meet the requirements? Spark does not provide data warehousing by itself, it is not a data store.

induna
Jun 24, 2020

Neither do any of the options. The last part of the question is key: which type of cluster will you create? Hence, Spark.

Mittun
Oct 17, 2020

Apache Spark is the correct answer! https://docs.microsoft.com/en-us/azure/hdinsight/spark/apache-spark-overview

uomer
Nov 9, 2020

I also vote for Interactive Query, as "An Interactive Query cluster is different from an Apache Hadoop cluster. It contains only the Hive service." Requirements: ✑ Provide data warehousing (Yes) ✑ Reduce ongoing management activities (Not sure) ✑ Deliver SQL query responses in less than one second (Yes)

Leonido
Feb 6, 2020

I would suggest using the original link from MS as better background documentation: https://docs.microsoft.com/en-us/azure/cosmos-db/lambda-architecture

john_smith
Jul 14, 2020

The link you provided redirects to "What is Azure Synapse Link for Azure Cosmos DB (Preview)?" Why?

AAJ
Mar 2, 2020

https://docs.microsoft.com/en-us/azure/cosmos-db/lambda-architecture

john_smith
Jul 14, 2020

The link you provided redirects to "What is Azure Synapse Link for Azure Cosmos DB (Preview)?"

agmadeira
Jun 27, 2021

A - Interactive Query - "Deliver SQL query responses in less than one second" https://docs.microsoft.com/en-us/azure/hdinsight/interactive-query/apache-interactive-query-get-started

r8d1
Aug 10, 2020

I think the logic to answer this question is: Lambda architecture: https://databricks.com/glossary/lambda-architecture. Azure implementation: https://azure.microsoft.com/en-us/services/databricks/. Azure Databricks = fast, easy, and collaborative Apache Spark™ based analytics service.

nehab0101
Oct 18, 2020

https://azure.microsoft.com/en-in/blog/lambda-architecture-using-azure-cosmosdb-faster-performance-low-tco-low-devops/

sunil08
Nov 6, 2020

D: Apache Spark

Trivender
Jan 20, 2021

The correct answer is Spark because it is in-memory.

Satya217
Feb 9, 2021

https://docs.microsoft.com/en-us/azure/cosmos-db/lambda-architectur

sandeep1111
Apr 6, 2021

correct

Hinzzz
Jun 20, 2021

D is correct based on Data warehousing requirement.