DP-201 Exam QuestionsBrowse all questions from this exam

DP-201 Exam - Question 104


You are designing a statistical analysis solution that will use custom proprietary Python functions on near real-time data from Azure Event Hubs.

You need to recommend which Azure service to use to perform the statistical analysis. The solution must minimize latency.

What should you recommend?

Show Answer
Correct Answer: C

Azure Databricks is the most suitable option for performing statistical analysis on near real-time data using custom proprietary Python functions. Azure Databricks provides a collaborative environment for data engineers and data scientists to run their custom Python functions efficiently. It supports notebook-based development, which is ideal for Python-based custom function execution. Additionally, Databricks is optimized for both batch processing and real-time data stream processing using Structured Streaming. Hence, it minimizes latency and meets the requirement for near real-time data analysis.

Discussion

10 comments
Sign in to comment
Mily94
May 8, 2021

Shouldn't be C (Databricks)? I think that the keyword is Python.

cadio30
May 26, 2021

Provided with the limited option, Azure Databricks is the appropriate solution as it can accommodate Python script

alf99
May 22, 2021

https://docs.microsoft.com/en-us/azure/stream-analytics/functions-overview Azure Stream Analytics supports the following four function types: JavaScript user-defined functions JavaScript user-defined aggregates C# user-defined functions (using Visual Studio) Azure Machine Learning Python is not on list so Databricks must be the right choice Answer: C

Ninja1
May 26, 2021

Agree, Databricks should be the answer

massnonn
Nov 11, 2021

why not A)Azure Synapse Analytics? support the python and nearl real time instead databricks is batch, or not?

DragonBlake
Jun 2, 2021

Answer is correct. ASA supports python https://docs.microsoft.com/en-us/python/api/overview/azure/mgmt-streamanalytics-readme?view=azure-python-preview

BitchNigga
Jun 2, 2021

Custom proprietary functions are written manually which need to be packaged and imported so yes databricks

Mandar77
Jun 10, 2021

You have to perform statistical analysis that what question says. Means what ever date you get on that you need to do it. I have found the link https://stackoverflow.com/questions/58097539/execute-azure-steaming-analytics-queries-from-a-python-script which talks about executing queries using python. The answer seems correct.

erssiws
Jun 19, 2021

Should be the Databricks which has better support for UDF in python

Deedubya
Aug 20, 2021

I agree it should be Databricks. Nowhere in Microsoft documentation does it say Azre Stream Analytics supports python. https://docs.microsoft.com/en-us/azure/architecture/data-guide/technology-choices/stream-processing Python Reference document to show all integrated Azure services also does not list Azure Stream Analytics. https://docs.microsoft.com/en-us/python/api/overview/azure/?view=azure-python If someone finds something updated, please post!

deeps1390
Oct 12, 2021

Correct ans is B https://docs.microsoft.com/en-us/azure/event-hubs/event-hubs-python-get-started-send