SAP-C02 Exam QuestionsBrowse all questions from this exam

SAP-C02 Exam - Question 239


A company is developing a gene reporting device that will collect genomic information to assist researchers with collecting large samples of data from a diverse population. The device will push 8 KB of genomic data every second to a data platform that will need to process and analyze the data and provide information back to researchers. The data platform must meet the following requirements:

• Provide near-real-time analytics of the inbound genomic data

• Ensure the data is flexible, parallel, and durable

• Deliver results of processing to a data warehouse

Which strategy should a solutions architect use to meet these requirements?

Show Answer
Correct Answer: B

To meet the requirement of near-real-time analytics, flexibility, parallelism, and durability, Amazon Kinesis Data Streams is an appropriate choice for collecting inbound sensor data. Kinesis Data Streams can handle large amounts of streaming data while providing the necessary scale and parallelism. For the analytics and processing, utilizing Kinesis clients ensures that the heavy lifting is done in real-time. Finally, using Amazon Redshift, which is a powerful data warehousing solution, ensures that the results are stored efficiently and are ready for complex querying and analysis, thereby fulfilling the data warehouse requirement.

Discussion

12 comments
Sign in to comment
shaaam80Option: B
Nov 30, 2023

Answer B. Option A might be close enough, near-real time, which is Firehose, but the target is RDS but the ask is for Datawarehouse (Redshift)

bhanusOption: B
Jun 22, 2023

B is correct B - Kinesis Data Streams is a real-time streaming service and provide near-real-time analytics. Also the question "Deliver results of processing to a data warehouse" and this option has redshift cluster which is a powerful data warehousing solution that can handle large-scale analytics workloads. A - incorrect because Kinesis Data Firehose is NOT ideal for near-real-time analytics and may introduce some latency in the data processing pipeline. Additionally, saving the results to an Amazon RDS instance may not provide the scalability and flexibility required for processing and analyzing large volumes of genomic data.

bhanus
Jun 26, 2023

What a worst framed ques. The ques says "NEAR real time" which means its Kinesis data firehose. But this option has RDS which is not good for analysis

bhanus
Jun 30, 2023

Between A and B, B is better because questions asks for data warehousing capabilities. So option B has Redshift which is correct answer.

SmileyCloudOption: B
Jun 26, 2023

B. Real-time is either firehose (A) or streams (B). But they require a data warehouse and that's RedShift not RDS.

MoussaNoussa
Jun 22, 2023

Answer B is the right one

nexus2020Option: B
Jun 23, 2023

B is the one for real time

easytoo
Jun 23, 2023

b=b=b=b=b=b=b

NikkyDickyOption: B
Jul 7, 2023

B for sure

career360guruOption: B
Nov 24, 2023

Option B

tmlong18Option: B
Jan 14, 2024

'parallel'

DgixOption: B
Mar 21, 2024

Correct answer is B.

bjexamprepOption: D
Apr 19, 2024

Kinesis client is a library. Users need to write an application with the Kinesis Client Library to use it. Both A and B states “analyze the data with Kinesis clients” without mentioning how the application is written and deployed. So, both A and B are out, cause the deployment model is the key of the question to satisfy the requirement. C has an incorrect statement “analyze the data from Amazon SQS with Kinesis” D is a feasible solution.

jopaca1216
Jul 21, 2024

SQS is not near real time

Win007
Jun 6, 2024

Dis the right Answer