
DEA-C01 Exam - Question 114


A company plans to use Amazon Kinesis Data Firehose to store data in Amazon S3. The source data consists of 2 MB .csv files. The company must convert the .csv files to JSON format. The company must store the files in Apache Parquet format.

Which solution will meet these requirements with the LEAST development effort?

Correct Answer: B,D

Kinesis Data Firehose has built-in record format conversion, but it only accepts JSON as input: it can convert JSON to Apache Parquet before storing the data in S3. Because the source data is in .csv format, an intermediate step is needed first. The solution with the least development effort is to have Kinesis Data Firehose invoke an AWS Lambda function that transforms the .csv data to JSON, and then let Kinesis Data Firehose store the result in Parquet format.
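The Lambda step described above can be sketched roughly as follows. This is a minimal illustration, not the exam's reference solution. It assumes each Firehose record carries UTF-8 CSV text whose first row is a header, and it emits one JSON object per CSV row, newline-separated; whether that layout suits a particular stream and Glue schema should be checked against the Firehose documentation. All values stay strings, since `csv.DictReader` does no type conversion.

```python
import base64
import csv
import io
import json


def lambda_handler(event, context):
    """Firehose data-transformation handler: CSV text in, JSON lines out."""
    output = []
    for record in event["records"]:
        try:
            # Firehose passes each record's payload base64-encoded.
            text = base64.b64decode(record["data"]).decode("utf-8")
            # Assumption: the first row of the payload is a header row.
            rows = csv.DictReader(io.StringIO(text))
            # Emit one JSON object per CSV row, one object per line.
            json_lines = "".join(json.dumps(row) + "\n" for row in rows)
            output.append({
                "recordId": record["recordId"],
                "result": "Ok",
                "data": base64.b64encode(json_lines.encode("utf-8")).decode("utf-8"),
            })
        except (ValueError, csv.Error):
            # Return the original payload and mark the record as failed.
            output.append({
                "recordId": record["recordId"],
                "result": "ProcessingFailed",
                "data": record["data"],
            })
    return {"records": output}
```

Every incoming `recordId` must appear in the response with a `result` of `Ok`, `Dropped`, or `ProcessingFailed`, which is why the failure branch still returns the record.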

Discussion

5 comments
qwertyuio (Option: D)
Jul 17, 2024

https://docs.aws.amazon.com/firehose/latest/dev/record-format-conversion.html

LR2023
Jul 17, 2024

Why do you need Lambda in the middle? Per your link, Amazon Data Firehose can convert the format of your input data from JSON to Apache Parquet or Apache ORC before storing the data in Amazon S3. My choice is B.

Alagong (Option: B)
Jun 30, 2024

By using the built-in transformation and format conversion features of Kinesis Data Firehose, you achieve the desired result with minimal custom development, thereby meeting the requirements efficiently and cost-effectively.

Bmaster
Jun 29, 2024

D is good https://docs.aws.amazon.com/firehose/latest/dev/record-format-conversion.html

Bmaster
Jun 30, 2024

"If you want to convert an input format other than JSON, such as comma-separated values (CSV) or structured text, you can use AWS Lambda to transform it to JSON first. For more information"

HunkyBunky (Option: B)
Jul 2, 2024

B - least development effort

mzansikiller
Jul 21, 2024

Answer: D. From https://docs.aws.amazon.com/firehose/latest/dev/record-format-conversion.html: "Amazon Data Firehose can convert the format of your input data from JSON to Apache Parquet or Apache ORC before storing the data in Amazon S3. Parquet and ORC are columnar data formats that save space and enable faster queries compared to row-oriented formats like JSON. If you want to convert an input format other than JSON, such as comma-separated values (CSV) or structured text, you can use AWS Lambda to transform it to JSON first. For more information, see Transform data in Amazon Data Firehose."
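
For anyone wondering how the two stages fit together in one delivery stream, here is a rough sketch of the S3 destination settings: a Lambda processor for the CSV-to-JSON step, plus record format conversion for the JSON-to-Parquet step. The helper name `build_destination_config` and all ARNs, database, and table names are hypothetical placeholders. The sketch assumes an AWS Glue table already describes the schema, which format conversion requires. It only builds the dictionary and does not call AWS.

```python
def build_destination_config(bucket_arn, role_arn, lambda_arn, glue_database, glue_table):
    """Build a hypothetical ExtendedS3DestinationConfiguration dictionary."""
    return {
        "RoleARN": role_arn,
        "BucketARN": bucket_arn,
        # Stage 1: Firehose invokes the Lambda function that turns CSV into JSON.
        "ProcessingConfiguration": {
            "Enabled": True,
            "Processors": [{
                "Type": "Lambda",
                "Parameters": [
                    {"ParameterName": "LambdaArn", "ParameterValue": lambda_arn},
                ],
            }],
        },
        # Stage 2: built-in conversion of the JSON records to Parquet.
        "DataFormatConversionConfiguration": {
            "Enabled": True,
            # The schema is read from an AWS Glue table (assumed to exist).
            "SchemaConfiguration": {
                "RoleARN": role_arn,
                "DatabaseName": glue_database,
                "TableName": glue_table,
            },
            "InputFormatConfiguration": {"Deserializer": {"OpenXJsonSerDe": {}}},
            "OutputFormatConfiguration": {"Serializer": {"ParquetSerDe": {}}},
        },
    }
```

The result would be passed as `ExtendedS3DestinationConfiguration` to the boto3 Firehose client's `create_delivery_stream` call. Without the `ProcessingConfiguration` part, the converter would receive raw CSV, which it cannot parse.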