Exam DEA-C01
Question 98

A company wants to use machine learning (ML) to perform analytics on data that is in an Amazon S3 data lake. The company has two data transformation requirements that will give consumers within the company the ability to create reports.

The company must perform daily transformations on 300 GB of data that is in a variety of formats and that must arrive in Amazon S3 at a scheduled time. The company must also perform one-time transformations of terabytes of archived data that is in the S3 data lake. The company uses Amazon Managed Workflows for Apache Airflow (Amazon MWAA) Directed Acyclic Graphs (DAGs) to orchestrate processing.

Which combination of tasks should the company schedule in the Amazon MWAA DAGs to meet these requirements MOST cost-effectively? (Choose two.)

    Correct Answer: A, D

    AWS Glue crawlers are appropriate for the daily incoming data because they can scan the data stored in Amazon S3 and identify its schema cost-effectively. Amazon EMR is suitable for transforming both the daily and the archived data: it is flexible and handles large-scale transformations efficiently, which makes it a practical choice for the terabytes of archived data as well as the daily influx of 300 GB.
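
To make the task ordering behind answers A and D concrete, here is a minimal plain-Python sketch. It does not use Airflow or any AWS API; it only models the dependencies with the standard-library `graphlib` module. The task names (`crawl_daily_data`, `transform_daily_on_emr`, `transform_archive_on_emr`) are hypothetical and do not come from the question, and the split into two separate DAGs is an assumption.

```python
from graphlib import TopologicalSorter

# Hypothetical task names; the question does not name any tasks.
# Each key maps to the set of tasks that must finish before it can start.
daily_dag = {
    # Option A: a Glue crawler scans the daily data and identifies its schema.
    "crawl_daily_data": set(),
    # Option D: EMR transforms the daily 300 GB once the schema is known.
    "transform_daily_on_emr": {"crawl_daily_data"},
}

archive_dag = {
    # Option D: a one-time EMR run over the terabytes of archived data.
    "transform_archive_on_emr": set(),
}


def run_order(dag):
    """Return the tasks in an order that respects every dependency."""
    return list(TopologicalSorter(dag).static_order())


print(run_order(daily_dag))
print(run_order(archive_dag))
```

In a real Amazon MWAA DAG, each of these names would correspond to an Airflow task, with the daily DAG on a schedule and the archive DAG triggered once; the sketch only shows the order in which the tasks would run.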

Discussion
Ja13

A. For daily incoming data, use AWS Glue crawlers to scan and identify the schema.
D. For daily and archived data, use Amazon EMR to perform data transformations.

Here's why:

A. AWS Glue crawlers are well-suited for scanning and identifying the schema of data in S3. They are cost-effective and efficient for daily incoming data.
D. Amazon EMR is a cost-effective solution for performing large-scale data transformations. It can handle both the daily transformations of 300 GB of data and the one-time transformations of terabytes of archived data efficiently.

tgv
Options: AD

A. For daily incoming data, use AWS Glue crawlers to scan and identify the schema. This is cost-effective and simplifies the process of managing metadata.
D. For daily and archived data, use Amazon EMR to perform data transformations. EMR is suitable for both large-scale and regular transformations, offering flexibility and cost efficiency.

HunkyBunky

A / D - Looks good to me

Ja13
Options: AD

According to ChatGPT