AWS Certified Data Engineer - Associate DEA-C01 Exam QuestionsBrowse all questions from this exam

AWS Certified Data Engineer - Associate DEA-C01 Exam - Question 13


A company stores daily records of the financial performance of investment portfolios in .csv format in an Amazon S3 bucket. A data engineer uses AWS Glue crawlers to crawl the S3 data.

The data engineer must make the S3 data accessible daily in the AWS Glue Data Catalog.

Which solution will meet these requirements?

Show Answer
Correct Answer: B

The solution involves creating an IAM role that includes the AWSGlueServiceRole policy, which is specifically designed for AWS Glue operations. The role should be associated with the crawler. The S3 bucket path containing the source data needs to be specified as the crawler's data store. Additionally, creating a daily schedule will ensure that the crawler runs on a daily basis. It is important to specify a database name for the output to ensure that the crawled data is properly cataloged in the AWS Glue Data Catalog. This setup meets all the stated requirements effectively.

Discussion

5 comments
Sign in to comment
TonyStark0122
Sep 21, 2024

B. Create an IAM role that includes the AWSGlueServiceRole policy. Associate the role with the crawler. Specify the S3 bucket path of the source data as the crawler's data store. Create a daily schedule to run the crawler. Specify a database name for the output. Explanation: Option B correctly sets up the IAM role with the necessary permissions using the AWSGlueServiceRole policy, which is designed for use with AWS Glue. It specifies the S3 bucket path of the source data as the crawler's data store and creates a daily schedule to run the crawler. Additionally, it specifies a database name for the output, ensuring that the crawled data is properly cataloged in the AWS Glue Data Catalog.

GiorgioGssOption: B
Mar 7, 2024

A,C are wrong because you use don't need full S3 access. D is wrong because you don't need to provision DPU and the destination should be a database, not an s3 bucket. so it's B

k350SecopsOption: B
May 10, 2024

Glue Crawlers are serverless. Assigning DPUs is the point where i decided it option B

plutonashOption: B
Jan 12, 2025

answer B is incomplete. Even we include AWSGlueServiceRole policy on IAM role, S3 access is not garantee

LrdKanien
Oct 26, 2024

How does Glue get access to S3 if you don't do B?

LrdKanien
Oct 26, 2024

I meant A

Asmunk
Nov 1, 2024

S3 access is part of the AWSGlueServiceRole Policy https://docs.aws.amazon.com/aws-managed- policy/latest/reference/AWSGlueServiceRole.html

sam_pre
Mar 24, 2025

It adds only for the glue related buckets, but it doesnt grant permissions for S3 that we need to read in order to fetch data, isnt it ?

sam_pre
Mar 24, 2025

It adds only for the glue related buckets, but it doesnt grant permissions for S3 that we need to read in order to fetch data, isnt it ?

Asmunk
Nov 1, 2024

S3 access is part of the AWSGlueServiceRole Policy https://docs.aws.amazon.com/aws-managed- policy/latest/reference/AWSGlueServiceRole.html

sam_pre
Mar 24, 2025

It adds only for the glue related buckets, but it doesnt grant permissions for S3 that we need to read in order to fetch data, isnt it ?

sam_pre
Mar 24, 2025

It adds only for the glue related buckets, but it doesnt grant permissions for S3 that we need to read in order to fetch data, isnt it ?