MLS-C01 Exam QuestionsBrowse all questions from this exam

MLS-C01 Exam - Question 305


A company wants to forecast the daily price of newly launched products based on 3 years of data for older product prices, sales, and rebates. The time-series data has irregular timestamps and is missing some values.

Data scientist must build a dataset to replace the missing values. The data scientist needs a solution that resamples the data daily and exports the data for further modeling.

Which solution will meet these requirements with the LEAST implementation effort?

Show Answer
Correct Answer: C

Amazon SageMaker Studio Data Wrangler is designed for visual and interactive data preparation, making it easy for data scientists to clean, transform, and analyze data with minimal code. It offers over 250 built-in transformations including resampling, which is required to achieve daily resampling of time-series data. Additionally, Data Wrangler provides the capability to handle missing values efficiently and export the prepared dataset for further modeling. This makes it the most suitable option with the least implementation effort for the given requirements.

Discussion

8 comments
Sign in to comment
GS_77Option: B
Aug 23, 2024

While SageMaker Data Wrangler (option C) is also a strong contender, DataBrew is slightly easier to use and requires even less implementation effort, especially for users who may not be as familiar with the SageMaker ecosystem.

spinatram
Nov 3, 2024

C is the right one I think. For B, you should to feed this data to sagemaker which brings more operational effort than Data Wrangler

kyuhuckOption: C
Feb 7, 2024

Answer: C Explanation: Amazon SageMaker Studio Data Wrangler is a visual data preparation tool that enables users to clean and normalize data without writing any code. Using Data Wrangler, the data scientist can easily import the time-series data from various sources, such as Amazon S3, Amazon Athena, or Amazon Redshift. Data Wrangler can automatically generate data insights and quality reports, which can help identify and fix missing values, outliers, and anomalies in the data. Data Wrangler also provides over 250 built-in transformations, such as resampling, interpolation, aggregation, and filtering, which can be applied to the data with a point-and-click interface. Data Wrangler can also export the prepared data to different destinations, such as Amazon S3, Amazon SageMaker Feature Store, or Amazon SageMaker Pipelines, for further modeling and analysis. D

akdavsanOption: C
Feb 25, 2024

This is exactly what Data Wrangler is for

AdzzOption: C
Feb 27, 2024

Best for Data Wrangler

AIWaveOption: C
Mar 9, 2024

Data wrangler supports tight integration with Sagemaker and is better suited for this scenario since resampled data is used in further modelling. AWS Glue DataBrew is a data preparation service more for general purpose use.

vkbajoriaOption: C
Mar 23, 2024

Data Wrangler is better for ML work. Brew can be used as well

TogyOption: B
Mar 21, 2025

There is a need for scheduling daily resampling. This can be automated in Databrew more easily than in Data Wrangler.

youonebeOption: C
May 5, 2025

Databrew lacks explicit time-series resampling features; focuses on general ETL, not forecasting workflows.