Professional Machine Learning Engineer Exam QuestionsBrowse all questions from this exam

Professional Machine Learning Engineer Exam - Question 213


You are building a TensorFlow text-to-image generative model by using a dataset that contains billions of images with their respective captions. You want to create a low maintenance, automated workflow that reads the data from a Cloud Storage bucket collects statistics, splits the dataset into training/validation/test datasets performs data transformations trains the model using the training/validation datasets, and validates the model by using the test dataset. What should you do?

Show Answer
Correct Answer: AD

To create an automated workflow for a TensorFlow text-to-image generative model, the TensorFlow Extended (TFX) SDK is the most suitable. TFX provides specialized components for ingesting data, transforming data, training models, and validating them, which aligns well with the requirements. Deploying the workflow on Vertex AI Pipelines ensures a managed and scalable environment, facilitating integration with Google Cloud services such as Dataflow and Vertex AI for streamlined processing and training.

Discussion

7 comments
Sign in to comment
pikachu007Option: D
Jan 13, 2024

Airflow (A): While versatile, Airflow often requires more manual configuration and integration with ML services, potentially increasing maintenance effort. MLFlow (B): MLFlow focuses on experiment tracking and model management, lacking built-in pipeline components for data processing and model training. Kubeflow Pipelines (C): KFP is flexible but requires more setup and infrastructure management compared to TFX's managed services.

pinimichele01
Apr 8, 2024

https://cloud.google.com/vertex-ai/docs/pipelines/build-pipeline#sdk

BlehMaksOption: C
Jan 16, 2024

https://cloud.google.com/vertex-ai/docs/pipelines/build-pipeline#sdk

winston9Option: D
Jan 19, 2024

C and D are valid options. if the model is created in TF, use TFX, in any other case, use KFP; therefore, here is D

pinimichele01
Apr 8, 2024

https://cloud.google.com/vertex-ai/docs/pipelines/build-pipeline#sdk

fitri001Option: C
Apr 18, 2024

KFP Pipelines: Kubeflow Pipelines (KFP) is a popular open-source framework for building and deploying machine learning workflows. It provides a user-friendly SDK for defining pipelines as components and simplifies workflow orchestration. Vertex AI Pipelines Integration: Vertex AI Pipelines is a managed service from Google Cloud that integrates seamlessly with KFP. You can deploy your KFP-defined workflow on Vertex AI Pipelines, leveraging its features like scheduling, monitoring, and versioning. Dataflow and Vertex AI Services: Both Dataflow and Vertex AI are Google Cloud services well-suited for this workflow

fitri001
Apr 18, 2024

why not others? A. Airflow with Dataflow and Vertex AI: While Airflow is a powerful workflow management tool, deploying it on Cloud Composer adds additional complexity compared to the managed environment of Vertex AI Pipelines. B. MLflow with Dataflow and Vertex AI: MLflow focuses primarily on model lifecycle management. While it can be used for building pipelines, KFP offers a more specialized and user-friendly approach for this specific use case. D. TFX with Dataflow and Vertex AI: TFX is a comprehensive end-to-end ML platform. While it offers several functionalities, it might be an overkill for this scenario focusing on data processing, training, and validation. KFP provides a simpler solution for this specific workflow.

PhilipKokuOption: D
Jun 10, 2024

D) TFX is the way forward as it has services to support every step of the use case presented.

pinimichele01Option: C
Apr 8, 2024

If you use TensorFlow in an ML workflow that processes terabytes of structured data or text data, we recommend that you build your pipeline using TFX. For other use cases, we recommend that you build your pipeline using the Kubeflow Pipelines SDK https://cloud.google.com/vertex-ai/docs/pipelines/build-pipeline#sdk

dija123Option: D
Jun 28, 2024

Agree with TFX