Professional Machine Learning Engineer Exam - Question 213

Question

You are building a TensorFlow text-to-image generative model by using a dataset that contains billions of images with their respective captions. You want to create a low maintenance, automated workflow that reads the data from a Cloud Storage bucket collects statistics, splits the dataset into training/validation/test datasets performs data transformations trains the model using the training/validation datasets, and validates the model by using the test dataset. What should you do?

Examice · Accepted Answer

To create an automated workflow for a TensorFlow text-to-image generative model, the TensorFlow Extended (TFX) SDK is the most suitable. TFX provides specialized components for ingesting data, transforming data, training models, and validating them, which aligns well with the requirements. Deploying the workflow on Vertex AI Pipelines ensures a managed and scalable environment, facilitating integration with Google Cloud services such as Dataflow and Vertex AI for streamlined processing and training.

pikachu007 · Answer

Airflow (A): While versatile, Airflow often requires more manual configuration and integration with ML services, potentially increasing maintenance effort.
MLFlow (B): MLFlow focuses on experiment tracking and model management, lacking built-in pipeline components for data processing and model training.
Kubeflow Pipelines (C): KFP is flexible but requires more setup and infrastructure management compared to TFX's managed services.

BlehMaks · Answer

https://cloud.google.com/vertex-ai/docs/pipelines/build-pipeline#sdk

winston9 · Answer

C and D are valid options. if the model is created in TF, use TFX, in any other case, use KFP; therefore, here is D

fitri001 · Answer

KFP Pipelines: Kubeflow Pipelines (KFP) is a popular open-source framework for building and deploying machine learning workflows. It provides a user-friendly SDK for defining pipelines as components and simplifies workflow orchestration.
Vertex AI Pipelines Integration: Vertex AI Pipelines is a managed service from Google Cloud that integrates seamlessly with KFP. You can deploy your KFP-defined workflow on Vertex AI Pipelines, leveraging its features like scheduling, monitoring, and versioning.
Dataflow and Vertex AI Services: Both Dataflow and Vertex AI are Google Cloud services well-suited for this workflow

PhilipKoku · Answer

D) TFX is the way forward as it has services to support every step of the use case presented.

pinimichele01 · Answer

If you use TensorFlow in an ML workflow that processes terabytes of structured data or text data, we recommend that you build your pipeline using TFX.
For other use cases, we recommend that you build your pipeline using the Kubeflow Pipelines SDK

https://cloud.google.com/vertex-ai/docs/pipelines/build-pipeline#sdk

dija123 · Answer

Agree with TFX

Professional Machine Learning Engineer Exam - Question 213

Discussion