Professional Machine Learning Engineer Exam - Question 162

Question

You built a custom ML model using scikit-learn. Training time is taking longer than expected. You decide to migrate your model to Vertex AI Training, and you want to improve the model’s training time. What should you try out first?

Examice · Accepted Answer

Scikit-learn, the library in question, does not natively support GPU training. It is specifically designed to run on CPUs. While distributed computing can speed up training time for large datasets, the first step should be training your model using Vertex AI Training with CPUs to leverage any optimizations and scalability offered by Vertex AI specifically for CPU-based training.

b1a8fae · Answer

SK-Learn offers no GPU support. Answer is B!

AnnaR · Answer

B is correct, because scikit only has CPU support for the following services: 
- prebuilt containers for custom training (this is the case here)
- prebuilt containers for predictions and explanations 
- Vertex AI Pipelines 
- Vertex AI Workbench user-managed notebooks
https://cloud.google.com/vertex-ai/docs/supported-frameworks-list#scikit-learn_2

mlx · Answer

B. Train your model using Vertex AI Training with CPUs.
No GPUs for ScikitLearn, but parrallelize/distribute training is a good way to increase model building

vale_76_na_xxx · Answer

Why no A?

VMHarry · Answer

GPU helps speeding up training process

guilhermebutzke · Answer

Scikit-learn doesn't natively support GPUs for training. However, many scikit-learn algorithms rely on libraries like NumPy and SciPy. These libraries can leverage GPUs if they're available on the system, potentially benefiting scikit-learn models indirectly.

Carlose2108 · Answer

scikit-learn no GPU support.

AzureDP900 · Answer

You decided to migrate to Vertex AI, If you have a model that requires significant computational resources and doesn't rely heavily on specialized GPU operations (like those in option D), then option B might still be a good choice. However, if your model is computationally intensive or involves complex neural network architectures I would go with D instead of B.

TanTran04 · Answer

Scikit-learn is not intended to be used as a deep-learning framework and it does not provide any GPU support. (Ref: https://stackoverflow.com/questions/41567895/will-scikit-learn-utilize-gpu).
So I go with B

Professional Machine Learning Engineer Exam - Question 162

Discussion