Certified Machine Learning Associate

Practice exam questions for the Databricks Certified Machine Learning Associate certification

  • Preview the first 5 of 140 questions for free
  • These questions were last updated on May 5, 2026
  • This site is not affiliated with or endorsed by Databricks.
Question 1 of 140

A machine learning engineer has created a Feature Table new_table using Feature Store Client fs. When creating the table, they specified a metadata description with key information about the Feature Table. They now want to retrieve that metadata programmatically.

Which of the following lines of code will return the metadata description?

Suggested Answer

The suggested answer is C.

To retrieve the metadata description of a Feature Table that was created using the Feature Store Client, fetch the table object with `fs.get_table` and read its `description` attribute. Therefore, the line of code `fs.get_table("new_table").description` will return the metadata description.

Community Votes: 13 votes
C (Suggested): 100%
Question 2 of 140

A data scientist has a Spark DataFrame spark_df. They want to create a new Spark DataFrame that contains only the rows from spark_df where the value in column price is greater than 0.

Which of the following code blocks will accomplish this task?

Suggested Answer

The suggested answer is B.

To keep only the rows of a Spark DataFrame where a column meets a condition, use the `filter` method. A standard way to reference a column inside `filter` is the `col` function from `pyspark.sql.functions`. Therefore, the code block that creates a new DataFrame containing only rows where the `price` column is greater than 0 is `spark_df.filter(col('price') > 0)`. The other provided options either use incorrect syntax or rely on methods that are not suitable for a Spark DataFrame.

Community Votes: 5 votes
B (Suggested): 80%
A: 20%
Question 3 of 140

A health organization is developing a classification model to determine whether or not a patient currently has a specific type of infection. The organization's leaders want to maximize the number of positive cases identified by the model.

Which of the following classification metrics should be used to evaluate the model?

Suggested Answer

The suggested answer is E.

Recall should be used to evaluate the model because it measures the proportion of actual positive cases that the model correctly identifies: true positives divided by the sum of true positives and false negatives. This matters for the health organization because they want to maximize the number of positive cases detected, which means minimizing the number of positive cases that are missed.
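As a rough illustration of that definition, here is a minimal Python sketch that computes recall (and precision, for contrast) from scratch. The labels and the function names `recall` and `precision` are made up for this example and are not taken from the exam.

```python
def recall(y_true, y_pred):
    """Fraction of actual positives (label 1) that were predicted positive."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    return tp / (tp + fn) if (tp + fn) else 0.0


def precision(y_true, y_pred):
    """Fraction of predicted positives that are actually positive."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    return tp / (tp + fp) if (tp + fp) else 0.0


# Hypothetical labels: 1 = infected, 0 = not infected.
y_true = [1, 1, 1, 1, 0, 0, 0, 0]
y_pred = [1, 1, 1, 0, 1, 1, 0, 0]

print("recall:", recall(y_true, y_pred))        # 3 of 4 infected patients found
print("precision:", precision(y_true, y_pred))  # 3 of 5 positive predictions correct
```

In this toy data the model misses one infected patient, and that miss is exactly what recall penalizes; the two false alarms lower precision only.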

Community Votes: 2 votes
E (Suggested): 100%
Question 4 of 140

In which of the following situations is it preferable to impute missing feature values with their median value over the mean value?

Suggested Answer

The suggested answer is C.

When features contain many extreme outliers, imputing missing values with the median is preferable to using the mean because the median is far less affected by extreme values. The mean is pulled toward the outliers, so it can misrepresent the central tendency of the data.
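The effect is easy to see on a small example. The sketch below uses only Python's standard `statistics` module; the `impute` helper and the `prices` values are hypothetical and chosen to include one extreme outlier.

```python
import statistics


def impute(values, strategy="median"):
    """Replace None entries with the median or mean of the observed values."""
    observed = [v for v in values if v is not None]
    if strategy == "median":
        fill = statistics.median(observed)
    else:
        fill = statistics.mean(observed)
    return [fill if v is None else v for v in values]


# Hypothetical feature with one missing value and one extreme outlier (1000).
prices = [10, 12, None, 11, 13, 1000]

median_filled = impute(prices, "median")
mean_filled = impute(prices, "mean")

print("median fill:", median_filled[2])  # close to the typical values
print("mean fill:", mean_filled[2])      # dragged upward by the outlier
```

With the median, the imputed value sits among the typical observations; with the mean, the single outlier moves it far away from every ordinary value.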

Community Votes: 2 votes
C (Suggested): 100%
Question 5 of 140

A data scientist has replaced missing values in their feature set with each respective feature variable’s median value. A colleague suggests that the data scientist is throwing away valuable information by doing this.

Which of the following approaches can they take to include as much information as possible in the feature set?

Suggested Answer

The suggested answer is D.

Creating a binary feature variable for each feature that contained missing values lets the model see which values were originally missing and then imputed. This retains the information about missingness, which can sometimes hold predictive power and improve model performance.
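A minimal sketch of this idea in plain Python follows. The helper `impute_with_indicator`, the `_was_missing` suffix, and the sample rows are all assumptions made for illustration; real pipelines would typically do this with a DataFrame library.

```python
import statistics


def impute_with_indicator(rows, feature):
    """Median-impute one feature and add a 0/1 column marking imputed rows."""
    observed = [r[feature] for r in rows if r[feature] is not None]
    fill = statistics.median(observed)
    result = []
    for r in rows:
        was_missing = r[feature] is None
        new_row = dict(r)  # copy so the input rows are left unchanged
        new_row[feature] = fill if was_missing else r[feature]
        new_row[f"{feature}_was_missing"] = int(was_missing)
        result.append(new_row)
    return result


# Hypothetical data: the second row has no recorded age.
rows = [{"age": 30}, {"age": None}, {"age": 50}, {"age": 40}]
result = impute_with_indicator(rows, "age")

for row in result:
    print(row)
```

The imputed row still carries a usable `age`, while the indicator column preserves the fact that the value was not actually observed.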

Community Votes: 3 votes
D (Suggested): 100%


Frequently Asked Questions

Everything you need to know. Contact us for more.

Our Databricks Certified Machine Learning Associate questions are based on real exam experiences and are continuously updated to match the current exam format. We maintain a 99%+ pass rate because our questions closely mirror what you'll see on the actual exam.

With our Premium package, you get a 100% money-back guarantee. If you don't pass your exam after studying with our materials, simply contact us with your exam results and we'll refund your purchase. Terms and conditions apply; read our full refund policy to learn more.

Our question bank is updated regularly based on feedback from recent exam takers. We typically review and update our content every week with reports about new questions or changes to the exam format.

Standard package access cannot be extended. However, the Premium package gives you 90 days, which is typically more than enough time to prepare thoroughly. If you need additional time, you can purchase a new package at any time.

This is a one-time payment with no recurring charges. Once you purchase, you get full access to all exam questions for the duration of your package (30 days for Standard, 90 days for Premium). No hidden fees or automatic renewals.
