SAS Statistical Business Analysis Using SAS 9: Regression and Modeling

Here you have the best SAS Institute A00-240 practice exam questions

You have 33 total questions to study from
Each page has 5 questions, making a total of 7 pages
You can navigate through the pages using the buttons at the bottom
This questions were last updated on June 23, 2025
This site is not affiliated with or endorsed by SAS Institute.

Question 1 of 33

When mean imputation is performed on data after the data is partitioned for honest assessment, what is the most appropriate method for handling the mean imputation?

The sample means from the validation data set are applied to the training and test data sets.

The sample means from the training data set are applied to the validation and test data sets.

The sample means from the test data set are applied to the training and validation data sets.

The sample means from each partition of the data are applied to their own partition.

Correct Answer: B

The sample means from the training data set are applied to the validation and test data sets. This is to ensure that the imputation strategy appropriately reflects the training data, which is the basis for model learning. Applying the mean imputation from the training set to the validation and test sets avoids data leakage and ensures a fair assessment of the model performance on unseen data.

Question 2 of 33

This question will ask you to provide missing code segments.

A logistic regression model was fit on a data set where 40% of the outcomes were events (TARGET=1) and 60% were non-events (TARGET=0). The analyst knows that the population where the model will be deployed has 5% events and 95% non-events. The analyst also knows that the company's profit margin for correctly targeted events is nine times higher than the company's loss for incorrectly targeted non-event.

Given the following SAS program:

What X and Y values should be added to the program to correctly score the data?

X=40, Y=10

X=.05, Y=10

X=.05, Y=.40

X=.10, Y=05

Correct Answer: C

The logistic regression model's priorevent parameter (X) should be set to the proportion of events in the population where the model will be deployed, which is 0.05. For the Y value, to correctly target events considering the company's profit margin, you need to set a cut-off probability for deciding if an observation should be classified as an event (TARGET=1). The ratio of the profit margin for correctly targeted events to the cost of incorrectly targeted non-events suggests a threshold calculation. Since the logistic model was originally fit on a population with 40% events, Y should be set to 0.40 to match this ratio. Thus, X should be 0.05 and Y should be 0.40.

Question 3 of 33

An analyst has a sufficient volume of data to perform a 3-way partition of the data into training, validation, and test sets to perform honest assessment during the model building process.

What is the purpose of the training data set?

To provide an unbiased measure of assessment for the final model.

To compare models and select and fine-tune the final model.

To reduce total sample size to make computations more efficient.

To build the predictive models.

Correct Answer: D

The purpose of the training data set is to build the predictive models. The training data is used by the machine learning algorithm to learn patterns and make predictions. It forms the basis from which the model learns and optimizes its parameters.

Question 4 of 33

There are missing values in the input variables for a regression application.

Which SAS procedure provides a viable solution?

GLM

VARCLUS

STDI2E

CLUSTER

Correct Answer: C

When dealing with missing values in the input variables for a regression application, the SAS procedure that provides a viable solution is STDIZE (not STDI2E as there seems to be a typo). The STDIZE procedure is used to standardize variables, and it can handle missing values by computing statistics that exclude those missing values.

Question 5 of 33

Refer to the following exhibit:

What is a correct interpretation of this graph?

The association between the continuous predictor and the binary response is quadratic.

The association between the continuous predictor and the log-odds is quadratic.

The association between the continuous predictor and the continuous response is quadratic.

The association between the binary predictor and the log-odds is quadratic.

Correct Answer: B

The graph displays the empirical logit, which relates to the log-odds of a binary response variable, plotted against a continuous predictor variable (x). The graph shows a quadratic curve fit through these points, indicating that the relationship between the continuous predictor (x) and the log-odds of the binary response is quadratic.