Labour Day Special - Limited Time 65% Discount Offer - Ends in 0d 00h 00m 00s - Coupon code: top65certs

SAS Institute A00-240 Actual Questions

Page: 2 / 4
Total 99 questions

SAS Statistical Business Analysis SAS9: Regression and Model Questions and Answers

Question 5

The PROC LOGISTIC options SELECTION=SCORE and BEST=2 are used in a MODEL statement to generate a series of predictive models. The models are assigned numbers in order from 1 to 99 reflecting the fact that there are 50 candidate input variables. Results from the collection of derived models are used to generate the following plot of overall average profit by model number. Results are restricted to models with at least 9 inputs and at most 40 inputs.

The maximum value for the training data occurs for model number 46, and the maximum value for the validation data occurs for model number 43.

If you base model selection solely on overall average profit, what is the correct choice?

Options:

A.

Select model 46

B.

Select model 43

C.

Select model 45

D.

Select model 21

Question 6

The SAS data set RESULT contains the following variables:

  • Region (GrpA or GrpB)
  • Sales (dollars per year)

Which SAS programs can be used to find the p-value for comparing GrpA sales with GrpB sales? (Choose two.)

Options:

A.

Option A

B.

Option B

C.

Option C

D.

Option D

Question 7

An analyst has a sufficient volume of data to perform a 3-way partition of the data into training, validation, and test sets to perform honest assessment during the model building process.

What is the purpose of the training data set?

Options:

A.

To provide an unbiased measure of assessment for the final model.

B.

To compare models and select and fine-tune the final model.

C.

To reduce total sample size to make computations more efficient.

D.

To build the predictive models.

Question 8

A predictive model uses a data set that has several variables with missing values.

What two problems can arise with this model? (Choose two.)

Options:

A.

The model will likely be overfit.

B.

There will be a high rate of collinearity among input variables.

C.

Complete case analysis means that fewer observations will be used in the model building process.

D.

New cases with missing values on input variables cannot be scored without extra data processing.

Page: 2 / 4
Total 99 questions