SAS Institute Related Exams
A00-240 Exam

An analyst has a sufficient volume of data to perform a 3-way partition of the data into training, validation, and test sets to perform honest assessment during the model building process.
What is the purpose of the training data set?
What is the default method in the LOGISTIC procedure to handle observations with missing data?
An analyst knows that the categorical predictor, zip_code, is an important predictor of a binary target. However, zip_code has too many levels to be a feasible predictor in a model. The analyst uses PROC CLUSTER to implement Greenacre's method to reduce the number of categorical levels.
What is the correct application of Greenacre's method in this situation?