CompTIA DataX (DY0-001) Exam Questions and Answers

Page: 3 / 6
Total 85 questions

Question 9

A data analyst is examining the correlation matrix of a new data set to identify issues that could adversely impact model performance. Which of the following is the analyst most likely checking for?

Options:

A. Undersampling
B. Multicollinearity
C. Oversampling
D. Overfitting
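
As a rough illustration of what inspecting a correlation matrix can reveal, the sketch below computes pairwise Pearson correlations and flags feature pairs that move almost in lockstep. The feature names, the sample values, and the 0.9 threshold are all made up for this example and are not part of the question.

```python
from itertools import combinations
from math import sqrt


def pearson(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / sqrt(var_x * var_y)


def highly_correlated_pairs(features, threshold=0.9):
    """Return feature pairs whose absolute correlation meets the threshold."""
    flagged = []
    for (name_a, col_a), (name_b, col_b) in combinations(features.items(), 2):
        r = pearson(col_a, col_b)
        if abs(r) >= threshold:
            flagged.append((name_a, name_b, round(r, 3)))
    return flagged


# Hypothetical data: size_sqm is size_sqft rescaled, so the two are nearly collinear.
features = {
    "size_sqft": [850, 1200, 1500, 2000, 2400],
    "size_sqm": [79, 111, 139, 186, 223],
    "age_years": [30, 5, 18, 42, 11],
}

flagged = highly_correlated_pairs(features)
print(flagged)
```

Strongly correlated predictors such as the two size columns carry overlapping information, which is the kind of issue a correlation matrix makes visible before a model is fitted.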

Question 10

A data scientist is using the following confusion matrix to assess model performance:

                        Actually Fails    Actually Succeeds
Predicted to Fail            80%                20%
Predicted to Succeed         15%                85%

The model is predicting whether a delivery truck will be able to make 200 scheduled delivery stops.

Every time the model is correct, the company saves 1 hour in planning and scheduling.

Every time the model is wrong, the company loses 4 hours of delivery time.

Which of the following is the net model impact for the company?

Options:

A. 25 hours lost
B. 25 hours saved
C. 165 hours lost
D. 165 hours saved
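
The arithmetic behind this question can be sketched as follows. The question does not say how the 200 cases are distributed, so this sketch assumes an even split of 100 predictions per row, with each row's percentages applied to those 100 predictions. That split, and the variable names, are assumptions made for illustration only.

```python
# Assumption (not stated in the question): the 200 cases split evenly,
# 100 predicted "fail" and 100 predicted "succeed", and each row of the
# matrix is a percentage of that row's 100 predictions.
PREDICTIONS_PER_ROW = 100
HOURS_SAVED_PER_CORRECT = 1
HOURS_LOST_PER_WRONG = 4

# Rows: predicted class. Columns: (actually fails, actually succeeds).
matrix = {
    "predicted_fail": (0.80, 0.20),
    "predicted_succeed": (0.15, 0.85),
}

# Correct predictions sit on the diagonal of the matrix.
correct = round(
    (matrix["predicted_fail"][0] + matrix["predicted_succeed"][1])
    * PREDICTIONS_PER_ROW
)
# Wrong predictions sit off the diagonal.
wrong = round(
    (matrix["predicted_fail"][1] + matrix["predicted_succeed"][0])
    * PREDICTIONS_PER_ROW
)

# A positive result means hours saved, a negative result means hours lost.
net_hours = correct * HOURS_SAVED_PER_CORRECT - wrong * HOURS_LOST_PER_WRONG
print(correct, wrong, net_hours)
```

Under that assumption there are 165 correct and 35 wrong predictions, so the net effect is 165 × 1 − 35 × 4 = 25 hours in the company's favor. A different split of the 200 cases would change the counts.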

Question 11

A data scientist is working with a data set that covers a two-year period for a large number of machines. The data set contains:

    Machine system ID numbers

    Sensor measurement values

    Daily timestamps for each machine

The data scientist needs to plot the total measurements from all the machines over the entire time period. Which of the following is the best way to present this data?

Options:

A. Scatter plot
B. Line plot
C. Histogram
D. Box-and-whisker plot
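
A minimal sketch of the preparation step this scenario implies: summing the sensor values across machines for each day, which yields one time-ordered series. The records, machine IDs, and function names are hypothetical. The optional plotting helper assumes matplotlib is installed and is not called here.

```python
from collections import defaultdict
from datetime import date

# Hypothetical records: (machine_id, day, sensor_value).
records = [
    ("M1", date(2023, 1, 1), 10.0),
    ("M2", date(2023, 1, 1), 12.5),
    ("M1", date(2023, 1, 2), 11.0),
    ("M2", date(2023, 1, 2), 13.0),
    ("M1", date(2023, 1, 3), 9.5),
    ("M2", date(2023, 1, 3), 14.0),
]


def daily_totals(rows):
    """Sum sensor values across all machines per day, sorted by date."""
    totals = defaultdict(float)
    for _machine_id, day, value in rows:
        totals[day] += value
    return sorted(totals.items())


def plot_totals(series):
    """Draw the totals over time (requires matplotlib, imported lazily)."""
    import matplotlib.pyplot as plt

    days, totals = zip(*series)
    plt.plot(days, totals)
    plt.xlabel("Date")
    plt.ylabel("Total sensor measurement")
    plt.show()


series = daily_totals(records)
print(series)
```

Because the aggregated result is a single value per timestamp, it reads naturally as a continuous trend over the two-year window.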

Question 12

Which of the following distance metrics for KNN is best described as a straight line?

Options:

A. Radial
B. Euclidean
C. Cosine
D. Manhattan
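
For comparison, the sketch below computes three of the listed metrics for one pair of points. The points are arbitrary and chosen only so the numbers are easy to check by hand. Radial is left out because it usually refers to a kernel function rather than a distance metric.

```python
from math import dist, sqrt

a = (1.0, 0.0)
b = (4.0, 4.0)

# Euclidean: length of the straight line segment joining the two points.
euclidean = dist(a, b)

# Manhattan: sum of absolute differences along each axis (grid-style path).
manhattan = sum(abs(p - q) for p, q in zip(a, b))

# Cosine distance: 1 minus the cosine of the angle between the two vectors,
# so it depends on direction rather than on how far apart the points are.
dot = sum(p * q for p, q in zip(a, b))
norm_a = sqrt(sum(p * p for p in a))
norm_b = sqrt(sum(q * q for q in b))
cosine_distance = 1 - dot / (norm_a * norm_b)

print(euclidean, manhattan, cosine_distance)
```

Here the straight-line distance is 5 (a 3-4-5 triangle), while the axis-by-axis path is 3 + 4 = 7.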
