Free and Premium EMC D-DS-FN-23 Dumps Questions Answers

Page: 1 / 4
Total 59 questions

Dell Data Science Foundations Questions and Answers

Question 1

On which type of data should you run K-means clustering?

Options:

A.

Ordinal

B.

Numeric

C.

Text

D.

Nominal

Question 2

Refer to the exhibit.

What is the approximate R-squared value for a linear regression model fitted to the data associated with this scatterplot?

Options:

A.

4

B.

0.96

C.

0.25

D.

16

Question 3

After which phase of the data analytics lifecycle should you determine if the model needs any recalibration?

Options:

A.

Model planning

B.

Data preparation

C.

Discovery

D.

Operationalize

Question 4

What type of variable is the dependent variable from a logistic regression?

Options:

A.

Categorical

B.

Continuous

C.

Ratio

D.

Interval

Question 5

In K-means clustering, what is a graph of the WSS versus the value of K used to help determine?

Options:

A.

Optimal distance between clusters

B.

Average distance between observations

C.

Optimal number of clusters

D.

Average distance between clusters
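
The graph of WSS against K that this question refers to can be produced by running K-means for several values of K and recording the within-cluster sum of squares each time. The following is only a minimal sketch in plain Python: the sample points, the helper names (`sq_dist`, `centroid_of`, `kmeans_wss`), the iteration count, and the random seed are all illustrative assumptions, not part of the exam material.

```python
import random

def sq_dist(a, b):
    """Squared Euclidean distance between two points given as tuples."""
    return sum((x - y) ** 2 for x, y in zip(a, b))

def centroid_of(cluster):
    """Coordinate-wise mean of a non-empty list of points."""
    n = len(cluster)
    return tuple(sum(coords) / n for coords in zip(*cluster))

def kmeans_wss(points, k, iterations=25, seed=0):
    """Run a basic K-means and return the within-cluster sum of squares (WSS)."""
    rng = random.Random(seed)
    centroids = rng.sample(points, k)  # start from k distinct data points
    for _ in range(iterations):
        clusters = [[] for _ in range(k)]
        for p in points:
            nearest = min(range(k), key=lambda i: sq_dist(p, centroids[i]))
            clusters[nearest].append(p)
        # Keep the old centroid if a cluster ends up empty.
        centroids = [centroid_of(c) if c else centroids[i]
                     for i, c in enumerate(clusters)]
    # WSS: squared distance from each point to its nearest centroid.
    return sum(min(sq_dist(p, c) for c in centroids) for p in points)

# Hypothetical 2-D data, chosen only for illustration.
points = [(1.0, 1.0), (1.5, 2.0), (1.0, 1.5),
          (8.0, 8.0), (8.5, 9.0), (9.0, 8.0),
          (1.0, 9.0), (1.5, 8.5), (2.0, 9.5)]

wss_by_k = {k: kmeans_wss(points, k) for k in range(1, 6)}
for k, wss in wss_by_k.items():
    print(f"K={k}  WSS={wss:.2f}")
```

Plotting the printed pairs with K on the horizontal axis gives the WSS-versus-K curve described in the question.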

Question 6

When using association rules, what is an itemset?

Options:

A.

Set of continuous variables that are linked

B.

Set of discrete variables that are linked

C.

Support

D.

Confidence

Question 7

When building a K-means clustering model, you notice that the clusters did not segment on variables that you expected. What should you do?

Options:

A.

Decrease the value of K

B.

Multiply each variable by its standard deviation

C.

Add the WSS to each variable

D.

Check that the data was properly scaled

Question 8

Refer to the exhibit.

To predict whether or not a customer will renew their annual property insurance policy, an insurance company built and operationalized a naïve Bayes classification model. In the model, there are two class labels, renewal and non-renewal, that are assigned to each customer based on their attributes.

A subset of the key attributes, their values, and corresponding conditional probabilities are provided in the exhibit.

A customer has the following attributes:

● Age is greater than 65 years

● Owns their own home

● Renewal month is August

If 20% of customers do not renew the policy every year, what is the score for a renewal in the naïve Bayesian model for the customer described above?

Options:

A.

0.0022

B.

0.0027

C.

0.0270

D.

0.0216
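
The exhibit with the conditional probabilities is not reproduced on this page, so the question cannot be worked from the text alone. As a hedged sketch of the general calculation, a naïve Bayes score for a class is the class prior multiplied by the conditional probability of each observed attribute given that class. The prior below follows from the 20% non-renewal rate stated in the question; the three conditional probabilities are placeholder values assumed purely for illustration and are not the exhibit's figures.

```python
from math import prod

def naive_bayes_score(prior, conditionals):
    """Score for one class: prior times the product of P(attribute | class)."""
    return prior * prod(conditionals)

# From the question: 20% of customers do not renew, so P(renewal) = 0.80.
prior_renewal = 1 - 0.20

# HYPOTHETICAL placeholders for P(age > 65 | renewal),
# P(owns home | renewal) and P(renewal month = August | renewal).
# Replace them with the values from the exhibit.
hypothetical_conditionals = [0.5, 0.4, 0.25]

score = naive_bayes_score(prior_renewal, hypothetical_conditionals)
print(score)
```

The same function applied with the non-renewal prior and the non-renewal conditionals gives the competing score; the class with the larger score is the one assigned.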

Question 9

What metrics are used to help calculate relevance in text analysis?

Options:

A.

TF and R square

B.

IDF and information gain

C.

Information gain and confidence interval

D.

TF and IDF
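
For reference, term frequency (TF) and inverse document frequency (IDF), which appear among the options above, can be computed as in the sketch below. It assumes one common variant (raw counts for TF and the natural log of corpus size over document frequency for IDF); the tiny corpus and the function names are made up for illustration.

```python
import math

# Hypothetical three-document corpus, already tokenized.
docs = [
    "the model predicts renewal".split(),
    "the model clusters customers".split(),
    "the customers renew the policy".split(),
]

def tf(term, doc):
    """Term frequency: raw count of the term in one document."""
    return doc.count(term)

def idf(term, corpus):
    """Inverse document frequency: log(N / number of documents containing the term)."""
    df = sum(1 for doc in corpus if term in doc)
    return math.log(len(corpus) / df) if df else 0.0

def tf_idf(term, doc, corpus):
    """Product of TF and IDF for a term in a given document."""
    return tf(term, doc) * idf(term, corpus)

for term in ("the", "customers", "policy"):
    print(term, round(tf_idf(term, docs[2], docs), 4))
```

A term that occurs in every document gets an IDF of zero under this variant, so its product with TF is zero regardless of how often it appears.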

Question 10

Which SQL OLAP grouping extension returns a result for each output row with 1 identifying a summary row and 0 identifying grouped rows?

Options:

A.

CUBE

B.

GROUPING

C.

GROUP_ID

D.

ROLLUP

Question 11

What does “MAD” in MADlib stand for?

Options:

A.

Magnetic Association Design

B.

Magnetic Agile Deep

C.

Multiple Agile Development

D.

Multiple Access Design

Question 12

What data asset is an example of quasi-structured data?

Options:

A.

Excel file

B.

Clickstream data

C.

Relational database table

D.

Comma-separated value file

Question 13

What is the purpose of applying the naïve Bayes conditional independence assumption?

Options:

A.

To simplify the probability calculations

B.

To calculate the probability of rare events

C.

To minimize rounding errors in probability calculations

D.

To accurately calculate each probability

Question 14

Match each task to its description.

Options:

Question 15

What is a key consideration when preparing a presentation intended for analysts?

Options:

A.

Describe how to implement the model

B.

Provide talking points to promote or evangelize the project

C.

Emphasize the business benefits of implementing the model

D.

Focus on clean simple-to-understand visuals

Question 16

What action occurs during feature selection in the model building phase of the data analytics lifecycle?

Options:

A.

Create new combinations of attributes

B.

Overfit the model to improve prediction accuracy

C.

Identify the most useful input variables

D.

Select a superset of variables to shorten training times

Question 17

Which R function plots a distribution of a single variable along two different axes?

Options:

A.

table()

B.

summary()

C.

density()

D.

rug()
