Month End Sale 70% Discount Offer - Ends in 0d 00h 00m 00s - Coupon code: save70

Free and Premium DASCA SDS Dumps Questions Answers

Page: 1 / 6
Total 85 questions

Senior Data Scientist Questions and Answers

Question 1

IoT is built on:

Options:

A.

Cloud Computing

B.

Networks of data gathering devices

C.

Both A and B

D.

None of the above

Buy Now
Question 2

Which of the following is TRUE for "By" analysis?

Options:

A.

The "By" analysis technique reinforces the process of "thinking like a data scientist."

B.

"By" analysis is a technique by which business subject matter experts (SMEs) and the Data Science team could collaborate to uncover new variables and metrics that might be better predictors of business performance.

C.

"By" analysis is used to create a collaborative technique to drive alignment between the business users and the data scientists to identify and brainstorm variables and metrics that might be better predictors of business performance.

D.

Both B and C

E.

All of the above

Question 3

Exploratory analytic algorithms help the Data Science team to better:

Options:

A.

Understand the data content

B.

Gain a high-level understanding of relationships

C.

Understand patterns in the data

D.

Both A and B

E.

All of the above

Question 4

Which of these are open-source column-oriented databases?

Options:

A.

Cassandra

B.

HBase

C.

Accumulo

D.

Both A and B

E.

All of the above

Question 5

Designing an algorithm to play chess is usually an example of which type of machine learning?

Options:

A.

Reinforcement learning

B.

Pattern density

C.

Supervised learning

D.

Clustering

Question 6

Which of the following statements is correct?

Options:

A.

Apache claimed that Spark is able to run parallel jobs 100 times faster in memory and 10 times faster on disk in comparison to the traditional Hadoop MapReduce

B.

Apache claimed that Spark is able to run parallel jobs 10 times faster in memory and 100 times faster on disk in comparison to the traditional Hadoop MapReduce

C.

Apache claimed that Spark is able to run parallel jobs 1000 times faster in memory and 100 times faster on disk in comparison to the traditional Hadoop MapReduce

D.

Apache claimed that Spark is able to run parallel jobs 50 times faster in memory and 5 times faster on disk in comparison to the traditional Hadoop MapReduce

Question 7

Tar is an example of:

Options:

A.

Archive file format

B.

CSV file format

C.

ARV file format

D.

Text file format

E.

None of the above

Question 8

Self-driving car is an example of:

Options:

A.

Supervised learning

B.

Unsupervised learning

C.

Reinforcement learning

D.

All of the above

Question 9

Which of the following is a trend analysis component of time series decomposition?

Options:

A.

Cyclical

B.

Seasonal

C.

Irregular

D.

Both A and B

E.

All of the above

Question 10

The Big Data Vision Workshop process is ideal for organizations who:

Options:

A.

Have a desire to leverage Big Data to transform their business but do not know where and how to start

B.

Have a wealth of data that they do not know how to monetize

C.

Have a desire to leverage the Big Data Vision Workshop to identify where and how to leverage data and analytics to power their business models

D.

Both A and B

E.

All of the above

Question 11

Which of the following is NOT an example of graphical model?

Options:

A.

Road maps

B.

Electrical circuits

C.

Computer networks

D.

Geographical networks

E.

Flow charts

Question 12

Spark programs can be written in:

Options:

A.

Java

B.

Scala

C.

Python

D.

All of the above

E.

None of the above

Question 13

Which of the following is main Machine Learning Library in Python?

Options:

A.

NumPy

B.

Scikit-learn

C.

Matplotlib

D.

SciPy

E.

None of the above

Question 14

Which of the following is the most important part of Hadoop?

Options:

A.

Hadoop Distributed File System (HDFS)

B.

MapReduce Framework

C.

Spark Framework

D.

Both A and B

E.

Both B and C

Question 15

Which of the following is NOT a correct situation to use Agile?

Options:

A.

When the final product isn’t clearly defined

B.

When clients/stakeholders need to be able to change the scope

C.

When changes need to be implemented during the entire process

D.

None of the above

Question 16

What is the agenda of discussion at a "stand up" meeting of an Agile team?

Options:

A.

What they accomplished the previous day

B.

What they are planning to do today

C.

Any roadblocks they are running into

D.

Both A and B

E.

All of the above

Question 17

ElementTree sub-library gives us direct access to:

Options:

A.

Parse tree of the XML

B.

Delete tree of the XML

C.

Copy tree of the XML

D.

Insert tree of the XML

E.

None of the above

Question 18

Which of the following is a useful feature of functional programming?

Options:

A.

Higher-Order Functions (HOFs)

B.

Immutable Data

C.

Lazy Evaluation

D.

All of the above

Question 19

Machine learning can be categorized as:

Options:

A.

Supervised learning

B.

Unsupervised learning

C.

Reinforcement learning

D.

All of the above

Question 20

Which of the following phases is NOT a Big Data Business Model Maturity Index?

Options:

A.

Business Monitoring

B.

Business Optimization

C.

Business Strategy

D.

Data Monetization

E.

Business Metamorphosis

Question 21

Which of the following is an SLAs specification in case of Internet Service Provider (ISP)?

Options:

A.

Mean Time Between Failures (MTBF)

B.

Mean Time To Recovery (MTTR)

C.

Turnaround Time (TAT)

D.

All of the above

Question 22

Which of the following is NOT a main data container in Python?

Options:

A.

Lists

B.

Tuples

C.

LinkedList

D.

Dict

Question 23

Image files can be broken down into two broad categories:

i. Rasterized

ii. Vectorized

iii. Sectorized

Options:

A.

i, ii

B.

ii, iii

C.

i, iii

D.

None of the above

Question 24

Which classification steps are performed in inductive techniques?

i. Training Step

ii. Test Step

iii. Validation Step

iv. Application Step

Options:

A.

i, ii

B.

ii, iii

C.

i, ii, iv

D.

i, ii, iii, iv

Question 25

Which of the following is NOT a cluster management tool?

Options:

A.

Zettaset Orchestrator

B.

Apache Mesos

C.

Apache Ambari

D.

Apache Hadoop

Page: 1 / 6
Total 85 questions