March Sale Special - Limited Time 65% Discount Offer - Ends in 0d 00h 00m 00s - Coupon code: top65certs

EMC E20-065 Dumps

Page: 1 / 2
Total 66 questions

Advanced Analytics Specialist Exam for Data Scientists Questions and Answers

Question 1

What is the maximum degree of a node in an undirected graph with 50 nodes'?

Options:

A.

49

B.

50

C.

1250

D.

2500

Question 2

A hotel chain runs a simul-ation on room pricing. They want to estimate revenue, per hotel, within +/- $10 with 95% confidence (Za/2=1.96). The estimated revenue standard deviation is $5000 based on previous booking data.

What is the optimal number of simulation trials to run?

    Options:

    A.

    A 32-bit operating system was used

    B.

    The same number of trials was used

    C.

    A linear congruential generator (LCG) was used (or pseudo-random number generation

    D.

    Different seeds tor the random number generator were used.

    Question 3

    What does YARN provide over and above MapReduce?

    Options:

    A.

    Separate cluster and resource management

    B.

    Parallelized processing

    C.

    Serialized processing

    D.

    Access to HDFS data

    Question 4

    Which problem type is best suited for simulation?

    Options:

    A.

    One with a few. non-random input variables

    B.

    One that has a closed-form solution

    C.

    One with numerous, non-random Input-variables

    D.

    One that compares "what-if scenarios

    Question 5

    How can you improve processing performance in HIVE?

    Options:

    A.

    Partition tables

    B.

    Run the SET hive.exec.parallel = false command

    C.

    Ensure highly normalized tables and use joins

    D.

    Minimize bucketing

    Question 6

    How is the relative value of a node visualized in a sunburst?

    Options:

    A.

    Color

    B.

    Area

    C.

    Gradient

    D.

    Position

    Question 7

    What is an important simu-lation design consideration?

      Options:

      A.

      Ensure model Inputs align with reality

      B.

      Use different seed values to regenerate results

      C.

      For rare event models, minimize number of trials

      D.

      A complex model is better than a simple model

      Question 8

      What runs more efficiently because of Apache Tez?

      Options:

      A.

      Pig and Hive

      B.

      Hive and HBase

      C.

      Yarn and Spark

      D.

      All MapReduce jobs

      Question 9

      What process must address acoustic ambiguity in NLP?

      Options:

      A.

      Part-of-speech tagging

      B.

      Word sense disambiguation

      C.

      Speech recognition

      D.

      Discourse

      Page: 1 / 2
      Total 66 questions