Huawei H13-723_V2.0 Dumps

Page: 1 / 20
Total 526 questions

HCIP-Big Data Developer V2.0 Questions and Answers

Question 1

Regarding the relationship between Hive and other Hadoop components, which of the following descriptions is wrong?

Options:

A.

Hive ultimately stores its data in HDFS.

B.

HQL can execute tasks through MapReduce.

C.

Hive is a data warehouse tool for the Hadoop platform.

D.

Hive has a strong dependency on HBase.

Question 2

In the FusionInsight HD product, what is the role name of the Kafka service?

Options:

A.

Producer

B.

Broker

C.

Consumer

D.

ZooKeeper

Question 3

Which of the following descriptions of Kafka's characteristics are correct? (multiple choice)

Options:

A.

Kafka is a high-throughput, distributed, publish-subscribe messaging system.

B.

Kafka persists messages.

C.

Kafka is suitable for both offline and online message consumption scenarios.

D.

Kafka guarantees the order of messages within each Partition.
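
The per-partition ordering mentioned in option D can be illustrated with a small pure-Python sketch. No Kafka client is involved: the `partition_for` helper, the CRC32-based key hashing, and the partition count of 3 are illustrative assumptions, not Kafka's actual partitioner. The sketch only shows that records sharing a key land in one partition log and keep their arrival order there, while nothing is implied about order across partitions.

```python
import zlib
from collections import defaultdict

NUM_PARTITIONS = 3  # hypothetical partition count


def partition_for(key, num_partitions=NUM_PARTITIONS):
    """Map a key to a partition (illustrative hash, not Kafka's own)."""
    return zlib.crc32(key.encode("utf-8")) % num_partitions


def append_all(records):
    """Append (key, value) records to per-partition logs in arrival order."""
    logs = defaultdict(list)
    for key, value in records:
        logs[partition_for(key)].append((key, value))
    return logs


records = [("user-1", "a"), ("user-2", "x"), ("user-1", "b"), ("user-1", "c")]
logs = append_all(records)

# All records for "user-1" sit in one partition, in the order they were sent.
user1_values = [v for k, v in logs[partition_for("user-1")] if k == "user-1"]
print(user1_values)  # ['a', 'b', 'c']
```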

Question 4

In Kafka, which of the following statements about a Producer sending data are wrong? (multiple choice)

Options:

A.

The Producer is responsible for producing data and the Consumer is responsible for consuming data; a Socket connection must be established between the Producer and the Consumer.

B.

A Producer can send data either to a Broker or to a Consumer.

C.

A Producer, as the message producer, can write data directly to ZooKeeper.

D.

A Producer can produce data by connecting to any normal Broker instance.

Question 5

Flink uses the checkpoint mechanism to guarantee fault tolerance while an application is running.

Options:

A.

True

B.

False

Question 6

Which of the following scenarios is not what the Flink component excels at? (multiple choice)

Options:

A.

Batch processing

B.

Iterative computing

C.

Stream processing

D.

Data storage

Question 7

Regarding the topology (Topology) in Streaming, which of the following descriptions is wrong?

Options:

A.

A Topology is a directed acyclic graph (DAG) formed by a group of Spout components and Bolt components connected through Stream Groupings.

B.

A Topology runs until it is explicitly killed.

C.

Business logic is encapsulated in the Topology.

D.

A Topology can be started with only one Worker process specified.

Question 8

In a FusionInsight HD HBase cluster, Table1 belongs to Namespace1 and Table2 belongs to Namespace2. Table1 has two column families, cf11 and cf12, and Table2 has one column family, cf21. Which of the following options give user account A read and write permissions on both cf11 and cf21? (multiple choice)

Options:

A.

Grant this user account global read permission.

B.

Grant this user account read and write permissions on Namespace1.

C.

Grant this user account read and write permissions on Table1 and Table2.

D.

Grant this user account read and write permissions on Namespace1 and Namespace2.

Question 9

In the YARN service, if you want to set the capacity of the queue QueueA to 30%, which parameter should be configured?

Options:

A.

yarn.scheduler.capacity.root.QueueA.user-limit-factor

B.

yarn.scheduler.capacity.root.QueueA.minimum-user-limit-percent

C.

yarn.scheduler.capacity.root.QueueA.capacity

D.

yarn.scheduler.capacity.root.QueueA

Question 10

Flink supports deployment in Local mode and Cluster mode (as well as cloud deployment); other deployment modes are not currently supported.

Options:

A.

True

B.

False

Question 11

HBase filters can use column names or column values as filter conditions, and multiple filters can be used at the same time.

Options:

A.

True

B.

False

Question 12

Which of the following business applications is not a suitable scenario for Hive?

Options:

A.

Real-time online data analysis

B.

Data mining (user behavior analysis, interest division, regional display)

C.

Data Aggregation (Daily/Weekly user clicks, click ranking)

D.

Non-real-time analysis (log analysis, statistical analysis)

Question 13

Regarding the service management operations in FusionInsight Manager, which of the following statements is wrong?

Options:

A.

Can start, stop and restart the service

B.

Services can be added and uninstalled.

C.

Uncommon services can be set to hide or show

D.

Can view the current status of the service

Question 14

coordinator.xml is the configuration file responsible for scheduling the workflow.

Options:

A.

True

B.

False

Question 15

When an HDFS client writes a file with N replicas, which of the following statements about the writing process are true? (multiple choice)

Options:

A.

Each DataNode stores at most one replica.

B.

Support multiple users to write to the same file at the same time.

C.

The first copy of the data block is placed preferentially on the node where the client writing the data block is located.

D.

By default, all replicas of a file block are stored on the same rack.

Question 16

Suppose an application has 10 tables, each with tens of millions of records and about 20 fields. Redis is now used to cache the data of these 10 tables. Which of the following is the best data structure design?

Options:

A.

Use the hash structure, with one hash key per table; each row of the table is one field of that hash key.

B.

Use the hash structure, with one hash key per row of each table; the fields of the hash key correspond to the fields of the table record, and the key design adds a different prefix for each table to distinguish them.

C.

Use the string structure, with one key for each field of each row of each table.

D.

Use the string structure, with one key per row of each table; the value is the concatenation of all field values of that row.

Question 17

Because Spark is a memory-based computing engine, the amount of data a Spark application can process cannot exceed the total memory allocated to that Spark application.

Options:

A.

True

B.

False

Question 18

To add RegionServer hosts to an HBase cluster, the original cluster must be stopped first, because HBase does not support dynamic expansion.

Options:

A.

True

B.

False

Question 19

In FusionInsight HD, which of the following are data distribution methods of Streaming? (multiple choice)

Options:

A.

Shuffle Grouping

B.

Field Grouping

C.

Local Grouping

D.

Direct Grouping

Question 20

During Solr application development in the FusionInsight HD product, you can verify a Collection through the Solr Admin UI. Which of the following statements about the Solr Admin UI are correct? (multiple choice)

Options:

A.

Click Tree under Cloud to view information such as the Collections, configuration sets, and live_nodes in SolrCloud.

B.

Click Cloud to view the distribution and status of the Replicas of each Shard under each Collection.

C.

Enter the Core Overview page of a Shard Replica of a Collection to view the actual number of documents indexed by that Replica, the storage size, and the location information.

D.

Users of the Solr user group, the Solr admin role, and the Super group have access to the Solr Admin UI.

Question 21

In FusionInsight HD, which computing frameworks are available for real-time processing scenarios? (multiple choice)

Options:

A.

Spark Streaming

B.

Streaming

C.

MapReduce

D.

HDFS

Question 22

Spark Streaming can receive data from Kafka and perform calculations, but the calculation results can only be stored in HDFS and cannot be written back to Kafka.

Options:

A.

True

B.

False

Question 23

When the cluster is normal and a Redis client initiates one get call, how many message interactions take place between the client and the server?

Options:

A.

1

B.

2

C.

3

D.

4

Question 24

With the block size set to 128M, when an HDFS client writes a file of 100M, how much storage space does the file actually occupy?

Options:

A.

128M

B.

100M

C.

64M

D.

50M
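
The arithmetic behind this question can be sketched in a few lines of Python. This is a simplified model under stated assumptions: a block occupies only as much disk space as the data written into it, and checksum files and NameNode metadata are ignored. The helper name `hdfs_usage` and the default replication factor of 1 are illustrative choices, not part of any HDFS API.

```python
import math

MB = 1024 * 1024


def hdfs_usage(file_size, block_size=128 * MB, replication=1):
    """Return (number_of_blocks, bytes_of_file_data_stored).

    Assumption: the last block is not padded to the full block size,
    and checksum/metadata overhead is ignored.
    """
    blocks = math.ceil(file_size / block_size) if file_size else 0
    return blocks, file_size * replication


blocks, stored = hdfs_usage(100 * MB)
print(blocks, stored // MB)  # 1 100
```

Under the same assumptions, a 300M file with three replicas would be split into 3 blocks per replica and hold 900M of file data in total.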

Question 25

A project needs to save the Internet access records of a certain area and run full-text searches over those records for ?? information, in order to prevent crime in the region.

In this scenario, which of the following options is the best?

Options:

A.

Create the index in Solr and also store the data there, returning all of the data during full-text search.

B.

Store the data in HBase and use HBase's filter feature to satisfy fuzzy-match queries.

C.

When storing data, build the index in Solr and store the complete data in HBase. When querying, use Solr full-text search to obtain the key information of the records, then use the key information to fetch the full records from HBase.

Question 26

Suppose an application has 10 tables, each with tens of millions of records and about 20 fields. Redis is currently used to cache the data of these 10 tables. Which of the following is the best data structure design?

Options:

A.

Use the hash structure, with one hash key per table; each row of the table is one field of that hash key.

B.

Use the hash structure, with one hash key per row of each table; the fields of the hash key correspond to the fields of the table record, and the key design adds a different prefix for each table to distinguish them.

C.

Use the string structure, with one key for each field of each row of each table.

D.

Use the string structure, with one key per row of each table; the value is the concatenation of all field values of that row.
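
To make the hash-per-row layout described in option B concrete, here is a small sketch that uses a plain Python dict in place of a Redis server. The table names `user` and `order`, the `row_key` helper, and the `table:id` key format are illustrative assumptions; with a real client, the inner dict would be written with a hash command such as HSET.

```python
def row_key(table, row_id):
    """Build a per-row key; the table name acts as the distinguishing prefix."""
    return f"{table}:{row_id}"


# Stand-in for Redis: outer dict = keyspace, inner dict = one hash per row.
store = {}


def cache_row(table, row_id, record):
    """Store each column of the row as a field of that row's hash."""
    store.setdefault(row_key(table, row_id), {}).update(record)


def get_field(table, row_id, field):
    """Read one column of one row without touching the other columns."""
    return store.get(row_key(table, row_id), {}).get(field)


cache_row("user", 42, {"name": "Ann", "city": "Oslo"})
cache_row("order", 42, {"total": "19.90"})
print(get_field("user", 42, "city"))  # Oslo
```

The prefix keeps row 42 of `user` separate from row 42 of `order`, and a single field can be read or updated without rewriting the whole row.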

Question 27

HDFS adopts a "write once, read many" file access model. It is therefore recommended that once a file has been created, written, and closed, it not be modified again.

Options:

A.

True

B.

False

Question 28

In Streaming application development, which of the following interfaces does a Bolt use to send a Tuple?

Options:

A.

emit

B.

execute

C.

open

D.

nextTuple

Question 29

In FusionInsight HD, which of the following statements about the ACK mechanism of Streaming are true? (multiple choice)

Options:

A.

After Acker is enabled, Streaming identifies Tuples that failed to be sent and resends them automatically, without human intervention.

B.

Acker marks messages whose processing timed out or failed as fail.

C.

Starting from the Spout, a failure at any link of the resulting Tuple tree marks the entire tree as failed.

D.

The application needs to implement the message resend logic in the fail() interface method of the Spout.

Question 30

In FusionInsight HD, which of the following are roles of the Streaming service? (multiple choice)

Options:

A.

Nimbus

B.

Supervisor

C.

Broker

D.

quorumpeer

Question 31

Which of the following methods can generate a DStream object?

Options:

A.

KafkaUtils.createStream(…)

B.

KafkaUtils.createDirectStream(…)

C.

StreamingContext.socketStream

D.

StreamingContext.fileStream(…)

Question 32

In a Hive application on FusionInsight HD, consider the following scenario: the stored files need higher ?? efficiency, and most ?? involve only some of the fields in a file. This scenario is suitable for columnar file (ORC File) storage.

Options:

A.

True

B.

False

Question 33

There is a file text.txt in the root directory of an HDFS cluster. Which of the following commands can find the DataNode node information for it?

Options:

A.

hdfs fsck /test.txt -files

B.

hdfs fsck /text.txt -locations

C.

hdfs fsck /test.txt -blocks

D.

hdfs fsck /test.txt -list-corruptfileblocks

Question 34

In FusionInsight HD, which of the following scenarios is Streaming suitable for? (multiple choice)

Options:

A.

Streaming data monitoring

B.

Real-time visit statistics of the website

C.

Offline log analysis

D.

Traffic flow analysis

Question 35

Options:

A.

JDBC interface

B.

ODBC interface

C.

Python interface

D.

Ruby interface

Question 36

In FusionInsight HD, the Streaming packaging tool is used to package the business code JAR, other dependency JARs, and so on into a complete Streaming application JAR.

Options:

A.

True

B.

False

Question 37

During Flume cascaded transfers, the failover mode can be used, so that if the next-hop Flume node fails or receives data abnormally, transmission automatically switches to another path and continues.

Options:

A.

True

B.

False

Question 38

The calculation logic of a Spark application is parsed into a DAG. Which of the following functional modules performs this parsing?

Options:

A.

Client

B.

ApplicationMaster

C.

Executor

D.

Driver

Question 39

In a FusionInsight HD cluster, which of the following statements about the kinit command are wrong? (multiple choice)

Options:

A.

Only human-machine accounts can be used.

B.

Only machine-machine accounts can be used.

C.

A client does not support the simultaneous use of multiple accounts.

D.

The ticket obtained by executing this command expires after 24 hours, after which the kinit command must be executed again to log in.

Question 40

When an HDFS client writes a file to HDFS with N replicas, if one of the replicas fails to be written, a write failure is returned for all replicas.

Options:

A.

True

B.

False

Question 41

When the FusionInsight Manager interface receives a Kafka insufficient disk capacity alarm, and hard disk hardware failure has been ruled out as the cause, the system administrator needs to consider expanding capacity to solve the problem.

Options:

A.

True

B.

False

Question 42

When implementing business logic that writes data to HBase, which of the following interfaces or classes does not need to be involved?

Options:

A.

Put

B.

HTable

C.

HBaseAdmin

D.

PutList

Question 43

Options:

A.

The client preferentially downloads data from the nearest DataNode.

B.

File data is first returned from the DataNode to the NameNode, and then downloaded from the NameNode to the client.

C.

If the DataNode that the client is connected to fails during a read, the client abandons the failed node and connects to a node that holds another replica.

D.

Multiple clients can read the same file data from the DataNode at the same time.

Question 44

When deploying FusionInsight HD, at least how many nodes are recommended for deploying Flume Server within the same cluster?

Options:

A.

1

B.

2

C.

3

D.

4

Question 45

Which source type of Flume supports real-time data collection?

Options:

A.

taildir

B.

Log

C.

JMS

D.

Thrift

Question 46

In FusionInsight HD, which of the following are roles of the Streaming service? (multiple choice)

Options:

A.

Nimbus

B.

Supervisor

C.

Broker

D.

quorumpeer

Question 47

Which YARN role manages the resources (CPU/memory) of an individual node?

Options:

A.

NodeManager

B.

ResourceManager

C.

DataNode

D.

NameNode

Question 48

In which of the following is a MapReduce task ultimately executed?

Options:

A.

NodeManager

B.

container

C.

ResourceManager

D.

AppMaster

Question 49

When performing a full-text search with Solr, the wt parameter can specify the response format of the query result. Regarding the response format of Solr query results, which of the following statements is wrong?

Options:

A.

Supports CSV and JSON

B.

Supports CSV, JSON, and HTML

C.

Supports CSV, JSON, and XML

Question 50

The two key elements of a Flink program are stream data and transformation operators.

Options:

A.

True

B.

False

Question 51

Regarding the WebHCat development interface of the Hive service on the FusionInsight HD platform, which of the following descriptions is incorrect?

Options:

A.

Supports REST-based query requests.

B.

WebHCat returns data in XML format.

C.

WebHCat provides services externally over the HTTP and HTTPS protocols.

D.

Tables can be created, queries run, and so on through WebHCat.

Question 52

In Spark, SparkSQL is an independent module that does not depend on Spark Core and independently completes operations such as parsing, optimizing, and executing SQL statements.

Options:

A.

True

B.

False

Question 53

In a FusionInsight HD cluster, which of the following statements about the kinit command are wrong? (multiple choice)

Options:

A.

Only human-machine accounts can be used.

B.

Only machine-machine accounts can be used.

C.

A client does not support the simultaneous use of multiple accounts.

D.

The ticket obtained by executing this command expires after 24 hours, after which the kinit command must be executed again to log in.

Question 54

Which of the following statements about ZooKeeper is wrong?

Options:

A.

If an interruption occurs while ZooKeeper is synchronizing messages, then after the failure is recovered, synchronization can continue from the transmission state before the failure; that is, resuming from a breakpoint is supported.

B.

ZooKeeper uses a custom atomic message protocol to ensure the consistency of node data across the entire system.

C.

A ZooKeeper cluster elects the Leader role at startup.

D.

After the Leader node receives a data change request, it first writes to disk and then writes to memory.

Question 55

Which scenarios is the Redis LIST data structure suitable for? (multiple choice)

Options:

A.

Build queuing systems, such as message queues

B.

uniq operations, such as getting the ranking value of all data in a certain period of time

C.

Getting the latest N items: for example, getting the latest 10 comments on a Weibo post

D.

Simulate stack operations

Question 56

When creating a table through the createTable method of HBase, which parameters must be passed in?

Options:

A.

Table name

B.

Table name and columns

C.

Table name and column families (family)

D.

can be empty

Question 57

In HBase application development, when the Rowkey range and distribution of a table are known, pre-splitting regions is recommended. After the following code (fragment) is called to pre-split a table into regions, how many regions will the table have?

splits[0] = Bytes.toBytes("A");
splits[1] = Bytes.toBytes("H");
splits[2] = Bytes.toBytes("O");
splits[3] = Bytes.toBytes("U");
admin.createTable(htd, splits);

Options:

A.

3

B.

4

C.

5

D.

6
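
The counting logic in this question can be sketched in Python without an HBase cluster. The sketch assumes the usual pre-split semantics, where n distinct split keys cut the row key space into n + 1 contiguous ranges and each split key is the inclusive start of the range to its right. The helpers `region_count` and `region_index` are hypothetical names for illustration, not HBase APIs.

```python
import bisect


def region_count(split_keys):
    """n distinct split keys divide the key space into n + 1 regions."""
    return len(set(split_keys)) + 1


def region_index(row_key, split_keys):
    """Return the 0-based region a row key falls into.

    Assumption: each region covers [start_key, end_key), so a row key
    equal to a split key belongs to the region that starts at that key.
    """
    return bisect.bisect_right(sorted(split_keys), row_key)


splits = [b"A", b"H", b"O", b"U"]
print(region_count(splits))         # 5
print(region_index(b"0", splits))   # 0: sorts before "A"
print(region_index(b"A", splits))   # 1: the region starting at "A"
print(region_index(b"Z", splits))   # 4: the last region, starting at "U"
```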

Question 58

In FusionInsight HD, a user wants to query a table in HBase through HBase shell operations. In this scenario, it is recommended that the administrator assign a machine-machine account to this user.

Options:

A.

True

B.

False

Question 59

Regarding the topology (Topology) in Streaming, which of the following descriptions is wrong?

Options:

A.

A Topology is a directed acyclic graph (DAG) formed by a group of Spout components and Bolt components connected through Stream Groupings.

B.

A Topology runs until it is explicitly killed.

C.

Business logic is encapsulated in the Topology.

D.

A Topology can be started with only one Worker process specified.

Question 60

In Flume, what is the main function of the source functional module?

Options:

A.

Get data and convert raw data into data objects that you process yourself

B.

Cache data and save data in memory or files according to different reliability policies

C.

Output data to the destination, support multiple output protocols

D.

Split the data and send the data to different destinations according to the characteristics of the data

Question 61

When writing a MapReduce program, which two interfaces do developers usually need to implement?

Options:

A.

map and combine

B.

reduce and combine

C.

combine and sort

D.

map and reduce

Question 62

For the HBase component of the FusionInsight HD platform, which properties of a secondary index must be defined when adding one? (multiple choice)

Options:

A.

index name

B.

index column

C.

index column type

D.

The name of the column family to which the indexed column belongs

Question 63

Regarding Redis cluster topology information, which of the following descriptions is correct?

Options:

A.

The client caches the topology information of the cluster

B.

The server caches the topology information of the cluster

C.

both are

D.

more than two

Question 64

During Flume cascaded transfers, the failover mode can be used, so that if the next-hop Flume node fails or receives data abnormally, transmission automatically switches to another path and continues.

Options:

A.

True

B.

False

Question 65

When developing applications on a secure-mode FusionInsight HD cluster, a keytab file can be used for security authentication.

Options:

A.

True

B.

False

Question 66

In the FusionInsight HD system, which of the file formats provided by Hive is not a columnar file format?

Options:

A.

ORC

B.

Parquet

C.

RCFile

D.

TextFile

Question 67

Which of the following is not a characteristic of MapReduce?

Options:

A.

easy to program

B.

good scalability

C.

real-time computing

D.

High fault tolerance

Question 68

In Flume, what is the main function of the source functional module?

Options:

A.

Get data and convert raw data into data objects that you process yourself

B.

Cache data and save data in memory or files according to different reliability policies

C.

Output data to the destination, support multiple output protocols

D.

Split the data and send the data to different destinations according to the characteristics of the data

Question 69

HDFS uses a "write once, read many" file access model. It is therefore recommended that once a file has been created, written, and closed, it not be modified again.

Options:

A.

True

B.

False

Question 70

A project needs to save the Internet access records of a certain area and run full-text searches over those records for ?? information, in order to prevent crime in the region.

In this scenario, which of the following options is the best?

Options:

A.

Create the index in Solr and also store the data there, returning all of the data during full-text search.

B.

Store the data in HBase and use HBase's filter feature to satisfy fuzzy-match queries.

C.

When storing data, build the index in Solr and store the complete data in HBase. When querying, use Solr full-text search to obtain the key information of the records, then use the key information to fetch the full records from HBase.

Question 71

When performing a full-text search with Solr, the wt parameter can specify the response format of the query result.

Regarding the response format of Solr query results, which of the following statements is wrong?

Options:

A.

Supports CSV and JSON

B.

Supports CSV, JSON, and HTML

C.

Supports CSV, JSON, and XML

Question 72

In Hive on FusionInsight HD, a user-defined UDF may have the same name as a Hive built-in UDF; in this case, the user-defined UDF is used.

Options:

A.

True

B.

False

Question 73

In a MapReduce application, the output of the map function is processed by the MapReduce ?? and then sent to the reduce function. This process belongs to ??, which sorts and groups the key-value pairs.

Options:

A.

True

B.

False
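
The data flow this statement describes, map output being sorted and grouped by key before it reaches reduce, can be sketched in pure Python. This is only a single-process illustration of the idea (the step between map and reduce is commonly called the shuffle), not Hadoop code; `map_fn`, `reduce_fn`, and `run_job` are hypothetical names, and word counting is used as the example job.

```python
from itertools import groupby
from operator import itemgetter


def map_fn(line):
    """Emit a (word, 1) pair for every word in the input line."""
    for word in line.split():
        yield word.lower(), 1


def reduce_fn(key, values):
    """Sum all counts that were grouped under one key."""
    return key, sum(values)


def run_job(lines):
    """Run map, then sort and group by key, then reduce."""
    mapped = [pair for line in lines for pair in map_fn(line)]
    mapped.sort(key=itemgetter(0))                # sort pairs by key
    grouped = groupby(mapped, key=itemgetter(0))  # group adjacent equal keys
    return [reduce_fn(k, (v for _, v in group)) for k, group in grouped]


result = run_job(["Hive on Hadoop", "Spark on Hadoop"])
print(result)  # [('hadoop', 2), ('hive', 1), ('on', 2), ('spark', 1)]
```

In this sketch the developer-supplied parts are only `map_fn` and `reduce_fn`; the sorting and grouping inside `run_job` stand in for the work done between the two phases.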

Question 74

In Spark, SparkSQL is an independent module that does not depend on Spark Core and independently completes operations such as parsing, optimizing, and executing SQL statements.

Options:

A.

True

B.

False

Question 75

In FusionInsight HD, in which ways can you view the debugging results of an Oozie job? (multiple choice)

Options:

A.

Check whether the result returned by Oozie's Java API is as expected.

B.

View the program's running results through Hue's workflow dashboard.

C.

Use Hue's file browser to check whether the expected files were produced in the specified HDFS directory.

D.

Oozie's built-in web interface can also show the job execution results.

Question 76

To add RegionServer hosts to an HBase cluster, the original cluster must be stopped first, because HBase does not support dynamic expansion.

Options:

A.

True

B.

False

Question 77

In a Hive application on FusionInsight HD, consider the following scenario: the stored files need higher ?? efficiency, and most ?? involve only some of the fields in a file. This scenario is suitable for columnar file (ORC File) storage.

Options:

A.

True

B.

False

Question 78

In HDFS application development, which of the following are interfaces supported by the HDFS service? (multiple choice)

Options:

A.

BufferedOutputStream.write

B.

BufferedOutputStream.flush

C.

FileSystem.create

D.

FileSystem.append
