Free and Premium PeopleCert DevOps-SRE Dumps Questions Answers

Page: 1 / 6
Total 80 questions

Site Reliability Engineering (SRE) Foundation v1.2 Questions and Answers

Question 1

What does the term "wisdom of production" mean?

Options:

A.

Taking an engineering-based approach to problems rather than just toiling at them repeatedly

B.

The wisdom gained from something running in production

C.

Monitoring and alert notifications from staging environments

D.

If a task can be automated then it should be automated

Question 2

Which of the following is the LEAST useful metric when working to improve antifragility?

Options:

A.

Mean Time To Detect

B.

Service Level Objective

C.

Deployment frequency

D.

Recovery Point Objective

Question 3

Which of the following BEST defines the golden signal for errors?

Options:

A.

The time it takes to service successful as well as failed requests

B.

The rate of failed requests—either explicitly, implicitly, or by policy

C.

The demand placed on your system by the volume of requests

D.

The percent of capacity used by your system for current requests

Question 4

Which of the following BEST describes how to contribute to achieving higher levels of availability?

    1. Measuring the critical aspects

    2. Maintaining a close relationship with development teams

    3. Measuring staff performance

    4. Maintaining a close interval between detection and correction

Options:

A.

1 and 2

B.

2 and 3

C.

3 and 4

D.

1 and 4

Question 5

Which of the following BEST describes the most important rationale for NOT seeking an SLO of 100% availability?

Options:

A.

It is not realistic for the complexity and scale of services.

B.

The likely result is failure where such targets are defined.

C.

There is no room for improvements if targets are so high.

D.

The user satisfaction score is affected by a low percent.
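
As background for the availability SLO discussed in Question 5, the arithmetic behind an error budget can be sketched as follows. This is a minimal illustration, not part of the exam material: the function name `allowed_downtime_minutes` and the 30-day window are assumptions chosen for the example.

```python
def allowed_downtime_minutes(slo: float, window_days: int = 30) -> float:
    """Return the downtime (in minutes) that an availability SLO leaves as error budget.

    `slo` is a fraction (0.999 means 99.9%). The 30-day default window is an
    assumption for illustration; real SLOs define their own time horizon.
    """
    if not 0.0 <= slo <= 1.0:
        raise ValueError("slo must be a fraction between 0 and 1")
    window_minutes = window_days * 24 * 60
    return (1.0 - slo) * window_minutes


# Compare a few common targets, including 100%.
for slo in (0.99, 0.999, 0.9999, 1.0):
    minutes = allowed_downtime_minutes(slo)
    print(f"SLO {slo:.2%}: {minutes:.1f} minutes of budget per 30 days")
```

Under these assumptions, each extra "nine" shrinks the budget by a factor of ten, and a 100% target leaves no budget at all.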

Question 6

The new SRE team is advocating against a fixed Error Budget.

Why are fixed Error Budgets better?

Options:

A.

They create more toil

B.

They encourage working in smaller batches, which reduces risk

C.

Fixed Error Budgets are never exceeded

D.

They help predict outages

Question 7

Which scenario BEST illustrates how stability and agility can be achieved with simplicity?

Options:

A.

An SRE team is adopting easy-to-understand change procedures to streamline the process

B.

An SRE team is releasing a major update by automating continuous and small deployments

C.

An SRE team is creating procedures, practices, and tools that render software more reliable

D.

An SRE team is protecting reliability by using processes and procedures to control updates

Question 8

The value of data-driven measurements can be MOST accurately explained by which of the following?

Options:

A.

An analysis and understanding of data helps to ensure fact-based decision-making

B.

The gathering of data will provide all the necessary facts to enable better decisions

C.

Data mining enables an organization to determine the legitimacy of all metrics

D.

Objectives can only be appropriately designed when based upon actual data

Question 9

Why is it important to have the future growth envelope outlined?

Options:

A.

To ensure only signed artifacts are deployed

B.

To ensure that the service can meet current and future scale estimates

C.

To review Service Level Objectives and Service Level Indicators

D.

To review or revise Error Budgets

Question 10

Which of the following BEST explains how an error budget allows for a maximum change-velocity?

Options:

A.

Developers can focus on pushing out feature changes while the error budget remains high.

B.

Developers must slow down feature changes in line with the percentage the budget is used.

C.

Developers focus only on new feature work versus operational work if the budget is empty.

D.

Developers rush to do development work if the budget is high and slow down when it is low.

Question 11

What metrics will embracing failure help to improve?

Options:

A.

Mean time to detect and mean time between system incidents

B.

Change lead time and change failure rate

C.

Empirical test data and mean time to recover service

D.

Mean time to detect and mean time to recover
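
The detection and recovery metrics named in Question 11 can be computed from incident timestamps. The sketch below is illustrative only: the `Incident` record and its field names are hypothetical, and it assumes time to recover is measured from detection to recovery (some teams measure it from the start of the incident instead).

```python
from dataclasses import dataclass
from datetime import datetime, timedelta
from statistics import mean


@dataclass
class Incident:
    """Hypothetical incident record; field names are assumptions."""
    started: datetime    # when the fault began
    detected: datetime   # when monitoring or a person noticed it
    recovered: datetime  # when service was restored


def mean_time_to_detect(incidents: list[Incident]) -> timedelta:
    """Average gap between the start of a fault and its detection."""
    seconds = [(i.detected - i.started).total_seconds() for i in incidents]
    return timedelta(seconds=mean(seconds))


def mean_time_to_recover(incidents: list[Incident]) -> timedelta:
    """Average gap between detection and recovery (assumed definition)."""
    seconds = [(i.recovered - i.detected).total_seconds() for i in incidents]
    return timedelta(seconds=mean(seconds))


# Two made-up incidents for illustration.
t0 = datetime(2024, 1, 10, 9, 0)
t1 = datetime(2024, 1, 20, 14, 0)
incidents = [
    Incident(t0, t0 + timedelta(minutes=5), t0 + timedelta(minutes=35)),
    Incident(t1, t1 + timedelta(minutes=15), t1 + timedelta(minutes=25)),
]

print("MTTD:", mean_time_to_detect(incidents))
print("MTTR:", mean_time_to_recover(incidents))
```

Tracking both values over time shows whether detection or recovery is the larger contributor to the length of an outage.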

Question 12

Reliability is a key pillar of digital experience monitoring and incident management.

Which of the following describes the BEST type of reliability monitoring strategy in SRE?

Options:

A.

A strategy that uses traditional and familiar monitoring tools rather than advanced artificial intelligence

B.

A strategy that instruments observability and provides monitoring insights across all components and layers

C.

A strategy that focuses on monitoring and discovering useful patterns in the performance of all active networks

D.

A strategy that harnesses advanced technologies to measure, analyze, and maintain the fitness of applications

Question 13

Which of the following BEST illustrates the engineering approach for work done within SRE?

Options:

A.

An SRE is rapidly coding a solution to automate a daily tuning activity by following a set of best practices and principles.

B.

An SRE is designing a solution to eliminate toil and scale up service delivery by learning from other successful solutions.

C.

An SRE is deploying a solution using an end-to-end pipeline that has been carefully analyzed from the start.

D.

An SRE is resolving an incident as quickly as possible using a well-designed, implemented process and knowledge base.

Question 14

Which TWO of the following are BEST described as traditional escalation paths?

    1. Functional

    2. Hierarchical

    3. Cyclical

    4. Logical

Options:

A.

1 and 2

B.

2 and 3

C.

3 and 4

D.

1 and 4

Question 15

Which of the following is the BEST description of a Customer Reliability Engineer (CRE)?

Options:

A.

They take a software engineering approach to redesign all cloud services

B.

They use deep engineering expertise to improve the cloud provider’s services

C.

They work with the cloud provider's SRE team to ship and build new features

D.

They integrate with the customer’s operations team to share responsibilities

Question 16

What is one of the key characteristics of a Service Level Indicator (SLI)?

Options:

A.

It must be captured in a Service Level Agreement (SLA)

B.

It should focus on server-side metrics

C.

It must have a time horizon

D.

It must be agreed to by the SRE team and the Agile Team

Question 17

What is the primary difference between SRE and DevOps?

Options:

A.

SRE is an implementation of DevOps but focuses mostly on post-production responsibilities

B.

DevOps is mostly for software engineers and SRE is mostly for infrastructure engineers

C.

DevOps encourages closer collaboration between development and operations whereas SRE is about building a silo around production operations

D.

DevOps and SRE are the same thing

Question 18

In a safety culture, engineers are allowed to do more with the production environment without fear of repercussions.

What else do engineers need to do?

Options:

A.

Share production incidents on social media

B.

Be accountable for their actions

C.

Skip all blameless post-mortems

D.

Avoid being on-call

Question 19

Why would some Service Level Indicators require client-side data?

Options:

A.

There may be metrics affecting users that are not reflected on the server side

B.

It would be difficult to negotiate service level agreements with customers without client data

C.

It would be difficult to engineer external automation without client-side data

D.

Service Level Objectives may not be achievable without client-side data

Question 20

What is the benefit of strategically burning the Error Budget to zero every month?

Options:

A.

It allows a balance between velocity and stability

B.

It allows for the measurement of capacity and reliability

C.

It can be revised every month as necessary

D.

It creates a dialog between strategic partners

Question 21

An organization is experiencing significant turnover of IT operational staff with most not staying more than one year. The HR Director and IT Director are trying to determine why they are having difficulty retaining IT operations professionals.

What could be one of the reasons?

Options:

A.

Overload and disruptive work patterns

B.

Lack of time for skills development

C.

More time spent managing the backlog than fixing problems

D.

All of the above

Question 22

When outages are repetitive and similar, they become a form of toil.

Which of the following describes the MOST compelling reason to adopt advanced technologies and artificial intelligence (AI)?

Options:

A.

To increase reliability by reducing MTTR and MTRS

B.

To increase the mean time to repair services (MTTR)

C.

To increase the mean time to restore services (MTRS)

D.

To increase reliability and achieve perfect MTRS

Question 23

Which of the following is the definition for Application Performance Management (APM)?

Options:

A.

The highly automated communications process by which measurements are made and other data collected at remote or inaccessible points and transmitted to receiving equipment for monitoring

B.

The monitoring and management of performance and availability of software applications

C.

The use of a hardware or software component to monitor system resources and performance of a computer system

D.

Ways for engineers to communicate quantitative data about systems

Question 24

A team has exceeded their error budget by 10% in a particular month.

Give an example of what should happen next as a consequence.

Options:

A.

Sprint planning may only pull post-mortem action items from the backlog

B.

The Error Budget is reviewed to determine if it was realistic for the product or timeline

C.

The Error Budget is extended for another month to determine if this breach was an anomaly

D.

The Error Budget is ignored in subsequent months as it is creating the wrong kind of behavior
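
To make the scenario in Question 24 concrete, the sketch below shows one way a team might calculate how much of a request-based error budget has been consumed. The function name `budget_consumed` and the sample figures (a 99.9% SLO, one million requests, 1,100 failures) are assumptions for illustration and do not come from the question.

```python
def budget_consumed(slo: float, total_requests: int, failed_requests: int) -> float:
    """Return the fraction of the error budget used in a period.

    A result above 1.0 means the budget was exceeded; 1.10 corresponds to
    exceeding it by 10%. All inputs here are illustrative assumptions.
    """
    allowed_failures = (1.0 - slo) * total_requests
    if allowed_failures <= 0:
        raise ValueError("an SLO of 100% leaves no error budget to consume")
    return failed_requests / allowed_failures


# Hypothetical month: 99.9% SLO, 1,000,000 requests, 1,100 failed.
consumed = budget_consumed(slo=0.999, total_requests=1_000_000, failed_requests=1_100)
print(f"Error budget consumed: {consumed:.0%}")

if consumed > 1.0:
    print("Budget exceeded: the agreed error budget policy applies.")
```

What happens after the budget is exceeded depends on the policy the team has agreed; the code only reports the overrun.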
