Site Reliability Engineering

Posted by Kit Merker |
Measuring and Optimizing CPU Performance

Software engineers are under constant pressure to make their desktop, laptop, and client applications run as efficiently as possible.  That’s easier said than done. Just ask Aaron Tyler, Principal Software...

Continue Reading
Posted by Marcin Kurc |
Monitoring Tells Me Nothing, Says Your CEO

Do you need to justify your infrastructure spend to your CEO? In this article, we’ll equip you to communicate clearly and in CEO language about how the infrastructure investment you...

Continue Reading
Posted by Marcin Kurc |
Nobl9 Has Joined The Cloud Native Computing Foundation

Today we are proud to announce that Nobl9 has become a Silver member of the Cloud Native Computing Foundation (CNCF) and the Linux Foundation. We know that members of this...

Continue Reading
Posted by Alex Nauda |
Optimizing Cloud Costs through Service Level Objectives

Are you spending too much on cloud services? Is there a way to optimize your cloud spend and dramatically lower it? Here’s a hint: it has nothing to do with...

Continue Reading
Posted by Kit Merker |
Do You Really Need Five Nines?

Call it what you will: Always On, Six Sigma, High Availability, or Five Nines. Management wants you to deliver a service as close to perfection as possible. Little do they...

Continue Reading
Posted by Kit Merker |
Creating Your First SLO: A Discussion Guide

Congratulations! You are ready to sit down with your team and establish your first Service Level Objective (SLO). You might be wondering where to start. Here’s an outline of how...

Continue Reading
Posted by Brian Singer |
Intro to Error Budget Policies

This article will get a bit deeper into how to turn service level objectives (SLOs) into a tool for balancing investments in new features and in improving the reliability of...

Continue Reading
Posted by Marcin Kurc |
How to Explain SRE to Your CEO

As we discussed in the previous post, Site Reliability Engineering (SRE) is an operating model that helps your organization grow and innovate with velocity while maintaining infrastructure reliability for service...

Continue Reading
Posted by Kit Merker |
How do we measure the customer experience?

So many interactions with customers are going through digital channels that you’d think it would be easy to understand what our users are going through! It seems archaic to force...

Continue Reading
Posted by Alex Nauda |
You’re Not Google. And, Yes, You Still Need SLOs

Much of the available literature about Site Reliability Engineering (SRE) and Service Level Objectives (SLOs) refers to pure, homogeneous infrastructure environments (think hyperscalers, including Google, where the concept of SRE...

Continue Reading