Events
Explore Our Events: From Upcoming Sessions to a Library of Past Highlights, Dive into the Latest Insights and Trends in SRE and Reliability Engineering.
Upcoming Events
Gartner IOCS Conference
Event
10 Dec 2024 · 7 AM PST
Las Vegas, NV
Past Events
Reliability Management With SLOs: Build Vs. Buy
Service Level Objectives (SLOs) have become essential for effective reliability management, but choosing the right path for implementation is crucial. Teams must decide whether to build their own SLO infrastructure, leverage SLO features in existing monitoring tools, or invest in a dedicated SLO platform. Each option has its own set of challenges, and making the wrong choice can lead to wasted resources, delayed results, and even a failed SLO program.
Is MTTR Dead?
Join us for a critical discussion on why Mean Time to Repair (MTTR) may no longer be the best metric for measuring system reliability. This webinar will explore how Service Level Objectives (SLOs) are transforming reliability management by shifting the focus from reactive to proactive strategies. Expert speakers will explain the limitations of MTTR and how SLOs provide a more comprehensive view of system health. Ideal for IT professionals seeking to enhance their approach to reliability, this event offers actionable insights for modern incident management.
Building Reliable E-Commerce Experiences
Join us for an in-depth discussion on how to develop further reliability strategies for e-commerce. Our experts from Google and Nobl9 will cover the critical role of reliability in exceptional user experiences. You'll gain insights into common challenges and practical strategies for maintaining peak performance, including effectively using Service Level Objectives (SLOs). Additionally, learn about the latest industry trends, real-world case studies, and proactive measures to prevent downtime and improve customer satisfaction.
Navigating DORA Compliance - Insights From Industry Experts
The Digital Operational Resilience Act (DORA) is reshaping the landscape for critical service providers worldwide. As the January 2025 implementation date approaches, organizations must adapt not only their policies and procedures but also their technology solutions. Learn how industry-leading technologies are working together to address risk management and resilience, DevOps practices and AI-driven resilience strategies, third-party oversight, test/validation methods, transparency, and incident response.
Workshop: Enhancing Reliability on AWS with Service Level Objectives
Reliability of your offerings is non-negotiable to users, and too often is siloed to the various elements of your app and infrastructure. Achieving a customer-centric view of reliability - going beyond incident management and identifying day-to-day issues affecting the user experience - requires a holistic approach. Implementing Service Level Objectives (SLOs) and error budgeting allows you to monitor your application as a singular entity, prioritizing issues that affect the user and identifying the invisible mini-incidents that cause product abandonment and switching.
Company-wide SLOs: Building Composite SLIs Across Many Teams
In this session, we will discuss creating and unifying SLIs that software teams can use to increase overall performance and understanding of how each component works together. Setting company-wide SLOs will give teams something to target and increase system reliability.
Graceful Degradation and SLOs
Learn how SLOs empower organizations to maintain service reliability and availability, even amidst the challenges related to managing complex software. Discover how graceful degradation strategies can improve user satisfaction in the face of disruptions or resource limitations.
Our panelists: Alex Hidalgo (Nobl9), Jennifer Mace (Google), Mandi Walls (PagerDuty) and Niall Murphy (Stanza).
5 Strategies and Business Benefits of SLOs
Join us as we delve into the art of mastering success with Service Level Objectives (SLOs). We are thrilled to have SLO Expert and Author, Alex Hidalgo, along with Srikar Achanta, Senior Sales Engineer at Nobl9, share their insights. Together, we will explore effective strategies and the numerous business advantages of implementing SLOs.
Gartner IT Infrastructure, Operations & Cloud Strategies Conference
Gartner IT Infrastructure, Operations & Cloud Strategies Conference 2023 brings the world’s technology leaders together to hear top trends, find objective answers and explore topic coverage in addition to best practices. Gain the insights and guidance to create an effective pathway to the future and network with your peers.
AWS re:Invent
AWS re:Invent offers cloud enthusiasts, experts, and practitioners an experience like no other. You get five days of opportunities to find productive solutions that will transform your company and your pick of 2,000+ learning sessions to quickly master new skills. And, the event gives you hundreds of ways to build connections that can mean the next big step for your career—or your business.
Drive SLO Adoption through SDLC Templates
This upcoming webinar with our SRE Keri Melich, "Drive SLO Adoption through SDLC Templates," aims to provide a comprehensive exploration of the strategies and best practices for seamlessly incorporating SLOs into the various phases of the SDLC. By leveraging well-designed templates, development teams can effectively align their goals with business objectives and customer expectations, improving user experiences and increasing overall system reliability.
DevOpsDays Chicago
DevOpsDays Chicago brings development, operations, QA, InfoSec, management, and leadership together to discuss the culture and tools to make better organizations and products. The 2023 event will be the ninth time we have held DevOpsDays Chicago, and it should be bigger and better than ever before!
The format of DevOpsDays Chicago includes a single track of 30 minute talks in the morning of each day, followed by Ignite talks (5 minute lightning talks). We spend the rest of the afternoon in Open Spaces, which are considered a key portion of the event.
Using SLOs to Improve Software Delivery
Kit Merker is the one of the original product managers for Kubernetes and now Chief Growth Officer at Nobl9 where they're delivering a new open standard called #OpenSlo. SLOs, or service-level-objectives, provide a framework for understanding performance targets and making judgements about software changes and how they impact uptime.
But it's not just a standard, it's also code.
AWS Summit New York
Together AWS and Nobl9 make it easy to implement Service Level Objectives (SLOs) from observability tools like Amazon CloudWatch, Amazon Managed Service for Prometheus, and Amazon Managed Grafana.
SLOs built with Nobl9 tie customer-defined objectives to the health and performance of services, enabling:
- More actionable observability
- A holistic view of services defined by the business users
- Optimize IT resource spending through high-confidence, data-driven insights
Nobl9 allows users to quickly and easily implement, maintain, monitor, and automate actions for SLOs without requiring a systems rewrite or expensive change management.
SLOconf Monthly - with Adriana Villela of Lightstep
SLOconf is the only event dedicated to the practice and application of SLOs. The event features industry leaders in SRE, DevOps, and software engineering and is open to all. Join us this month to dive into the Observability Anti-Patterns.
Cisco Live
Together Cisco and Nobl9 deliver reliability solutions using Service Level Objectives to enterprises. With native support for AppDynamics and Thousandeyes, in addition to 23+ other Observability platforms, Nobl9 makes it easy to implement Service Level Objectives (SLOs).
SLOs built with Nobl9 tie customer-defined objectives to the health and performance of services, enabling:
- More actionable observability
- A holistic view of services defined by the business users
- Optimize IT resource spending through high-confidence, data-driven insights
Nobl9 allows users to quickly and easily implement, maintain, monitor, and automate actions for SLOs without requiring a systems rewrite or expensive change management.
Fast Track to SLOs with Nobl9
Join us for a webinar with Jeremy Cooper, where we will delve into the world of SLOs and how they can help you in your business. SLOs can give you valuable visibility and insight into your user's experiences with your applications. SLOs can help you reduce customer frustration and protect your reputation by helping you focus on reliability. This webinar will show how to create and manage SLOs using Nobl9.
SLOconf 2023
SLOconf is the only event dedicated to the practice and application of SLOs. The event features industry leaders in SRE, DevOps, and software engineering and is open to all.
Building on the success of SLOconf 2022, where we featured 63 companies and more than 1,000 leading brands from 83 countries in attendance over four days, our third trip around the world in 2023 will even be better!
Kubernetes In The Enterprise
Organizations of all sizes are now routinely deploying cloud-native applications based on microservices and constructed using containers. The primary platform used to deploy those applications are Kubernetes clusters that make it possible to dynamically scale consumption of infrastructure resources.
The issue now is determining how best to manage fleets of Kubernetes clusters that are widely distributed across the enterprise.
Introducing EKG - Essential Kubernetes Gauges
If you’re running Amazon EKS, you might ask "is my Kubernetes environment reliable? Is my cluster healthy?" To easily answer this question, you can use Essential Kubernetes Gauges (EKG), an open-source, prefabricated approach to monitoring Kubernetes and setting default service level objectives (SLOs).
SLOs allow you to set goals for reliability. EKG includes SLOs that measure several aspects of a cluster:
- Control Plane Health: Is the Kubernetes API responding normally? Is it performant?
- Cluster Health: Are the nodes healthy? Is there some minimum resource headroom? Can we start new workloads?
- Workload Health: Is anything in a bad state? Is there at least some kind of workload running?
- Resource Efficiency: Are resources underutilized in this cluster? Is the cluster scaling so that it makes good use of resources without endangering workloads?
- FUTURE: Cost Efficiency: measurements have been proposed as future enhancement and are under consideration.
SREcon: Alex Hidalgo
Hell Is Other Platforms. Deliberately or inadvertently, everyone builds a platform to support their business mission. And in doing so we often look to what others have done to get us there. But: what if other's principles, practices and platforms aren't applicable to our context?
In Sartre's 'No Exit' people are brought to a mysterious room in hell where they had all expected medieval torture devices to punish them for eternity, but eventually realize they had been put together to torture each other. Join Andrew Clay Shafer as DevOps and Alex Hidalgo as SRE as they both wait for Platform Engineering to join them in hell. During their suffering they will explain how you can best retain your agency, identify your needs, and understand how you can avoid the afterlife; while also outlining how we all got here in the first place.
Webinar: How Should I Set My SLOs?
Understanding what reliability target to choose takes time and effort. It requires understanding how your system performed over the past month or year. Customers normally will create an SLO and adjust the targets every few weeks as they better understand. With SLI Analyzer, you can retrieve your historical data, set reliability targets, and see how your system performed. You can easily tweak the targets and view updated error budgets and error budget burn down. Once you’ve determined the right target using your existing data, you can easily create a new SLO.
DevOps Roundtable
Observability has always been a core DevOps tenet, but achieving and maintaining it is challenging. Most DevOps teams today have been able to achieve some level of continuous monitoring using a set of pre-configured metrics that are relatively easy to track. Observability, however, takes monitoring to the next level by making it simpler to discover the root cause of IT issues before services are disrupted. There is no shortage of observability platforms today; the challenge is determining the best practices that should be put in place to employ them most effectively.
Webinar: SLO 101 With Nobl9
Join this Service Level Objective (SLOs) training, where you will learn how to: Collect website uptime data and turn it into SLOs, set up error budget alerting workflows, and analyze service risks to tune your SLOs.
Learning from Incidents, 15-16th February
FI is a community challenging conventional views and reshaping how the software industry thinks about incidents, software reliability, and the critical role people play in keeping their systems running.
In today’s economy, software organizations can’t afford to not learn from incidents.
Time Traveling SLO
Tune in to hear Keri Melich explain how you can create Service Level Objectives (SLOs) with historical data in minutes instead of days or weeks.
Time is the key that unlocks the answers to many of our questions about how our services perform. Join us as we travel through the SRE spacetime continuum with Nobl9’s new feature: Replay.
AWS re:Invent
Together AWS and Nobl9 make it easy to implement Service Level Objectives (SLOs) from observability tools like Amazon CloudWatch, Amazon Managed Service for Prometheus, and Amazon Managed Grafana.
How Nobl9 Uses SLOs Internally
Many people ask us, “if you’re so into SLOs, how do you use them yourself? How do you use SLOs if you are on Kubernetes?” Here is a chance to hear firsthand how Ian Bartholomew, SRE Manager at Nobl9, uses SLOs to build and run the Nobl9 Kubernetes platform. You’ll see real data, some example situations, and how SLOs help us run reliably and efficiently and keep everyone on the same page (and the pager off).
AKAI Code
The first edition of the AKAI Code hackathon organized by AKAI - Akademickie Koło Aplikacji Internetowych - takes place at the Poznań University of Technology.
Nobl9 is a sponsor of this event.
SREcon EMEA
The technology industry moves at an incredible pace. Innovation and change are always at the forefront of everyone's mind. Especially in the Site Reliability Engineering space, people feel pressured more than ever to keep up with all of the newest tools, processes, and philosophies. For many organizations, however, chasing all of the shiny things can end up being a detriment as opposed to a benefit. Let's examine these pressures, what the realities of most SRE organizations are, and how we can all best move into the future - together, thoughtfully, and meaningfully.
DevOps Days Tampa Bay
DevOpsDays is a worldwide series of community run technical conferences covering topics of software development, IT infrastructure operations, and the intersection between them. It is run by volunteers from community, for the benefit of the community.
SLOconf Monthly
How do you plan for unplanned incidents? You practice with Chaos Engineering. Strong incident response doesn’t just happen, you have to build the skills and train your team. Practicing for major incidents gives your team insight into how your applications will behave when something goes wrong and how it might impact your SLOs, as well as how the team will interact to solve problems. Combining your Incident Response practices with Chaos Engineering roots your response practice in real-world scenarios, helping your team build confidence.
P99 Conference
People want their computer services to operate well, and consequently they often set targets for how reliable they hope to be. In addition to target percentages, percentiles are often used to make it easier to examine the data for most user experiences, while ignoring the outliers. But problems often arise because of a tendency to only think about "nines". All services and user experiences are unique, and it is all too common to ignore important data or stress your systems and humans by trying to be more performant than you need to be. Throw away your nines, and let's find some better numbers to use instead.
Nobl9 migration to AWS Managed Prometheus
Nobl9 uses SLOs to monitor reliability and their impact on user experience. As a rapidly growing business, Nobl9 employs AWS observability solutions that are cost-effective, rich in functionality, and that scales.
This webinar will look at the Nobl9 observability tooling migration to AWS Managed Prometheus, covering:
- Cost saving strategies
- Open source options
- The migration process
- Implementation tips
DevOps Days Houston
DevOps Days event returns to Houston. Come learn more about DevOps technologies and best practices, and meet others in the community.
360 View Webcast
In the September episode of 360View, we'll cover these 3 disruptive and compelling technologies in 60 minutes where you'll learn:
- How do you find MDR providers who can deliver effective threat response? Today, organizations are seeing more sophisticated malware threats and targeted attacks, but traditional SOCs can't keep up. (w. Open Systems)
- How does true continuous data protection (CDP) work? Learn how Zerto's replication engine that doesn't sleep—it continuously replicates your production data to other locations in real time.
- How to define, measure and monitor your service level objectives (SLO)? Learn how Nobl9 can centrally manage SLOs across the enterprise.
AWS Build On Observability Day
Conversations between AWS experts and guests happen in real time, giving the audience a chance to influence the discussion and receive immediate answers.
One of the best ways to measure the availability of services is having SLOs (Service Level Objective) set to them. They help in giving you a more pragmatic way to measure things and calculate how much does it costs having a service unavailable. What you may not know is that you can define them using an agnostic standard called OpenSLO using YAML. In this session, Ian Bartholomew from Nobl9 get into the weeds of how to do that, showing step by step the definition, implementation, and migration of defined OpenSLO's from on-premise to the cloud. And he does all of this with a hands-on approach using only the terminal. How cool is that?
Sumologic Illuminate
Organizations of all sizes share the data challenges and opportunities presented by digital transformation, modern applications and cloud computing. Listen to workshops and technical certifications with product experts and technical gurus. Sal Furino from Nobl9 is presenting.
DevOps Days Charlotte
Devopsdays is a worldwide community conference series for anyone interested in IT improvement.
Alex Hidalgo of Nobl9 is speaking at this event.
DevOpsDays Charlotte
You need to measure your computer services to know how they're operating. Logs, metrics, traces, SLOs, statistical analysis, probabilistic normalization - within tech our approaches to understanding how things are going are more complex than ever. But it's not just software and hardware that needs to be monitored. Just about every discipline on the planet has some concept of how to monitor, measure, and account for detected failures.
In this talk we'll discuss some ways other disciplines such as the transit industry measure their own output, what the difference is between measurement and monitoring, and what we can learn from that when measuring our own.
Deutsche Bank's 2022 Technology Conference
Senior executives from leading companies in the technology industry will provide updates to you on their businesses. The format will be focused on more one-on-one / small group meetings in order to connect corporate management teams and investors. Presentations and panel discussions will also be an agenda highlight.
Marcin Kurc from Nobl9 is participating in a panel.
AWS Summit Chicago
Nobl9 is a global leader for implementing and managing service-level objectives (SLOs). This session shares how Nobl9 uses Amazon Managed Service for Prometheus for observability in their SaaS platform. Dive into the Nobl9 journey, from their existing observability tooling to AWS Managed Service for Prometheus, covering cost savings, benefits, the migration process and best practices for observability migration. Also learn how Nobl9 uses SLOs to monitor the reliability of external dependencies and their impact on their end user experience during migration.
DevOpDays Dallas
Come learn about the latest in the DevOps community and check out a demo of Nobl9.
The same group of seasoned organizers that brought you DevOpsDays DFW several years in a row are back again another year in 2022 to celebrate our 5th Anniversary. We were encouraged and motivated by the feedback received from previous year's events, which is why they are determined to bring an even better experience this year and improve the DevOps conversation throughout the DFW Metroplex.
Why SLOs are a Must-Have for SREs and Cloud-Native Services
The reliance on IT and the SREs to keep the business services deployed and running has never been higher, nor have the stakes. To survive and thrive in this environment, meeting reliability goals across your critical business services is essential. The only way to do that is to have effective Service Level Objectives.
Building Service Level Objectives for Scale: Case Studies and Best Practices
This webinar will share how organizations are creating and deploying Service Level Objectives (SLOs) at scale. You will learn about:
- Frameworks and best practices for implementing SLOs
- Innovative approaches organizations are using to ensure that SLOs grow with their needs
- Four real-world cases of successful deployments
Speaker: Sal Furino, Customer Success, Nobl9
SLOConf Monthly - June 2022
We are beyond excited to invite you to SLOconf Monthly Meetup on June the 15th at 12 pm PT 2022! Wayne Major - AutoSLO - How to create a large number of SLOs automatically for a large number of assets in your organization.
Building a Platform and SRE Team
With the rapid adoption of DevOps and cloud, many organizations establish platform or cloud engineering and site reliability engineering (SRE) as part of their operations and infrastructure teams. SRE and platform engineering can be complementary functions or combined as one team, as both benefit from applying engineering discipline, automation and infrastructure as code. Some organizations take a self-service or platform-as-a-product approach to better match the cadence and needs of product and development teams across the company.
How should you build your platform engineering and SRE teams, and what learnings can you gain from others? Join our cross-functional team of platform engineering and SRE experts as we explore these topics and insights.
Panel: The Business Impact of Digital Customer Experience
When people interact with your digital service, it leaves them with impressions about your brand and the value of your offering. The user’s journey across multiple systems is called the digital customer experience (DCX). Why does this matter? And how does system performance impact that experience, and the business?
Join us as we discuss:
- Why digital experiences matter, and to whom?
- Balancing and aligning business, security, and technology
- Challenges, obstacles, and opportunities
Nine common Service Level Objective mistakes (and how to avoid them)
Many organizations today are looking at Service Level Objectives (SLOs) as a way to create a deeper understanding of reliability. This webinar will explore:
- Strategies for optimizing SLOs for the enterprise
- Common mistakes people make when creating SLOs
- Resources you can leverage to reduce implementation risks
Webinar: Tracking SLOs and SLIs with Sumo Logic and Nobl9
Join Natalia Sikora-Zimna, Product Owner at Nobl9, to learn how Nobl9’s integration with Sumo Logic’s full-stack observability platform allows teams to set and understand clear reliability goals for their mission-critical applications and services. In this demo, we will cover how SLIs computed from logs and metrics using Sumo Logic’s powerful query language are automatically ingested into Nobl9 and expressed as SLOs designed to drive automated actions when error budgets are at risk to improve reliability over time.
Live Demo webinar with Nobl9
Highlights include:
- Configuring Pingdom to collect website uptime data
- Setting up error budget alerting workflows
- Strategies for tuning your SLOs
SLOconf 2022
Building on the success of SLOconf 2021, we’re bringing back the virtual conference to our community! This is an attend-while-you-work event, scheduled for May 9-12th, 2022, aimed for reliability engineers, enthusiasts, and anyone else who gets excited about reliability to gather and talk about SLOs in depth.
SLO 101 Hands on Training
Join this Service Level Objectives (SLOs) training where you will learn how to: Collect website uptime data and turn it into SLOs, set up error budget alerting workflows, and analyze service risks to tune your SLOs.
SLOconf Monthly
Julie Gunderson - "The Road to Reliability" and Vidya Subramanian - "SRE at large Enterprises".
Full Stack Observability Webinar
Learn about Cisco and Nobl9's perspective on full-stack observability for the enterprise IT stack.
Product Update with Customer Panel
We announce Hydrogen: A new way to prevent burnout, optimize velocity and rebalance technical debt is coming.
SLOconf Monthly
How is running your software infrastructure like racing a Formula 1 Car?
Game over? Reliability in Gaming
Steve Yegge, Developer of Wyvern and Nobl9 COO Kit Merker chat about reliability in gaming, war stories, and what it means to have reliable games.
SRE Meetup
29 Nov 2021 · 12 AM
Webinar
01 Jul 2021 · 9 AM
Already Scheduled? Our Pros Are Looking Forward to Your Request
Nobl9 has given us a pulse on balancing these two very difficult concerns, and the net of it is that we now have a very clear understanding of when it is time to favor software reliability over features, and vice-versa."
The first SLO Conference
for Site Reliability Engineers
The New Stack SRE Author Panel is part of the SLOconf. Here experts deal with questions like: What happens when an SLO goes wrong?