Slight Reliability
Learning SRE, one day at a time.
Podcasting since 2021 • 92 episodes
Slight Reliability
Latest Episodes
Slight Reliability Episode 90 - Non-Prod Reliability Engineering + 2024 Wrap
This week I check in and give an update on work, life, and my attempts at bringing to life SRE practices in the world of non-production environment management.You can find the official Slight Reliability podcast website at:
•
Season 2
•
Episode 90
•
18:13
Slight Reliability Episode 89 - Blameless Post-mortems with Karanveer Anand
This week I'm joined by Karanveer Anand, SRE Technical Program Manager at Google to discuss blameless post-mortems. We cover:🦅 The recent Crowdstrike outage and their public post-mortem🚑 When do we do a blameless post-mortem?😕 H...
•
Season 2
•
Episode 89
•
26:06
Slight Reliability Episode 88 - OpenTelemetry Revisited with Zach Michel
This week Zach Michel from https://middleware.io/ and I discuss the state of OpenTelemetry and what it means to adopt it. We cover:🌩️ Achieving observability in a SaaS world🥫 Context propagation ...
•
Season 2
•
Episode 88
•
26:51
Slight Reliability Episode 87 - Measuring the value of SRE with Artem Yakimenko
In Episode 80 Niall Murphy talked about the need for SREs to be better at articulating the value of our work. In this episode I'm joined by ex-Googler and Engineering Director (SRE) at Culture Amp Artem Yakimenko about how we might achieve this...
•
Season 2
•
Episode 87
•
35:33
Slight Reliability Episode 86 - Evolving SLOs with Dom Finn
In the world of SRE we constantly talk about defining SLOs, but what about evolving them over time? This week I chat with SRE Tech Lead Dom Finn about just that. We cover the relationship between reliability and user analytics, latency classes ...
•
Season 2
•
Episode 86
•
25:57