#1 SRE PODCAST

Season 3 – Now Streaming

Episode 1 is live! Join our new host, Jim Hirschauer, as we explore the future of AI-native incident response. Listen on all your favourite platforms.


Episodes

S3.e1 (New)

The Zenduty Journey, AI-Native Response, and a New Host

Reliability is about fixing things, not just resolving them. In this season premiere, we take a trip down memory lane with Vishwa to uncover the story behind Zenduty and how the "Incidentally Reliable" podcast began. Jim and Vishwa discuss the transition to Xurrent, the "needle in the haystack" problem in modern observability, and why culture—not just code—is the key to true reliability.

S2.e4 (New)

Once an SRE, always an SRE

In this episode, Sudarshan shares his experience leading high-performing SRE and infrastructure teams at Rippling, Twilio, Walmart, and Epsilon. He talks about reducing CI/CD costs by 60 percent, cutting on-call alerts by 65 percent, and the mindset required to build resilient systems.

S2.e3

CTRL + ALT + Scale: Building More Than Just Code

In this episode, Madhu Rawat (CTO, Xurrent) sits down with Sakshi — Co-founder and Head of Engineering at Kapstan, with leadership experience at Sumo Logic and UpGrad. They discuss the evolution of observability, building for scale, the role of AI in incident management, and what it means to lead engineering teams through change.

S2.e2

Redefining ITxM with Zenduty x Xurrent

In this episode, Phil (CPO) and Madhu (CTO) from Xurrent sit down with Vishwa and Ankur from Zenduty to talk about ITxM, building for reliability across teams, and how product and platform thinking come together in real-world incident workflows.

S2.e1

From Cart Failures to Satellite Footprints

In this episode, we speak with Deepak Rajanna, CPO at SatSure and ex-Amazon, Flipkart, xto10x, about pricing failures at scale, war room lessons from Big Billion Days, and building satellite-powered systems with SRE principles at their core.

S01.e10

Credit-Worthy Reliability

Catch Krishnendu Majumdar talk about his journey in the dynamic Indian startup ecosystem, strategies for building for scale from Day 1, and insights into earning sustained user trust through exceptional product performance in high-governance industries like credit and finance.

S01.e09

Reliability For The Books with Niall Murphy

Catch Niall talk about graceful degradation, what startups are getting wrong about reliability, and how a well-thought-out user experience can communicate credibility to current and potential customers.

S01.e08

How Solomon Hykes Disrupted Deployments, From Docker to Dagger

Solomon Hykes, co-founder of Docker and Dagger, shares stories from the early days of Docker, the rollercoaster journey to 20 million active developers worldwide, the heavy crown of a tech leader, and his vision to revolutionize CI/CD with Dagger.

S01.e07

Behind The Seams of Myntra's Reliability - Ashutosh Sharma

Learn about the culture, the people, and the processes that make our favourite fashion destination reliable with Ashutosh Sharma, Director of Engineering at Myntra.

Incidentally Reliable Blogs

Byte-sized content from the front lines of Site Reliability.
