Incidentally Reliable podcast cover art styled as a vinyl record album with a purple siren graphic, made by Xurrent, with...

Light gray audio waveform visualization on a white background representing a podcast or audio recording soundwave pattern.

#1 SRE PODCAST

Season 3 – Now Streaming

Episode 1 is live! Join our new host, Jim Hirschauer, as we explore the future of AI-native incident response. Listen on all your favourite platforms

Available on all your favourite platforms

FEATURED PODCAST

The Zenduty Journey, AI-Native Response, and a New Host

Listen to a special throwback episode where new host Jim Hirschauer sits down with SRE veteran Vishwa. They dive deep into the origins of Zenduty, why observability tools often flood you with too much data, and why you can't use tooling to fix culture. A perfect start to Season 3.

Listen now on

Spotify-style player showing Incidentally Reliable Podcast episode 1 titled The Zenduty Journey AI-Native Response and a...

Episodes

New

The Zenduty Journey, AI-Native Response, and a New Host

Reliability is about fixing things, not just resolving them. In this season premiere, we take a trip down memory lane with Vishwa to uncover the story behind Zenduty and how the "Incidentally Reliable" podcast began. Jim and Vishwa discuss the transition to Xurrent, the "needle in the haystack" problem in modern observability, and why culture—not just code—is the key to true reliability.

New

Once an SRE, always an SRE

In this episode, Sudarshan shares his experience leading high-performing SRE and infrastructure teams at Rippling, Twilio, Walmart, and Epsilon. He talks about reducing CI/CD costs by 60 percent, cutting on-call alerts by 65 percent, and the mindset required to build resilient systems.

CTRL + ALT + Scale: Building More Than Just Code

In this episode, Madhu Rawat (CTO, Xurrent) sits down with Sakshi — Co-founder and Head of Engineering at Kapstan, with leadership experience at Sumo Logic and UpGrad. They discuss the evolution of observability, building for scale, the role of AI in incident management, and what it means to lead engineering teams through change.

Redefining ITxM with Zenduty x Xurrent

In this episode, Phil (CPO) and Madhu (CTO) from Xurrent sit down with Vishwa and Ankur from Zenduty to talk about ITxM, building for reliability across teams, and how product and platform thinking come together in real-world incident workflows.

From Cart Failures to Satellite Footprints

In this episode, we speak with Deepak Rajanna, CPO at SatSure and ex-Amazon, Flipkart, xto10x, about pricing failures at scale, war room lessons from Big Billion Days, and building satellite-powered systems with SRE principles at their core.

GoDaddy's Journey to Hosting Reliability — Incidentally Reliable Podcast with Amit Rindhe

In this episode of Incidentally Reliable, we sit down with Amit Rhinde, Head of Engineering at GoDaddy, to uncover the secrets behind building resilient systems, scaling global operations, and ensuring uptime for millions of users.

Press Start to Scale: SRE in Gaming - Incidentally Reliable with Denys Pashutynski

In our latest episode, we speak with Denys Pashutynski, Senior Engineering Manager of Site Reliability at Roblox, about the formidable challenges of sustaining a global gaming platform. Drawing from his tenure at Twitter, AWS, and eBay, Denys delves into managing traffic surges, latency optimization, and strategic change management.

Battle-Tested Reliability Strategies with Abhishek Ghosh

We dive into the trenches with Abhishek Ghosh, a veteran who has led SRE teams at Pinterest, and now at Cribl. He shares gripping war room stories from Pinterest, strategies for maintaining uptime, insights into the role of AI in observability, and more! Discover the future of SRE and learn how to navigate the challenges of digital reliability. Tune in to gain valuable lessons from one of the industry's leading experts.

The Science of Building Cloud Native DevTools

Catch Ramiro Berrelleza — Founder and CEO at Okteto talk about how impactful DevTool startups are built, the importance of investing in Developer Experience, and the emerging issues with the Cloud Native ecosystem

Meet the Veterans

Peek into their journey so far, manoeuvred nightmares, their war-room stories and opinions on the current state of the space.

Incidentally Reliable Blogs

Byte sized content from the front-lines of Site Reliability.

News

A Letter From Our CEO: Our Path Forward

When I joined Xurrent as CEO in February, I made a commitment to myself before I made any commitments publicly: I would listen before I led. I would get in front of customers, sit down with our partners, and spend real time with the incredible team that built this platform — before I said a word about where I thought we were going.

3 Min Read

March 9, 2026

News

Docker Lessons from Solomon Hykes on Reliability

Lessons from Docker and a practical path to reliable pipelines with Dagger functions focused on MTTR security tradeoffs and developer experience

15 Min Read

August 29, 2025

News

The Reliability Stories You Won’t Hear on LinkedIn

We had the pleasure of meeting Ponmani Palanisamy, a Staff Site Reliability Engineer at LinkedIn, at a recent SRE Meetup in Bangalore. Ponmani gave an insightful talk on "Improving data redundancy and rebalancing data in HDFS." We were cap

10 Min Read

May 24, 2024

The Definitive Guide to AI in Service & Operations

Get Started Today

PDF cover that says "Modernizing IT Ops with AI"

ITxM Platform

Status Pages

iPaaS

Season 3 – Now Streaming

The Zenduty Journey, AI-Native Response, and a New Host

Episodes

The Zenduty Journey, AI-Native Response, and a New Host

Once an SRE, always an SRE

CTRL + ALT + Scale: Building More Than Just Code

Redefining ITxM with Zenduty x Xurrent

From Cart Failures to Satellite Footprints

GoDaddy's Journey to Hosting Reliability — Incidentally Reliable Podcast with Amit Rindhe

Press Start to Scale: SRE in Gaming - Incidentally Reliable with Denys Pashutynski

Battle-Tested Reliability Strategies with Abhishek Ghosh

The Science of Building Cloud Native DevTools

Meet the Veterans

Jim Hirschauer

Rajesh Tilwani

Suresh Kumar Khemka

Manan Verma

Ashutosh Sharma

Viraj Patel

Piyush Verma

Krishnendu Majumdar

Manoj Sebastian Kulatharayil

Solomon Hykes

Niall Murphy

Ramiro Berrelleza

Abhishek Ghosh

Denys Pashutynski

Amit Rindhe

Deepak Rajanna

Phil Christianson

Madhu Rawat

Sakshi Jain

Sudarshan Balakrishna

Incidentally Reliable Blogs

A Letter From Our CEO: Our Path Forward

Docker Lessons from Solomon Hykes on Reliability

The Reliability Stories You Won’t Hear on LinkedIn

The Definitive Guide to AI in Service & Operations