Newsletter

A monthly dispatch on platform engineering, DevOps, SRE, and cloud infrastructure. Real problems, real solutions, no fluff.

What You'll Get Each Month

One email per month. Every issue covers a single infrastructure topic in depth, plus quick wins you can ship the same day. Written for engineers who build and operate production systems.

Deep Dives

Incident teardowns, architecture decisions, and tool evaluations with real production context.

Quick Wins

Actionable tips with code snippets and config examples you can use immediately.

Curated Links

Post-mortems, RFCs, and technical posts worth your time, annotated with why each one matters.

Monthly Challenge

A specific experiment to run in your own environment to improve reliability or observability.

No spam, no sales pitches. Just engineering content from someone who operates infrastructure for a living. Unsubscribe anytime with one click.

Topics We Cover

Practical content across the infrastructure stack, written from hands-on experience with CI/CD pipelines, Kubernetes clusters, observability stacks, and cloud platforms.

Platform Engineering

Internal developer portals, golden paths, self-service infrastructure, and platform team metrics.

DevOps & CI/CD

Pipeline optimization, monorepo build strategies, deployment patterns, and feature flag infrastructure.

SRE & Reliability

Error budgets, on-call design, chaos engineering, incident response, and SLO-based alerting.

Cloud Engineering

Cost optimization, IaC patterns, multi-cloud networking, and cloud provider comparisons without vendor bias.

What an Issue Looks Like

Here's a preview of a typical newsletter issue.

webstack-dispatch-001.md

## Deep Dive: The Kubernetes DNS Bug That Wasted 40 Hours

How the default ndots:5 setting caused cascading DNS failures under load, the misleading metrics that sent us down the wrong path, and the two-line fix that resolved it.
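
To make the ndots:5 behavior concrete, here is a minimal sketch of how a stub resolver expands a name against its search list. The function name `candidate_queries` and the search list are illustrative assumptions (a typical pod in the `default` namespace), not taken from the incident, and real resolvers differ in details.

```python
from typing import List


def candidate_queries(name: str, search: List[str], ndots: int = 5) -> List[str]:
    """Return the fully qualified names a stub resolver would try, in order.

    Hypothetical helper for illustration only; it models the common
    resolv.conf rule, not any specific resolver implementation.
    """
    # A trailing dot marks the name as already fully qualified.
    if name.endswith("."):
        return [name]

    absolute = name + "."
    expanded = [f"{name}.{domain}." for domain in search]

    # With at least `ndots` dots, the name is tried as-is first.
    if name.count(".") >= ndots:
        return [absolute] + expanded

    # Otherwise every search domain is tried before the name itself.
    return expanded + [absolute]


# Assumed search list for a pod in the "default" namespace.
search = ["default.svc.cluster.local", "svc.cluster.local", "cluster.local"]

for query in candidate_queries("api.example.com", search):
    print(query)
```

In this sketch, `api.example.com` has only two dots, so three search-domain lookups are attempted before the real name. That multiplication of queries per lookup is what tends to hurt under load.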

## Quick Wins

- Terraform state backup one-liner before every apply

- PromQL query to find unused Grafana dashboards

- Pre-commit hook for secrets scanning in monorepos

## What We're Reading

- Cloudflare's routing incident post-mortem

- OpenTelemetry's new tail-sampling processor

- Google SRE on SLO-based alerting

## Try This Month

Check your CoreDNS cache hit rate. Below 80%? You're hammering upstream resolvers unnecessarily.
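
One way to run that check is to compute the ratio from two counter deltas. This is a minimal sketch: it assumes you have already read hit and miss counts over the same time window (for example from the cache plugin's `coredns_cache_hits_total` and `coredns_cache_misses_total` counters, whose names may vary by CoreDNS version), and the sample numbers are made up.

```python
from typing import Optional


def cache_hit_rate(hits: float, misses: float) -> Optional[float]:
    """Return hits / (hits + misses), or None when there was no traffic."""
    total = hits + misses
    if total == 0:
        return None
    return hits / total


# Hypothetical counter deltas over the same window, not real measurements.
rate = cache_hit_rate(hits=7200, misses=2800)

if rate is None:
    print("no cache traffic in this window")
elif rate < 0.80:
    print(f"hit rate {rate:.0%} is below the 80% threshold")
else:
    print(f"hit rate {rate:.0%} is at or above the 80% threshold")
```

The counters are cumulative, so use the difference between two readings rather than the raw values.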

$ 5 min read · unsubscribe anytime