Newsletter
A monthly dispatch on platform engineering, DevOps, SRE, and cloud infrastructure. Real problems, real solutions, no fluff.
What You'll Get Each Month
One email per month. Every issue covers a single infrastructure topic in depth, plus quick wins you can ship the same day. Written for engineers who build and operate production systems.
-
Deep Dives
Incident teardowns, architecture decisions, and tool evaluations with real production context.
-
Quick Wins
Actionable tips with code snippets and config examples you can use immediately.
-
Curated Links
Post-mortems, RFCs, and technical posts worth your time, with annotations on why.
-
Monthly Challenge
A specific experiment to run in your own environment to improve reliability or observability.
No spam, no sales pitches. Just engineering content from someone who operates infrastructure for a living. Unsubscribe anytime with one click.
Stay Updated
Platform engineering insights delivered to your inbox once a month. No spam, no sales pitches.
Topics We Cover
Practical content across the infrastructure stack, written from hands-on experience with CI/CD pipelines, Kubernetes clusters, observability stacks, and cloud platforms.
Platform Engineering
Internal developer portals, golden paths, self-service infrastructure, and platform team metrics.
DevOps & CI/CD
Pipeline optimization, monorepo build strategies, deployment patterns, and feature flag infrastructure.
SRE & Reliability
Error budgets, on-call design, chaos engineering, incident response, and SLO-based alerting.
Cloud Engineering
Cost optimization, IaC patterns, multi-cloud networking, and cloud provider comparisons without vendor bias.
What an Issue Looks Like
Here's a preview of a typical newsletter issue.
## Deep Dive: The Kubernetes DNS Bug That Wasted 40 Hours
How the default ndots:5 setting caused cascading DNS failures under load, the misleading metrics that sent us down the wrong path, and the two-line fix that resolved it.
## Quick Wins
- Terraform state backup one-liner before every apply
- PromQL query to find unused Grafana dashboards
- Pre-commit hook for secrets scanning in monorepos
## What We're Reading
- Cloudflare's routing incident post-mortem
- OpenTelemetry's new tail-sampling processor
- Google SRE on SLO-based alerting
## Try This Month
Check your CoreDNS cache hit rate. Below 80%? You're hammering upstream resolvers unnecessarily.
$ 5 min read · unsubscribe anytime