-
In partnership with incident.io: Don’t wait for an outage to improve your reliability
A reliable system isn’t just about your infrastructure. Learn to effectively leverage game days to train engineers and build resilience before real incidents force you to.
-
In partnership with Skiller Whale: AI productivity at enterprise scale
Turning AI adoption into real productivity gains requires changing habits, building skills, and scaling learning across the entire organization.
-
Taming the legacy system monster
Learn how to document, stabilize, and modernize legacy systems without risky rewrites or single-person dependencies.
-
In partnership with Docker: Ship 10x code safely with agents
Learn how to safely turn AI-generated code into shipped software without slowing developers or compromising enterprise governance.
-
The Staff Engineer’s playbook: Intellectual shift to systemic impact
A practical framework for making the Staff Engineer leap, shifting from solving tasks to defining strategy and delivering lasting systemic impact.
-
Most MCP servers are collecting dust. Why and how to avoid that.
How to design MCP servers that deliver the right context, stay secure, and actually get used in real workflows.
-
Building resilient engineering teams when failure is the default
How teams operating under constant failure build resilient systems and cultures by assuming outages are normal, not exceptional.
-
Updatable repos: Duolingo’s journey to a golden path
A real-world story of reducing microservice sprawl by embedding best practices into templates that teams actually want to adopt.
-
What does a CTO even do?
A reverse-engineering framework to help engineering leaders broaden impact, break growth ceilings, and develop the skills required for senior technical leadership.
-
How we introduced quality engineering and made everything better!
How modern quality engineering reduced deployment time dramatically while improving developer experience through iterative, sustainable change.