-
Shipping secure, reliable and high performance AI agents
A hands-on engineering guide to building AI agents that stay secure, reliable, and fast in real, high-stakes production systems.
-
One silver bullet: Adaptability
Hire for adaptability over pedigree, and build teams that can learn, grow, and deliver in a changing industry.
-
In partnership with incident.ioDon’t wait for an outage to improve your reliability
A reliable system isn’t just about your infrastructure. Learn to effectively leverage game days to train engineers and build resilience before real incidents force you to.
-
In partnership with Skiller WhaleAI productivity at enterprise scale
Turning AI adoption into real productivity gains requires changing habits, building skills, and scaling learning across the entire organization.
-
Taming the legacy system monster
Learn how to document, stabilize, and modernize legacy systems without risky rewrites or single-person dependencies.
-
In partnership with DockerShip 10x code safely with agents
Learn how to safely turn AI-generated code into shipped software without slowing developers or compromising enterprise governance.
-
The Staff Engineer’s playbook: Intellectual shift to systemic impact
A practical framework for making the Staff Engineer leap, shifting from solving tasks to defining strategy and delivering lasting systemic impact.
-
Most MCP servers are collecting dust. Why and how to avoid that.
How to design MCP servers that deliver the right context, stay secure, and actually get used in real workflows.
-
Building resilient engineering teams when failure is the default
How teams operating under constant failure build resilient systems and cultures by assuming outages are normal, not exceptional.
-
Updatable repos: Duolingo’s journey to a golden path
A real-world story of reducing microservice sprawl by embedding best practices into templates that teams actually want to adopt.