Insights & Engineering

The Anyshift Blog.

Deep dives into Site Reliability Engineering, AI in production, and scaling infrastructure gracefully. Written by the team building the future of SRE.

Browse by Category

Featured Article

What shipped at Anyshift
Changelog

What shipped at Anyshift

Linear and Notion join Annie's knowledge sources, annie-cli gains access-token authentication for headless CI use, and Annie won't recommend silencing alerts.

Anyshift
Anyshift
May 18, 2026

Latest Articles

How to Detect Terraform Drift Across Multi-Cloud
Infrastructure as CodeSeries

How to Detect Terraform Drift Across Multi-Cloud

A development RDS instance had its publicly_accessible flag flipped on a Friday afternoon. The team's drift-detection cadence was once per weekday, so 60+ hours passed before anyone caught it. Walkthrough of the audit-log subscription architecture that would have caught it in two minutes across AWS, GCP, and Azure, with every config block paste-able into your own account.

Louis Fradin
Louis Fradin
May 15, 2026 · 9 min read
How to Trace a Production Incident Back to the Commit
Production DebuggingSeries

How to Trace a Production Incident Back to the Commit

Burned 25 minutes on a Friday-morning page before I realized the responsible commit was in another team's repo. This is the four-command sequence I now run when an alert lands and `git log` on my own service comes up empty, with the outputs at each step and where the search space gets cut.

Louis Fradin
Louis Fradin
May 15, 2026 · 8 min read
Annie reads Linear now
ProductSeries

Annie reads Linear now

Forty minutes paging Linear to confirm a returning customer report was the same bug we'd half-shipped a fix for in February. The Linear integration went GA May 13, and Annie pulled both tickets, the linked PR, and the stalled action in twenty-three seconds.

Louis Fradin
Louis Fradin
May 13, 2026 · 2 min read
Annie searches Notion now
ProductSeries

Annie searches Notion now

Ten minutes to find a post-mortem already sitting in Notion. The Notion integration shipped May 12, and Annie picked the same page in eighteen seconds, root cause and open action items tagged.

Louis Fradin
Louis Fradin
May 12, 2026 · 2 min read
Annie reads Sentry now
ProductSeries

Annie reads Sentry now

Five tabs to triage one Sentry error. The Sentry integration shipped May 11, and Annie reads Sentry directly now. The first question we asked her returned 67 unresolved errors across four services.

Louis Fradin
Louis Fradin
May 11, 2026 · 2 min read
What shipped at Anyshift
ChangelogSeries

What shipped at Anyshift

Datadog gains 50+ Bits AI capabilities; PagerDuty + Incident.io + Sentry join as sources; k8s-agent v0.3.2 brings on-demand graph reconciliation.

Anyshift
Anyshift
May 11, 2026
What shipped at Anyshift
ChangelogSeries

What shipped at Anyshift

6 product areas shipped: Slack reports, MCP and CLI tools to drive Annie from your terminal, smarter automation rules, tighter AWS onboarding.

Anyshift
Anyshift
May 4, 2026
My Workers Stopped Polling: a K8s + Temporal Whodunit
Production DebuggingSeries

My Workers Stopped Polling: a K8s + Temporal Whodunit

Temporal workflows stuck in Running with zero pollers, and Temporal still reports a healthy task queue. The root cause lives one layer down: a CrashLoopBackOff in the Kubernetes worker pod, caused by a single bad environment variable. A walkthrough of debugging Temporal workers on Kubernetes the manual way (10 minutes), then with an infrastructure context layer that bridges the two systems (seconds).

Louis Fradin
Louis Fradin
Apr 8, 2026 · 6 min read
Annie CLI
ProductSeries

Annie CLI

136 CloudWatch alarms vanish overnight. Annie cross-references Slack, the audit trail, and your infra graph in one query. Now it runs in your terminal.

Stephane Jourdan
Stephane Jourdan
Mar 16, 2026 · 3 min read
5 Key Reasons You're Struggling to Debug Your Infrastructure in Under an Hour
Production DebuggingSeries

5 Key Reasons You're Struggling to Debug Your Infrastructure in Under an Hour

Most infrastructure debugging sessions blow past the one-hour mark for the same five structural reasons: scattered visibility across cloud accounts, missing historical state, terraform plan output that hides downstream impact, runbooks that lag the live infrastructure, and post-merger environments that no one has fully mapped. A walkthrough of each, with concrete examples and what reduces the time.

Roxane Fischer
Roxane Fischer
Jul 30, 2024 · 4 min read
Top 3 Weak Points in Your Infrastructure and how to mitigate them
Production Debugging

Top 3 Weak Points in Your Infrastructure and how to mitigate them

Three structural patterns recur in growing infrastructure orgs: single-repo bottlenecks where dozens of teams share one approval queue, ClickOps and dead IaC code that drift outside any state file, and module version fragmentation that quietly bypasses security patches. A walkthrough of each, with the practices that contain the blast radius.

Roxane Fischer
Roxane Fischer
Jul 30, 2024 · 3 min read

Article Series

Deep Dives

Meet Our Writers

The Contributors

Anyshift

Anyshift

Annie from Anyshift shares the latest product updates, feature launches, and news about Anyshift.

Ghazi Felhi

Ghazi Felhi

AI Engineer

Ghazi Felhi is an AI Engineer at Anyshift with a PhD in Generative AI, specializing in Language Modeling. A published AI researcher, he brings a track record of productionizing innovative AI-based solutions to Anyshift, where he works on Annie, Anyshift's AI SRE.

Louis Fradin

Louis Fradin

DevRel & Backend Engineer

Louis Fradin is a DevRel and Backend Engineer at Anyshift, where he's helping build the AI context layer for production systems, giving teams the infrastructure graph they need so AI agents can actually understand what's running in prod.

His path to SRE started deep in the stack: four years writing Linux drivers and managing HPC infrastructure for the French Ministry of Armed Forces, followed by three and a half years at Ubisoft building and operating Kubernetes clusters at scale for game servers with Go, Temporal, Talos, or OpenTelemetry.

Today he bridges that engineering background with developer advocacy, advocating for better observability primitives and smarter AI tooling for the people keeping systems alive.

Mattias Fjellstrom

Mattias Fjellstrom

Cloud Architect | Author | HashiCorp Ambassador

Mattias is a cloud architect consultant working to help customers improve their cloud environments. He has extensive experience with both the AWS and Microsoft Azure platforms and holds professional-level certifications in both. He is also a HashiCorp Ambassador and an author of a book covering the Terraform Authoring and Operations Professional certification.

Ned Bellavance

Ned Bellavance

HashiCorp Ambassador

Ned is an IT professional and educator with more than 20 years of experience in the field. He has been a helpdesk operator, systems administrator, cloud architect, and product manager. In 2019, Ned founded Ned in the Cloud LLC to work as an independent educator, creator, and consultant. Ned is a Microsoft MVP since 2017 and a HashiCorp Ambassador since 2020.

Roxane Fischer

Roxane Fischer

CEO & Co-Founder

With a passion for innovation and a deep understanding of cloud infrastructure, Roxane Fischer leads Anyshift.io with a vision to transform how companies manage and maintain their cloud environments. Her background as an ex-Lead Engineer and AI researcher gives her a unique ability to anticipate industry needs, driving Anyshift's growth by delivering solutions that prioritize efficiency, reliability, and long-term success.

Stephane Jourdan

Stephane Jourdan

CTO & Co-Founder

With over 20 years of experience in the infrastructure space, Stephane Jourdan is a true authority on building scalable, resilient systems. As the author of the Infrastructure-as-Code Cookbook and former Co-Founder & CTO at CloudSkiff (creators of driftctl, acquired by Snyk), his depth of knowledge in cloud architecture and automation is unmatched.

Stay ahead of the pager.

Get a monthly digest of our best engineering articles, SRE case studies, and Anyshift product updates. No spam, just signal.