undefined — Blog

Series

Technical deep dives into Anyshift's engineering decisions, architecture, and lessons learned.

37 Articles in this series

Anyshift meets ClickHouse: make ClickStack alerts production-aware

ClickStack detects a telemetry spike. To explain its production impact, Anyshift adds the live owner, recent rollout, blast radius, and next action.

Stephane Jourdan · Jul 29, 2026 · 4 min read

OAuth connect for Anyshift integrations

Paste-an-API-token onboarding is out for the common path. Connect with Atlassian, Linear, Notion, Sentry, GitLab, and incident.io now run OAuth with backend-owned refresh, while personal tokens stay as the fallback for self-hosted and automation.

Louis Fradin · Jul 22, 2026 · 4 min read

Anyshift meets Sentry: map one release across every affected project

A Sentry release needs a list of projects. The Anyshift Graph SDK maps a deployment's downstream production impact so the same release can reach every affected Sentry project.

Louis Fradin · Jul 21, 2026 · 4 min read

The Anyshift Graph API

Every question your team asks about production now has an API. Rank a flapping monitor in 0.4 seconds, trace a bad deploy to its failures, and give your agents ground truth instead of another dashboard.

Roxane Fischer · Jul 15, 2026 · 4 min read

How Much Context Does an AI Agent Really Need?

Unified production context makes AI agents cheaper, faster, and easier to scale. In our live benchmark, resolving relationships before inference cut model-token usage by 83.5%, while reducing response time and estimated cost without changing the task outcome.

Roxane Fischer · Jul 13, 2026 · 7 min read

Anyshift meets Harness: production-aware approvals

Harness runs the production release pipeline and manual approval gate for the checkout-api deployment. Anyshift adds the production impact: affected services, owners, recent changes, and the review decision before the approval waits for a human.

Stephane Jourdan · Jun 29, 2026 · 4 min read

Anyshift meets Dash0: write production-change context into service timelines

Dash0 shows the telemetry. Anyshift writes the upstream production-change context into Dash0 as OpenTelemetry service events for every affected service.

Stephane Jourdan · Jun 19, 2026 · 3 min read

Anyshift meets Redis: Production context before AI agents act

Anyshift adds live production context to AI agents using Redis, then writes the enriched context back to Redis before an AI agent acts.

Stephane Jourdan · Jun 19, 2026 · 4 min read

Anyshift meets ServiceNow: production context for incident workflows

ServiceNow manages incidents, changes, approvals, and tasks. Anyshift adds production context so those workflows can include cause, blast radius, and owner review.

Stephane Jourdan · Jun 18, 2026 · 3 min read

Anyshift meets Postman: production-impact API checks before release gates run

Postman runs API collections with environments. Anyshift adds the live production context that tells those workflows which API paths, consumers, owners, and monitors actually matter before a release gate runs.

Stephane Jourdan · Jun 15, 2026 · 3 min read

Prove Which Snyk Container CVEs Are Running in Production

Snyk Container identifies vulnerable image digests. Anyshift joins each digest to Kubernetes runtime, exposure, rollout history, owner, and remediation window.

Roxane Fischer · Jun 15, 2026 · 3 min read

Break our demo infrastructure on purpose and watch the root cause surface

Sever the link between a service and its database in our new Playground, and a change event lands. Seconds later the root cause comes back, traced against the topology graph instead of a wall of logs. A hands-on way to see what change-first root-cause analysis does, no signup.

Louis Fradin · Jun 12, 2026 · 2 min read

Anyshift meets CrowdStrike: safer threat response with production context

CrowdStrike Falcon helps security teams decide what to do with suspicious domains, IPs, and files. Anyshift shows which services, owners, dependencies, and recent deploys are behind the signal before analysts detect, block, or escalate it.

Stephane Jourdan · Jun 10, 2026 · 4 min read

Approve Confluent schema changes with production impact

Confluent validates and registers Kafka schema changes. Anyshift adds the production impact: affected services, owners, monitors, and skipped non-production paths.

Stephane Jourdan · Jun 10, 2026 · 4 min read

AI Context for Prod, Optimized by AIs in Prod

Annie (our AI SRE agent) had institutional memory from ACE, the agentic context-engineering loop that curates cheatsheets from past runs. It worked, but clients kept catching her trusting stale entries or missing answers buried in her own bloated context. So we added five things on top: (1) a fixed set of memory items always presented to the agent, (2) per-query retrieval over the rest of the memory store, (3) an agent-optimized index of that store, (4) the ability for the agent to query the store mid-run, and (5) tried-and-true memory freshness mechanisms. Production context, now optimized by the AI using it. Here's the reasoning and what a few weeks in production say.

Ghazi Felhi · Jun 10, 2026 · 10 min read

Anyshift meets Coralogix: turning telemetry into reviewed production handoffs

Coralogix is where SREs investigate telemetry. Anyshift adds the production graph around a signal: affected service, owner, recent deploy, dependency evidence, and skip reasons, then writes the reviewed handoff into a Coralogix Custom Dashboard.

Stephane Jourdan · Jun 9, 2026 · 4 min read

How we turned on-call judgment into skills an AI agent can load

An AI agent in the incident channel can run kubectl and read a dashboard. What it can't do is judge whether the last deploy is the suspect or a red herring. We open-sourced the SRE skills that encode that judgment, runnable offline against fixtures with no credentials.

Louis Fradin · Jun 9, 2026 · 4 min read

Anyshift meets MongoDB Atlas: production-aware alert settings

MongoDB Atlas can alert when a cluster nears its connection limit. Anyshift adds the pre-enable review: affected services, owners, monitors, recent changes, and non-production exclusions before paging starts.

Stephane Jourdan · Jun 9, 2026 · 4 min read

Anyshift meets Snowflake: production context before agents act

Snowflake is where teams govern data, workloads, and AI workflows. Anyshift adds the live production graph those workflows need before they apply a fix, rerun a task, refresh a dynamic table, or trigger an agentic workflow.

Stephane Jourdan · Jun 8, 2026 · 4 min read

Anyshift meets Databricks: checking production impact before a data pipeline rerun

Databricks gives teams the governed data and AI surface. Anyshift adds the live production context a Databricks workflow needs before it patches a data pipeline, reruns a backfill, or calls an agent tool.

Stephane Jourdan · Jun 8, 2026 · 4 min read

Anyshift meets GitLab: production impact before merge

GitLab shows reviewers the diff, pipelines, and approvals. Anyshift adds the missing production layer: which live services use the changed code, who owns them, what can be skipped, and who should review before merge.

Stephane Jourdan · Jun 8, 2026 · 4 min read

Anyshift meets Okta: production reachability for access changes

Okta is where teams manage identity, access, and policy. Anyshift adds production reachability enrichment around an access change: which services, cloud roles, Kubernetes workloads, monitors, and owners sit behind the group before Okta performs the assignment.

Stephane Jourdan · Jun 8, 2026 · 3 min read

Anyshift meets Elastic: debug with PR context already attached

Elastic gives teams the place to search, triage, and open Cases (Kibana investigation tickets) when an incident starts. For a PR that changes a shared authentication module, Anyshift adds what Elastic cannot infer from the PR alone: which production services depend on it, who owns them, Identity hints, and evidence. So when a human or agent starts debugging, the context is already attached.

Stephane Jourdan · Jun 4, 2026 · 3 min read

Private Equity's AI Transition Problem in 2026

Private equity firms are pushing AI across portfolios, but the hard part is no longer experimentation. It is turning sponsor-level AI pressure into governed, production-aware operating workflows that actually move EBITDA.

Roxane Fischer · Jun 2, 2026 · 8 min read

Anyshift meets Splunk: reduce maintenance alert fatigue

Planned maintenance often creates alert noise. Anyshift finds the Splunk alerts affected by a change, pauses only those saved searches, and turns them back on when the window ends. Teams keep real alerts visible while expected noise stays out of the way.

Louis Fradin · Jun 1, 2026 · 3 min read

Anyshift meets Dynatrace: graph context for every deploy

A deployment event should carry the service, owner, and monitored entity it actually changed. Anyshift adds that production context to Dynatrace so on-call teams do not rebuild it from CI and infrastructure tabs.

Stephane Jourdan · May 28, 2026 · 3 min read

New Relic Change Tracking with production impact from Anyshift

A shared ConfigMap changes, and latency rises. New Relic shows the timing; Anyshift finds the eight workloads in the blast radius. See how both become part of one Change Tracking event.

Stephane Jourdan · May 27, 2026 · 4 min read

Anyshift meets acli: PR impact, routed into Jira

A shared-code PR should not surprise downstream teams after merge. Anyshift finds the running services and owners affected by the change, then routes the advisory work into Jira before the review is over.

Stephane Jourdan · May 26, 2026 · 4 min read

Anyshift meets pup: turning intent into audited Datadog runbooks

Datadog pup can mute monitors during maintenance, but teams still have to know which downstream services will be noisy. Anyshift CLI maps the affected services from production context, then prepares the Datadog downtime runbook with an audit trail.

Stephane Jourdan · May 22, 2026 · 4 min read

Anyshift meets gcx: cloning Grafana observability with audited runbooks

Grafana shows the service you instrumented, but downstream services often miss the same dashboards and SLOs. Anyshift maps the dependency graph, finds the coverage gaps, and prepares the Grafana resources for gcx to apply after review.

Stephane Jourdan · May 21, 2026 · 4 min read

Annie reads Linear now

Forty minutes paging Linear to confirm a returning customer report was the same bug we'd half-shipped a fix for in February. The Linear integration went GA May 13, and Annie pulled both tickets, the linked PR, and the stalled action in twenty-three seconds.

Louis Fradin · May 13, 2026 · 2 min read

Annie searches Notion now

Ten minutes to find a post-mortem already sitting in Notion. The Notion integration shipped May 12, and Annie picked the same page in eighteen seconds, root cause and open action items tagged.

Louis Fradin · May 12, 2026 · 2 min read

How we now know which commit broke each Sentry error

Five Sentry tickets in one worker turned out to be one bug. The most-recent error came from the very PR that had wired Sentry forwarding in. How a stack frame now leads to the offending commit, the deploy behind it, and the team that owns the failing path.

Louis Fradin · May 11, 2026 · 2 min read

Report Templates: pre-built investigations, one click

Every Monday, the pod-stability review gets rebuilt from scratch. Same dashboards, same correlation work, same write-up. Two hours, gone. Report Templates turn the recurring investigations platform and SRE teams run by hand into one click.

Louis Fradin · Apr 15, 2026 · 2 min read

Annie CLI

136 CloudWatch alarms vanish overnight. Annie cross-references Slack, the audit trail, and your infra graph in one query. Now it runs in your terminal.

Stephane Jourdan · Mar 16, 2026 · 3 min read

Agentic Context Engineering in Production: How AI Agents Build Institutional Expertise

AI agents start every run from scratch. ACE (Agentic Context Engineering) gives them institutional memory that evolves through use, cutting root cause analysis time by 30%.

Ghazi Felhi · Mar 11, 2026 · 8 min read

Building a Temporal Infrastructure Knowledge Graph: A Year of Working with Neo4j at Scale

How Anyshift chose Neo4j for building a temporal infrastructure knowledge graph and lessons learned over a year of production use.

Stephane Jourdan · Feb 17, 2026 · 14 min read

← Back to all posts