Elevated latency on Root-cause agent (EU-1)
2026-04-10 14:12 UTC
Duration · 35 min
Summary
Queue backpressure on the EU-1 inference endpoint pushed p95 completion time from 4.2s to 11s. No completions were dropped; all drafts landed in Slack, just late.
14:47 UTC
Resolved
Queue has drained, p95 back under SLO for 20 minutes. Post-incident review scheduled for 2026-04-14.
14:29 UTC
Identified
Root cause is a slow upstream response from a single model replica. Traffic has been shifted to healthy replicas; p95 recovering.
14:12 UTC
Investigating
Elevated latency on the Root-cause agent in EU-1. Drafts are still being generated, but are arriving in Slack 2–6x slower than usual.