[post-mortem] 2026-04-24 1:01 - 1:06 PM PDT (8:01 - 8:06 PM UTC) Cloud DB Outage

Incident Overview

Date: 2026-04-24
Time (PDT): 1:01 - 1:06 PM
Duration: 5 minutes
Severity: High

Services Impacted:

  • Cloud Portal

  • Email Notifications

  • Push Notifications

  • Cross-Site Layouts

  • Cloud Access (Desktop/Mobile Clients)

Customer Impact:

  • Desktop and mobile clients were unexpectedly logged out

  • Some customers could not connect to their Sites

  • Certain API requests experienced high latency and increased failure rates

For the complete timeline, please see our Incident Page.

Root Cause

The Cloud DB service experienced an unexpected application crash. The service automatically restarted within seconds, but required additional time to fully reload its operational data before all requests could be served successfully.

This crash is part of a recurring pattern that the backend team is addressing.

How We Fixed It

The service recovered automatically. The container orchestration platform detected the crash and launched a replacement within 30 seconds. No manual intervention was required to restore service.

Full recovery took longer than the restart itself because of the service's architecture: it must load a large dataset into memory on startup before it can serve all request types.
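
To illustrate why a restarted process can be running yet still unable to serve every request, here is a minimal Python sketch of a warm-up gate. It is not the Cloud DB implementation; `WarmupGate`, `slow_loader`, the status codes, and the sample data are hypothetical names and values chosen only for this example.

```python
import threading
import time


class WarmupGate:
    """Tracks whether startup data has finished loading (hypothetical helper)."""

    def __init__(self):
        self._ready = threading.Event()
        self._data = {}

    def load(self, loader):
        # Run the loader, then flip the ready flag once the data is in memory.
        self._data = loader()
        self._ready.set()

    def is_ready(self):
        return self._ready.is_set()

    def handle(self, key):
        # Requests that depend on the in-memory dataset fail fast during warm-up.
        if not self._ready.is_set():
            return 503, "warming up"
        if key not in self._data:
            return 404, "not found"
        return 200, self._data[key]


def slow_loader():
    # Stand-in for loading a large dataset; the real load time is not modeled.
    time.sleep(0.2)
    return {"site-1": "layout-a"}


gate = WarmupGate()
before = gate.handle("site-1")  # process is up, data is not loaded yet

worker = threading.Thread(target=gate.load, args=(slow_loader,))
worker.start()
worker.join()

after = gate.handle("site-1")  # data is loaded, request succeeds
print(before, after)
```

In a setup like this, an orchestrator's readiness check could poll something like `is_ready()` so that traffic reaches the replacement only after the load completes. Whether the Cloud DB service exposes such a signal is an assumption of this sketch.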

Corrective Actions

Short Term

  • Recover and analyze existing crash diagnostic data to identify the specific software defect.

Long Term

  • The development team is actively rewriting the core of this service to improve resilience and reduce recovery time.