All Systems Operational

Component status and uptime over the past 90 days

Production: Operational, 97.79% uptime
Authentication: Operational, 99.99% uptime
User Management: Operational, 94.52% uptime
Products Management: Operational, 98.86% uptime
AI Services - US: Operational, 99.99% uptime
  Azure OpenAI: Operational, 99.99% uptime
  Database: Operational, 100.0% uptime
  Authentication: Operational, 100.0% uptime
AI Services - EU: Operational, 99.86% uptime
  Azure OpenAI: Operational, 99.6% uptime
  Database: Operational, 100.0% uptime
  Authentication: Operational, 100.0% uptime
AI Services - AP: Operational, 100.0% uptime
  Azure OpenAI: Operational, 100.0% uptime
  Database: Operational, 100.0% uptime
  Authentication: Operational, 100.0% uptime

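
To put the uptime percentages in concrete terms, the sketch below converts a percentage into the downtime it implies over the 90-day window. It assumes uptime is a simple time-based fraction of the full window; the status page may weight partial outages differently, so treat the results as rough estimates. The function name `downtime_minutes` is illustrative and not part of any status page API.

```python
def downtime_minutes(uptime_percent: float, window_days: int = 90) -> float:
    """Return the downtime (in minutes) implied by an uptime percentage.

    Assumption: uptime is measured as a plain fraction of wall-clock time
    over the whole window.
    """
    total_minutes = window_days * 24 * 60
    return total_minutes * (100.0 - uptime_percent) / 100.0


# Percentages copied from the component list on this page.
components = [
    ("Production", 97.79),
    ("User Management", 94.52),
    ("AI Services - EU", 99.86),
]

for name, pct in components:
    hours = downtime_minutes(pct) / 60
    print(f"{name}: roughly {hours:.1f} hours of implied downtime in 90 days")
```

Under that assumption, a component at 100.0% has no implied downtime, and each 0.01% below that corresponds to about 13 minutes over 90 days.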
Operational
Degraded Performance
Partial Outage
Major Outage
Maintenance
May 18, 2025

No incidents reported today.

May 17, 2025

No incidents reported.

May 16, 2025
Resolved - https://app.azure.com/h/4TSS-3VG/6711e7

What Happened
Between 01:33 UTC and 02:38 UTC on 16 May 2025, a platform-level issue within the Azure OpenAI Service led to service interruptions affecting downstream integrations. During this period, our services that rely on Azure OpenAI experienced intermittent errors and degraded performance, including HTTP 500 responses.

Root Cause
Microsoft identified the underlying cause as an increase in total request size on its backend, which pushed containers past their memory limits and triggered restarts. As a result, one of its backend resources became unhealthy and could not process requests reliably.

Timeline of Response

01:33 UTC – Azure platform issue began impacting dependent services

01:49 UTC – Issue was automatically detected by Microsoft’s monitoring systems; Azure engineers were engaged

01:58 UTC – Root cause was isolated to memory-related container restarts within Azure OpenAI infrastructure

02:12 UTC – Microsoft engineers initiated mitigation by scaling out the affected backend systems

02:38 UTC – Full service restoration confirmed by Microsoft
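
The elapsed time at each step of the timeline above can be worked out with a short script. This is a minimal sketch: the timestamps come from the timeline, while the short labels and the helper name `minutes_since_start` are shorthand introduced here for illustration.

```python
from datetime import datetime, timezone

# Timestamps (UTC, 16 May 2025) taken from the timeline; labels are shorthand.
TIMELINE = [
    ("01:33", "impact began"),
    ("01:49", "detected"),
    ("01:58", "root cause isolated"),
    ("02:12", "mitigation started"),
    ("02:38", "restored"),
]


def minutes_since_start(timeline, day="2025-05-16"):
    """Return (label, minutes elapsed since the first event) for each entry."""
    parsed = [
        (
            datetime.strptime(f"{day} {hhmm}", "%Y-%m-%d %H:%M").replace(
                tzinfo=timezone.utc
            ),
            label,
        )
        for hhmm, label in timeline
    ]
    start = parsed[0][0]
    return [
        (label, int((ts - start).total_seconds() // 60)) for ts, label in parsed
    ]


for label, minutes in minutes_since_start(TIMELINE):
    print(f"+{minutes:>2} min  {label}")
```

By this calculation, detection came 16 minutes after impact began, mitigation started at 39 minutes, and full restoration was confirmed at 65 minutes.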

Impact on Our Services
While our infrastructure and application code remained stable, availability of AI-powered features was temporarily degraded due to upstream dependency on Azure OpenAI. No data loss or permanent service degradation occurred within our environment.

May 16, 04:33 UTC
May 15, 2025

No incidents reported.

May 14, 2025

No incidents reported.

May 13, 2025

No incidents reported.

May 12, 2025

No incidents reported.

May 11, 2025

No incidents reported.

May 10, 2025

No incidents reported.

May 9, 2025

No incidents reported.

May 8, 2025

No incidents reported.

May 7, 2025

No incidents reported.

May 6, 2025

No incidents reported.

May 5, 2025

No incidents reported.

May 4, 2025

No incidents reported.