Resolved -
Following an extended period of monitoring, our teams have confirmed that services have remained stable. The external service provider has also confirmed full resolution of their outage. This incident is now resolved, and a detailed post-mortem report will be shared with additional insights.
May 8, 22:23 PDT
Update -
Our technical teams have completed internal stabilization actions to address the earlier service impact, and Spot services have remained stable for the last several hours. We have not observed any recurrence of customer-facing impact, and instance provisioning behavior continues to operate as expected.
We are continuing to monitor the environment and our service provider’s updates closely. The incident will remain in monitoring for the next few hours, with closure to follow once this additional monitoring period is complete.
May 8, 12:39 PDT
Update -
Some customers may experience residual issues following the recent fix. Our technical teams are actively working to address these and ensure full service stability. We will provide further updates as more information becomes available.
May 8, 01:25 PDT
Monitoring -
Our teams have implemented mitigation measures to address the impact from the external service disruption, and services have now been restored. We are actively monitoring the platform to ensure continued stability and will provide further updates as more information becomes available.
May 7, 23:11 PDT
Identified -
Incident Description: We are currently investigating multiple issues impacting SPOT services.Customers may experience difficulties launching instances in the affected region, resulting in degraded provisioning capabilities. Initial findings indicate that the issue is related to an ongoing disruption within our cloud service provider, specifically in the us-east-1 region. As a result, customers may face difficulties launching instances, resulting in degraded service performance.
Priority: P2
Restoration Activity: Our technical teams are actively engaged and are working with the service provider to monitor the ongoing regional issue. We are assessing the impact on dependent services, including database connectivity and request processing, while identifying potential mitigation options. Further updates will be provided as more information becomes available and as service stability improves.
May 7, 20:00 PDT