Cloud Management Platform - NAM - Self-Service , CWF, and CM Shard 3 & 4 - Performance Degradation

Incident Report for Flexera System Status Dashboard

Postmortem

Description: Cloud Management Platform - NAM - Self-Service , CWF, and CM Shard 3 & 4 - Performance Degradation

Timeframe: October 27, 2025, 9:30 PM PST to October 27, 2025, 11:38 PM PST

Incident Summary

 

On Monday, October 27, 2025, at 9:30 PM PST, our teams identified performance degradation affecting the Self Service, CWF, and Cloud Management (CM) functionalities on Shard 3 and Shard 4 of the Cloud Management Platform in the North America region. This issue arose following a scheduled maintenance activity, during which the database infrastructure was downgraded to a lower instance type as part of our strategy.

During the post-maintenance validation, the teams observed a slowdown across the CMP platform, resulting in some operations taking longer than anticipated to complete. Consequently, customers in the NAM region may have experienced reduced performance or delays while using Self Service, CWF, and CM functionalities.

To address the degradation, we scaled up the database infrastructure to a higher resource configuration, which successfully restored normal performance on Shards 3 and 4 by 10:00 PM PST. We continued to monitor the situation closely, and by 11:38 PM PST, all affected functionalities, including Self Service, CWF, and CM, were confirmed to be fully restored and operating normally.

Root Cause

 

During the scheduled maintenance, the database infrastructure supporting Shard 3 and Shard 4 was downgraded to a lower instance type. This change was based on successful validation performed in the staging environment using the same configuration.

However, the production environment necessitated greater computational resources due to a surge in workload and concurrent user activity. Consequently, the lower instance type proved inadequate for the demands of the production environment, resulting in elevated resource utilization, database latency, and overall performance degradation on the CMP platform.

Remediation Actions

 

·        Our teams initiated an investigation immediately upon detection of performance degradation. Database performance metrics were analyzed, revealing resource contention and high latency.

·        The database instances were scaled up to a higher resource configuration to restore normal performance levels.

·        Continuous post-restoration monitoring was carried out to ensure stability and validate recovery across all impacted functionalities.

Future Preventative Measures

 

·        Enhanced Pre-Production Testing- Conduct comprehensive load and performance testing in pre-production environments that accurately replicate production-scale workloads before implementing infrastructure changes.

·        Extended Maintenance Windows- Plan for larger maintenance windows for infrastructure-level changes to ensure sufficient time for post-maintenance validation and rollback, if necessary.

Posted Nov 11, 2025 - 12:46 PST

Resolved

The issue affecting the impacted services has been fully resolved. Our teams have completed all validations successfully and confirmed that Self Service, CWF, and CM on Shards 3 and 4 have returned to normal operation.
Posted Oct 28, 2025 - 00:45 PDT

Monitoring

A fix has been implemented, and our teams are observing improvement in service performance. We are currently performing additional validations to ensure the issue is fully resolved and services are operating as expected. Further updates will be shared once validation is complete.
Posted Oct 27, 2025 - 22:36 PDT

Investigating

Issue Description: We are currently investigating an issue affecting Self Service, CWF, and CM on Shards 3 and 4 within the Cloud Management Platform in the North America (NAM) region. Following the completion of scheduled maintenance, our teams observed performance degradation and slowness in these services during post-maintenance monitoring.

Priority: P2

Restoration Activity: Our technical teams are actively engaged and investigating the issue. We are exploring potential solutions to restore functionality as quickly as possible and will provide further updates as they become available.
Posted Oct 27, 2025 - 21:57 PDT
This incident affected: Legacy Cloud Management (Automation, Cloud Management Dashboard - Shard 3, Cloud Management Dashboard - Shard 4, Self-Service - Shard 3, Self-Service - Shard 4).