Description: Flexera One – SaaS Manager – NA – Managed Applications Not Loading
Timeframe: November 4th, 2:38 AM PDT to November 4th, 9:54 AM PDT
Incident Summary:
On Friday, November 4th, at 2:38 AM PDT, we received reports of performance degradation in the SaaS Manager application in the NA region. Customers were able to access the Managed SaaS Applications menu, but when they attempted to launch any application from the menu, it stayed stuck on loading and did not load the results.
After further investigation, technical staff observed some resource contention issues. To alleviate the issue, one of the web app servers was rebooted at 4:37 AM PDT. It provided some relief, however, some of the services were still in an unhealthy state causing performance issues in the application.
After further investigation, technical staff found that one of the worker nodes was in an unhealthy state. Any requests going into pods hosted on this node resulted in failure. At 9:54 AM PDT, the impacted node was drained, and pods were moved to healthy nodes. Health checks confirmed that impacted services were now restored and functional.
Staff continued to monitor the services for the next few hours to ensure stability. After further health checks and monitoring, the incident was declared resolved.
Root Cause:
Investigation revealed that one of the worker nodes was in an unhealthy state. Any requests going into pods hosted on this node resulted in failure.
Corrective Action: