Description: Flexera One – IT Asset Management – EU – Service Disruption
Timeframe: July 23, 2025, 1:07 PM to 1:38 PM PDT
Incident Summary
On July 23, 2025, at 1:07 PM PDT, Flexera One IT Asset Management in the Europe region experienced a service disruption. During this period, customers may have been unable to log in or access key workflows within IT Asset Management.
The disruption occurred shortly after a release window. While initial access was confirmed as available, the underlying service components encountered readiness issues, leading to both servers being marked unavailable. Automated recovery mechanisms initiated replacement servers, restoring access by 1:38 PM PDT.
Comprehensive checks confirmed full functionality following restoration, and no direct customer reports were received during the incident.
Root Cause
Primary Root Cause:
The disruption was triggered by a readiness failure in the service components responsible for handling customer access. The affected components did not transition to a fully operational state, which caused temporary unavailability until recovery mechanisms provisioned new healthy servers.
Contributing Factors:
• Timing of Health Readiness: The service health checks may not have allowed sufficient time for new servers to become fully operational after the release window.
• Startup Process Completion: A multi step initialization process may not have fully completed, preventing servers from passing readiness checks.
• Simultaneous Failures: Both servers failed around the same time, amplifying the impact and leaving no fallback available until recovery mechanisms were triggered.
Remediation Actions
Future Preventative Measures
Following this incident, a detailed root cause analysis and internal retrospective were completed to identify areas for long term improvement. The following workstreams were initiated under a platform reliability initiative aimed at strengthening performance and minimizing the risk of recurrence. While the underlying cause of the service disruption remains under investigation, the actions already implemented and those underway are expected to significantly improve the platform’s ability to detect, respond to, and recover from similar events in the future.