Postmortem - INC200663 – PaymentStream Direct (PSD) wires outage
On Wednesday, May 7 at 8:08 a.m. PT (11:08 a.m. ET), a routine change intended to improve Kubernetes service capacity unexpectedly removed critical Wires dependencies, which restricted our PaymentStream Direct Wires (PSD Wires) application from communicating with internal dependent systems. As a result, while users could submit wires for processing, and Central 1 could receive inbound wires from external Financial Institutions, we could not process them, thus they remained in pending status until system recovery ws completed. The nature of the Incident also prevented an automated recovery, requiring extensive manual intervention to restore application functionality. Full service was restored by 11:25 a.m. PT (2:25 p.m. ET).
Point of failure: The incident was caused by human error during a production deployment. A missing configuration allowed namespace deletion, resulting in widespread service impact and a complex manual recovery process.
PRB011524 – Root cause analysis actions
We recognize the disruption this caused and are committed to learning from this incident to reduce risk and improve our resiliency going forward.
If you have any questions regarding this postmortem, please contact support@central1.com