Skip to content

Latest commit

 

History

History
11 lines (9 loc) · 817 Bytes

adc3b69d-9cb8-40d7-941c-f63316a72eb6.md

File metadata and controls

11 lines (9 loc) · 817 Bytes
uuid url categories company product
adc3b69d-9cb8-40d7-941c-f63316a72eb6
postmortem
Amazon

At 7:30 AM PST, an automated activity to scale capacity of one of the AWS services hosted in the main AWS network triggered an unexpected behavior from a large number of clients inside the internal network. This resulted in a large surge of connection activity that overwhelmed the networking devices between the internal network and the main AWS network, resulting in delays for communication between these networks. These delays increased latency and errors for services communicating between these networks, resulting in even more connection attempts and retries. This led to persistent congestion and performance issues on the devices connecting the two networks.