iaun
Published 2025-10-20 / 390 views

2025/10/20 - AWS US East (us-east-1): multiple services unavailable

At 15:11 Beijing time (UTC+8) on October 20, 2025, multiple services in the AWS us-east-1 region became unavailable.

The jackpot keeps growing...


According to AWS's announcement, the root cause may be related to DNS resolution problems, in particular failures resolving the DynamoDB API endpoint.
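A quick way to see whether an endpoint resolves from your own machine is to ask the system resolver directly. The sketch below is only an illustration, not anything AWS published: the helper name `resolve` and its `resolver` parameter are my own, and it only tells you what your local resolver returns, not what is happening inside AWS.

```python
import socket


def resolve(hostname, port=443, resolver=socket.getaddrinfo):
    """Return the sorted unique IP addresses for hostname, or [] if DNS resolution fails.

    `resolver` is a hypothetical injection point so the lookup can be
    replaced in tests; by default it is the standard socket.getaddrinfo.
    """
    try:
        infos = resolver(hostname, port, proto=socket.IPPROTO_TCP)
    except socket.gaierror:
        # The name could not be resolved (the symptom described above).
        return []
    # Each entry is (family, type, proto, canonname, sockaddr);
    # sockaddr[0] is the IP address for both IPv4 and IPv6.
    return sorted({info[4][0] for info in infos})
```

For example, `resolve("dynamodb.us-east-1.amazonaws.com")` returns a list of addresses when resolution works and an empty list when it does not.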


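By 18:35 Beijing time on October 20, 2025, the incident had been mitigated, about 3 hours 24 minutes after it began, though services were still recovering slowly. The services affected at that point: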


By 6:53 AM Beijing time on October 21, 2025, everything had fully recovered; the incident lasted 15 hours 42 minutes in total.

Oct 20 3:53 PM PDT Between 11:49 PM PDT on October 19 and 2:24 AM PDT on October 20, we experienced increased error rates and latencies for AWS Services in the US-EAST-1 Region. Additionally, services or features that rely on US-EAST-1 endpoints such as IAM and DynamoDB Global Tables also experienced issues during this time. At 12:26 AM on October 20, we identified the trigger of the event as DNS resolution issues for the regional DynamoDB service endpoints. After resolving the DynamoDB DNS issue at 2:24 AM, services began recovering but we had a subsequent impairment in the internal subsystem of EC2 that is responsible for launching EC2 instances due to its dependency on DynamoDB. As we continued to work through EC2 instance launch impairments, Network Load Balancer health checks also became impaired, resulting in network connectivity issues in multiple services such as Lambda, DynamoDB, and CloudWatch. We recovered the Network Load Balancer health checks at 9:38 AM. As part of the recovery effort, we temporarily throttled some operations such as EC2 instance launches, processing of SQS queues via Lambda Event Source Mappings, and asynchronous Lambda invocations. Over time we reduced throttling of operations and worked in parallel to resolve network connectivity issues until the services fully recovered. By 3:01 PM, all AWS services returned to normal operations. Some services such as AWS Config, Redshift, and Connect continue to have a backlog of messages that they will finish processing over the next few hours. We will share a detailed AWS post-event summary.
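The timestamps in this post mix Beijing time (my own observations) with PDT (the AWS statement), so here is a small sketch for lining them up. It assumes fixed offsets of UTC+8 for Beijing and UTC-7 for PDT; the variable names and the `fmt` helper are mine.

```python
from datetime import datetime, timedelta, timezone

# Assumed fixed offsets: Beijing time is UTC+8, PDT is UTC-7.
BEIJING = timezone(timedelta(hours=8))
PDT = timezone(timedelta(hours=-7))


def fmt(delta):
    """Format a timedelta as e.g. '3h24m'."""
    minutes = int(delta.total_seconds()) // 60
    return f"{minutes // 60}h{minutes % 60:02d}m"


# Times observed in this post (Beijing time).
observed_start = datetime(2025, 10, 20, 15, 11, tzinfo=BEIJING)
observed_mitigated = datetime(2025, 10, 20, 18, 35, tzinfo=BEIJING)
observed_recovered = datetime(2025, 10, 21, 6, 53, tzinfo=BEIJING)

print(fmt(observed_mitigated - observed_start))  # 3h24m
print(fmt(observed_recovered - observed_start))  # 15h42m

# Times from the AWS statement (PDT).
aws_start = datetime(2025, 10, 19, 23, 49, tzinfo=PDT)
aws_all_normal = datetime(2025, 10, 20, 15, 1, tzinfo=PDT)
aws_posted = datetime(2025, 10, 20, 15, 53, tzinfo=PDT)

stamp = "%Y-%m-%d %H:%M"
print(aws_start.astimezone(BEIJING).strftime(stamp))       # 2025-10-20 14:49
print(aws_all_normal.astimezone(BEIJING).strftime(stamp))  # 2025-10-21 06:01
print(aws_posted.astimezone(BEIJING).strftime(stamp))      # 2025-10-21 06:53
print(fmt(aws_all_normal - aws_start))                     # 15h12m
```

By this arithmetic, the window in the AWS statement runs from 14:49 on October 20 to 06:01 on October 21 Beijing time, while the 06:53 recovery time noted above coincides with the moment the statement was posted.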

The full list of affected services:

