I don't get why people think cloud services are so reliable. I mean, we just had a major outage at Amazon's AWS... a software bug in their DynamoDB DNS management system caused a domino effect that took down whole systems! It's crazy to think that even after they fixed the issue, some EC2 instances were still experiencing network connectivity problems for days. What's the point of having a "multi-region design" if you're still going to have these kinds of knock-on problems? We need better redundancy and fail-safes in cloud infrastructure, not just lip service to them.
It's crazy that Amazon's biggest failure was caused by a simple software bug. Like, how can you even have a system that relies on a single point of failure in the first place? Avoiding that is just engineering 101. And now they're patching it with band-aids like disabling certain tools instead of taking the time to rewrite the whole thing from scratch. And what about all the other companies that rely on Amazon's services? How are they supposed to mitigate the impact of this failure without access to reliable infrastructure?