This is a long and detailed look at the challenges of running a distributed system, told from the perspective of an insider at Uber. As the author notes, "the practices might be an overkill for smaller or less mission-critical systems." But there's no harm in knowing about them, especially given the outside chance that what you're building might suddenly become the next Uber. And there are some pretty good practices here: failover drills, blameless post-mortems, and black-box testing systems.