Werner: "Root cause was single bit corruption of internal state messages that are distributed via Gossip techniques."
Interesting - Amazon using gossip to propagate information. Presumably this is done to reduce the overall number of calls and avoid any central point of failure.