Hi,
just did some short tests today in our test-environment (without durable queues and without replication):
* started a rally task to generate some load
* kill-9-ed rabbitmq on one node
* rally task immediately stopped and the cloud (mostly) stopped working
after some debugging i found (again) exchanges which had bindings to queues, but these bindings didnt forward any msgs.
Wrote a small script to detect these broken bindings and will now check if this is "reproducible"
then I will try "durable queues" and "durable queues with replication" to see if this helps. Even if I would expect
rabbitmq should be able to handle this without these "hidden broken bindings"
This just FYI.
Fabian