On 09/11/2014 01:50 PM, James Dempsey wrote:
> On 12/09/14 04:15, Chris Friesen wrote:
>> Hi,
>>
>> The OpenStack high availability guide seems to be a bit ambiguous about
>> whether RabbitMQ should be configured active/standby or
>> active/active...both methods are described.
>>
>> Has anyone tried using active/active with mirrored queues as recommended
>> by the RabbitMQ developers? If so, what problems did you run into?
>>
>> Thanks,
>> Chris
>>
>> _______________________________________________
>> OpenStack-operators mailing list
>> OpenStack-operators at lists.openstack.org
>> http://lists.openstack.org/cgi-bin/mailman/listinfo/openstack-operators
>
> Hi Chris,
>
> We do Active/Active RabbitMQ with mirrored queues and Precise/Havana.
> We faced a lot of failover problems where clients weren't figuring out
> that their connections were dead. The TCP keepalives mentioned in the
> following bug seemed to help a lot.
> https://bugs.launchpad.net/nova/+bug/856764
>
> The moral of our story is to make sure you are monitoring the sanity
> of your agents.

We had client-side issues on failovers as well. Keepalives seemed to
help, but we also ported the following patch to all the applicable
clients (since they're not using oslo.messaging yet).

https://github.com/openstack/oslo.messaging/commit/0400cbf4f83cf8d58076c7e65e08a156ec3508a8

Chris
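
For anyone unfamiliar with the TCP keepalives mentioned above, here is a rough Python sketch of turning them on for a client socket, so that a dead connection to a failed broker is eventually noticed by the kernel. This is only an illustration of the general technique, not the change from the linked bug or the oslo.messaging commit. The helper name enable_keepalive and the timing values are made up for the example, and the TCP_KEEP* options are platform-dependent, so the code skips any that the local socket module does not expose.

```python
import socket


def enable_keepalive(sock, idle=60, interval=10, count=5):
    """Enable TCP keepalive probes on an existing socket.

    idle, interval and count are illustrative values, not recommendations:
    idle     - seconds of silence before the first probe is sent
    interval - seconds between unanswered probes
    count    - unanswered probes before the connection is declared dead
    """
    # Portable switch: ask the kernel to send keepalive probes at all.
    sock.setsockopt(socket.SOL_SOCKET, socket.SO_KEEPALIVE, 1)

    # Per-socket tuning is platform-specific (these names exist on Linux);
    # apply only the options this platform's socket module provides.
    tuning = (
        ("TCP_KEEPIDLE", idle),
        ("TCP_KEEPINTVL", interval),
        ("TCP_KEEPCNT", count),
    )
    for name, value in tuning:
        option = getattr(socket, name, None)
        if option is not None:
            sock.setsockopt(socket.IPPROTO_TCP, option, value)
    return sock


if __name__ == "__main__":
    s = socket.socket(socket.AF_INET, socket.SOCK_STREAM)
    enable_keepalive(s)
    print("keepalive on:", s.getsockopt(socket.SOL_SOCKET, socket.SO_KEEPALIVE) != 0)
    s.close()
```

With settings like these, a peer that vanishes without closing the connection would be detected after roughly idle + interval * count seconds, rather than after the much longer system default. Whether a given AMQP client library lets you reach its underlying socket to apply this is something to check per client.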