<div dir="ltr"><div dir="ltr">Hello again,<div><br></div><div>Just a short update on the results of my tests.</div><div><br></div><div>I currently see two ways of running OpenStack with RabbitMQ:</div><div><br></div><div>1. Without durable queues and without replication: just one RabbitMQ process, which gets restarted (somehow) if it fails.</div><div>2. With durable queues and replication.</div><div><br></div><div>Any other combination of these settings leads, to a greater or lesser degree, to issues with:</div><div><br></div><div>* broken / non-working bindings</div><div>* broken queues</div><div><br></div><div>I think VEXXHOST is running (1) with their openstack-operator, and for good reason.</div><div><br></div><div>I added [kolla] because kolla-ansible installs RabbitMQ with replication but without durable queues.</div><div><br></div><div>Could someone point me to the best way to get these findings into some official documentation?</div><div>I think a lot of installations out there will run into issues if a node fails under load.</div><div><br></div><div> Fabian</div><div><br></div></div><br><div class="gmail_quote"><div dir="ltr" class="gmail_attr">On Thu, 13 Aug 
2020 at 15:13, Fabian Zimmermann <<a href="mailto:dev.faz@gmail.com">dev.faz@gmail.com</a>> wrote:<br></div><blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex"><div dir="ltr"><div dir="ltr">Hi,</div><div dir="ltr"><br></div><div>I just ran some short tests today in our test environment (without durable queues and without replication):</div><div><br></div><div>* started a Rally task to generate some load</div><div>* killed RabbitMQ with kill -9 on one node</div><div>* the Rally task stopped immediately and the cloud (mostly) stopped working</div><div><br></div><div>After some debugging I found (again) exchanges that had bindings to queues, but these bindings didn't forward any messages.</div><div>I wrote a small script to detect these broken bindings and will now check whether this is reproducible.</div><div><br></div><div>Then I will try "durable queues" and "durable queues with replication" to see if this helps, even though I would expect</div><div>RabbitMQ to be able to handle this without these "hidden broken bindings".</div><div><br></div><div>This is just FYI.<br></div><div><br> Fabian</div></div>
</blockquote></div></div>
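<div dir="ltr"><div>For reference, a minimal sketch of what option (2) from the update above (durable queues plus replication) could look like. It assumes the oslo.messaging RabbitMQ driver and classic mirrored queues. The file name, the policy name "ha-all" and the queue pattern are illustrative assumptions; none of this is taken from the test setup described above, and it has not been verified against it.</div><div><br></div><pre>
# Service configuration, e.g. nova.conf (file name is an assumption;
# the same option applies to every service that uses oslo.messaging)
[oslo_messaging_rabbit]
amqp_durable_queues = true

# RabbitMQ side: mirror all queues except the auto-named amq.* ones.
# The policy name "ha-all" is arbitrary.
rabbitmqctl set_policy ha-all '^(?!amq\.).*' '{"ha-mode":"all"}'
</pre><div>Note that RabbitMQ refuses to redeclare an existing queue or exchange with a different durable flag, so switching amqp_durable_queues on a running cloud will most likely require removing the existing queues and exchanges first.</div></div>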