<div dir="ltr">Hi Serg,<div><br></div><div>Thank you for sharing this information :)</div><div><br></div><div>If I'm understanding correctly, the main reason you're using a non-clustered / corosync setup is because that's how most other components in Mirantis OpenStack are configured? Is there anything to be aware of in how Murano communicates over the agent/engine rmq in a clustered rmq setup?</div><div><br></div><div>Also, is it safe to say that communication between agent/engine only, and will only, happen during app deployment? Meaning, if the rmq server goes down (let's even say it goes away permanently for exaggeration), short of some errors in the agent log, nothing else bad will come out of it?</div><div><br></div><div>With regard to a different port and a publicly accessible address, I agree and we'll be deploying this same way.</div><div><br></div><div>One thing we just ran into, though, was getting the agent/engine rmq config to work with SSL. For some reason the murano/openstack configuration (done via oslo) had no problems recognizing our SSL cert, but the agent/engine did not like it at all. The Ubuntu Cloud packages have not been updated for a bit so we ended up patching for the "insecure" option both in engine and agent templates (btw: very nice that the agent can be installed via cloud-init -- I really didn't want to manage a second set of images just to have the agent pre-installed).</div><div><br></div><div>Thank you again,</div><div>Joe</div></div><div class="gmail_extra"><br><div class="gmail_quote">On Thu, Sep 22, 2016 at 10:13 PM, Serg Melikyan <span dir="ltr"><<a href="mailto:smelikyan@mirantis.com" target="_blank">smelikyan@mirantis.com</a>></span> wrote:<br><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex">Hi Joe,<br>

<br>

I can share some details on how murano is configured as part of the<br>

default Mirantis OpenStack configuration and try to explain why it is<br>

done that way. I hope it helps in your case.<br>

<br>

As part of Mirantis OpenStack, a second RabbitMQ instance is<br>

deployed specifically for murano, but its configuration is<br>

different from that of the RabbitMQ instance used by the other OpenStack<br>

components.<br>

<br>

Why use a separate RabbitMQ instance?<br>

     1. To prevent access to the RabbitMQ that supports the<br>

whole cloud infrastructure by limiting access at the network level<br>

rather than relying on authentication/authorization<br>

     2. To reduce the risk of DDoS against the infrastructure RabbitMQ by<br>

limiting access to it at the network level<br>

<br>

Given that the second RabbitMQ instance is used only for murano-agent<br>

<-> murano-engine communication and murano-agent runs on the<br>

VMs, we had to make a couple of changes to the RabbitMQ deployment<br>

(below, "RabbitMQ" refers to the instance used by Murano<br>

for m-agent <-> m-engine communication):<br>

<br>

1. RabbitMQ is not clustered; a separate instance runs on each<br>

controller node<br>

2. RabbitMQ is exposed on the Public VIP where all OpenStack APIs are exposed<br>

3. It uses a non-default port number<br>

4. HAProxy is used: RabbitMQ is hidden behind it, and HAProxy always<br>

points to the RabbitMQ on the current primary controller<br>

<br>

Note: How does murano-agent work? Murano-engine creates a queue with a<br>

unique name and puts configuration tasks into that queue. They are<br>

picked up later by murano-agent once the VM has booted; murano-agent<br>

is configured through cloud-init to use the created queue.<br>
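<br>

A minimal sketch of that handoff, assuming an in-memory stand-in for the<br>

broker. FakeBroker, engine_prepare_vm, agent_run and the "agent-" queue<br>

prefix are hypothetical names for illustration, not murano code:<br>

```python
import json
import queue
import uuid


class FakeBroker:
    """In-memory stand-in for RabbitMQ: named FIFO queues only."""

    def __init__(self):
        self._queues = {}

    def declare(self, name):
        # Declaring an existing queue returns it unchanged.
        return self._queues.setdefault(name, queue.Queue())

    def publish(self, name, message):
        self.declare(name).put(json.dumps(message))

    def consume(self, name):
        # Return the next task, or None when the queue is empty.
        try:
            return json.loads(self.declare(name).get_nowait())
        except queue.Empty:
            return None


def engine_prepare_vm(broker, tasks):
    """Engine side: create a uniquely named queue and pre-load the tasks."""
    queue_name = "agent-" + uuid.uuid4().hex
    broker.declare(queue_name)
    for task in tasks:
        broker.publish(queue_name, task)
    # The queue name is what would be handed to the VM through cloud-init.
    return queue_name


def agent_run(broker, queue_name):
    """Agent side: after boot, drain the queue named in its configuration."""
    done = []
    while True:
        task = broker.consume(queue_name)
        if task is None:
            return done
        done.append(task["name"])


broker = FakeBroker()
name = engine_prepare_vm(broker, [{"name": "install"}, {"name": "configure"}])
print(agent_run(broker, name))
```

The tasks are queued before the VM exists, so the agent only needs the<br>

queue name and outbound access to the broker.<br>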

<br>

#1 Clustering<br>

<br>

* For one app deployment we create 1-N VMs and send 1-M<br>

configuration tasks, where in most cases N and M are less than<br>

3.<br>

* Even if an app deployment fails due to cluster failover, it<br>

can always be redeployed by the user.<br>

* Controller-node failover will most probably limit the<br>

accessibility of the Heat, Nova & Neutron APIs, so the application<br>

deployment will fail regardless of whether the configuration<br>

task was executed on the VM.<br>

<br>

#2 Exposure on the Public VIP<br>

<br>

One of the reasons for choosing RabbitMQ as the transport for<br>

murano-agent communication was connectivity from the VM: it is much<br>

easier to implement connectivity *from* the VM than *to* the VM.<br>

<br>

But even when you are connecting to the broker from the VM,<br>

you still need connectivity, and the public interface where all other<br>

OpenStack APIs are exposed is the most natural way to provide it.<br>

<br>

#3 Non-default port number<br>

<br>

Just to avoid confusion with the RabbitMQ used for the infrastructure,<br>

even though they are on different networks.<br>

<br>

#4 HAProxy<br>

<br>

In the default Mirantis OpenStack configuration, HAProxy is used mostly<br>

to support the non-clustered RabbitMQ setup and exposure on the Public<br>

VIP, but it is also helpful in more complicated setups.<br>
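<br>

As a rough illustration of "always pointing to the current primary", the<br>

selection can be approximated as picking the first healthy backend from a<br>

priority-ordered list. This is a sketch under that assumption, not the<br>

actual HAProxy logic; pick_backend, the controller addresses and the port<br>

are hypothetical:<br>

```python
def pick_backend(backends, is_healthy):
    """Return the first healthy backend in priority order, or None."""
    for backend in backends:
        if is_healthy(backend):
            return backend
    return None


# Hypothetical controller addresses; the first entry is the primary.
backends = ["controller-1:5673", "controller-2:5673", "controller-3:5673"]
down = {"controller-1:5673"}

# With the primary marked down, traffic goes to the next controller.
print(pick_backend(backends, lambda b: b not in down))
```

Only one backend receives traffic at a time, which is why the RabbitMQ<br>

instances behind the proxy do not need to be clustered.<br>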

<br>

P.S. I hope my answers helped; let me know if I can cover something in<br>

more detail.<br>

<span class="HOEnZb"><font color="#888888">--<br>

Serg Melikyan, Development Manager at Mirantis, Inc.<br>

<a href="http://mirantis.com" rel="noreferrer" target="_blank">http://mirantis.com</a> | <a href="mailto:smelikyan@mirantis.com">smelikyan@mirantis.com</a><br>

</font></span></blockquote></div><br></div>