[openstack-dev] [Fuel-dev] [Fuel][RabbitMQ] nova-compute stuck for a while (AMQP)
Bogdan Dobrelya
bdobrelia at mirantis.com
Wed May 7 13:12:15 UTC 2014
On 05/06/2014 10:42 PM, Roman Sokolkov wrote:
> Hello, fuelers.
>
> I'm using Fuel 4.1A + Havana in HA mode.
>
> I constantly observe (on other deployments as well) an issue with a stuck
> "nova-compute" service. But I think the problem is more fundamental and
> relates to HA RabbitMQ and the OpenStack AMQP driver implementation.
>
> Symptoms:
>
> * A random nova-compute is marked as "XXX" for a while from time to time.
> * The service itself appears to work properly. Its logs show it sending
> status updates to the conductor, but nothing is actually sent.
> * "netstat" shows that all connections to/from rabbit are "ESTABLISHED".
> * rabbitmqctl shows that the "compute.node-x" queue is synced to all slaves.
> * Nothing had been broken beforehand (the RabbitMQ cluster, etc.).
>
> Axe style solution:
>
> * /etc/init.d/openstack-nova-compute restart
>
> I've found a lot of interesting material (and solutions) here:
>
> https://bugs.launchpad.net/oslo.messaging/+bug/856764
>
>
> My questions are:
>
> * Are there any Fuel-specific ideas for solving or working around this
> issue?
> * Is there any quick fix for this in 4.1? For example, adjusting TCP
> keep-alive timeouts?
Perhaps the solution is to apply https://review.openstack.org/#/c/34949
and check the results with RabbitMQ and Nova. If it works, we could submit
a task for the OSCI team to patch our internal repos and update the MOS
packages targeted at 4.1.1 / 5.0.
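
On the TCP keep-alive question: below is a minimal sketch of what enabling
keep-alive probes on a client socket looks like in Python. It only
illustrates the general mechanism and is not meant to reproduce the change
under review above. The helper name enable_keepalive and the timeout values
are assumptions chosen for the example. The TCP_KEEP* options are
platform-specific (they are available on Linux), so the sketch skips any that
the local socket module does not expose.

```python
import socket


def enable_keepalive(sock, idle=60, interval=10, count=5):
    """Turn on TCP keep-alive probes for an existing socket.

    Hypothetical helper for illustration; the values are examples only.
    idle:     seconds of silence before the first probe is sent
    interval: seconds between unanswered probes
    count:    unanswered probes before the kernel drops the connection
    """
    # SO_KEEPALIVE itself is portable across platforms.
    sock.setsockopt(socket.SOL_SOCKET, socket.SO_KEEPALIVE, 1)

    # The per-socket tuning knobs are not available everywhere,
    # so set only those that this platform provides.
    tuning = (
        ("TCP_KEEPIDLE", idle),
        ("TCP_KEEPINTVL", interval),
        ("TCP_KEEPCNT", count),
    )
    for name, value in tuning:
        option = getattr(socket, name, None)
        if option is not None:
            sock.setsockopt(socket.IPPROTO_TCP, option, value)
    return sock


if __name__ == "__main__":
    s = socket.socket(socket.AF_INET, socket.SOCK_STREAM)
    enable_keepalive(s, idle=30, interval=5, count=3)
    print("keep-alive enabled:",
          s.getsockopt(socket.SOL_SOCKET, socket.SO_KEEPALIVE) != 0)
    s.close()
```

With settings like these, a peer that silently disappears should be detected
by the kernel after roughly idle + interval * count seconds, instead of the
connection staying "ESTABLISHED" indefinitely. Whether that is enough for the
AMQP driver to reconnect would still need to be checked with rabbitmq and
nova.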
>
>
> --
> Roman Sokolkov,
> Deployment Engineer,
> Mirantis, Inc.
> Skype rsokolkov,
> rsokolkov at mirantis.com <mailto:rsokolkov at mirantis.com>
>
>
--
Best regards,
Bogdan Dobrelya,
Skype #bogdando_at_yahoo.com
Irc #bogdando