[Openstack-operators] Compute nodes reboot periodically by their own

Tim Bell Tim.Bell at cern.ch
Thu Jul 24 16:28:56 UTC 2014



Ø  I had no idea about this bmc thing Ill check it out.
Use ipmitool to query if you have a BMC. I suspect not as this requires configuration and the defaults are off.

Tim

From: Juan José Pavlik Salles [mailto:jjpavlik at gmail.com]
Sent: 24 July 2014 17:52
To: openstack-operators at lists.openstack.org
Subject: Re: [Openstack-operators] Compute nodes reboot periodically by their own


Thanks for the ideas! They are cheap clone servers using Intel® Server Board S5520HC with 48Gb of RAM.

-The nodes simply reboot, without any interesting logs. Not all at the same time.

-There's no automatic updates or anything like it.

-I don't think overheating is the problem, because they reboot early in the morning when they have no load at all. They usually reboot around 03:00 to 05:00 AM.

I had no idea about this bmc thing Ill check it out.
2014-07-24 12:09 GMT-03:00 Arne Wiebalck <Arne.Wiebalck at cern.ch<mailto:Arne.Wiebalck at cern.ch>>:
Hi,

Your compute nodes reboot or are shut off?

I am currently looking at some cases where VMs seem to spontaneously shut themselves
off. At least from the nova logs’ perspective there is no difference to a normal shutdown,
VM owners however confirm that they did not touch their VMs. So far I was unable to
explain this.

This is with Havana on a RHEL6 derivative, though.

Cheers,
 Arne

--
Arne Wiebalck
CERN IT

On 24 Jul 2014, at 16:46, Juan José Pavlik Salles <jjpavlik at gmail.com<mailto:jjpavlik at gmail.com>> wrote:

> Hello guys, We have got a small Grizzly cloud running since the begging of 2013 with Ubuntu 12.04. 2 compute nodes, a storage node and a controller, nothing too fancy. Everything works just fine, but... the compute nodes reboot themselves periodically, sometimes every 2 weeks, some times once a month. I've done almost everything I can think of: memory checks, analysed the logs, moved all the VMs to one node, and I just can't find the problem.
>
> Have you ever heard this kind of behaviour on compute nodes? Any ideas where I should look for the problem?
>
> Thanks in advance.
>
> --
> Pavlik Salles Juan José
> Blog - http://viviendolared.blogspot.com
> _______________________________________________
> OpenStack-operators mailing list
> OpenStack-operators at lists.openstack.org<mailto:OpenStack-operators at lists.openstack.org>
> http://lists.openstack.org/cgi-bin/mailman/listinfo/openstack-operators



--
Pavlik Salles Juan José
Blog - http://viviendolared.blogspot.com
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.openstack.org/pipermail/openstack-operators/attachments/20140724/46eda5e1/attachment.html>


More information about the OpenStack-operators mailing list