[Win The Enterprise-wg] libvirtWatchdog status

Russell Bryant rbryant at redhat.com
Wed Dec 3 18:51:44 UTC 2014


On 12/03/2014 01:37 PM, Britten, Tyler wrote:
> It seems like the main ask from the ‘pets’ side of the enterprise is not
> instance monitoring/recovery, but hypervisor monitoring for instance
> recovery: a KVM host fails, something is checking for a heartbeat, and
> once that host is marked as offline, it checks the db for the
> instances running on that host and schedules them to start on the
> remaining hosts. Obviously this would require shared ephemeral storage
> (NFS) or limit recovery to boot-from-volume instances.
> 
> Am I off base?

Nope, and there have been lots of discussions about how best to
accomplish that.  Some think it should be built into Nova.  Others
suggest it should be handled by other parts of OpenStack (existing or
new components).  Others (including me) suggest this should be (and can
already be) handled by tools outside of OpenStack.
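
To make the flow described in the quoted message concrete, here is a
rough sketch in Python.  It is only an in-memory simulation: the Host
class, the function names, the timeout value and the "least-loaded
host" placement rule are all assumptions made for illustration, and
none of it calls a real OpenStack API.  An actual tool would ask Nova
to do the recovery (for example through its evacuate operation) and
would fence the failed host first.

```python
from dataclasses import dataclass, field


@dataclass
class Host:
    """Hypothetical stand-in for a compute host record."""
    name: str
    last_heartbeat: float              # seconds, same clock as `now`
    instances: list = field(default_factory=list)
    online: bool = True


def find_failed_hosts(hosts, now, timeout):
    """Mark hosts whose last heartbeat is older than `timeout` as offline."""
    failed = []
    for host in hosts:
        if host.online and now - host.last_heartbeat > timeout:
            host.online = False
            failed.append(host)
    return failed


def recover_instances(failed, hosts):
    """Reassign instances from failed hosts to the least-loaded online host.

    Assumes shared storage or boot-from-volume instances, so an instance
    can be started on any surviving host.
    """
    moves = []
    for dead in failed:
        for instance in list(dead.instances):
            survivors = [h for h in hosts if h.online]
            if not survivors:
                return moves           # nowhere left to recover to
            target = min(survivors, key=lambda h: len(h.instances))
            dead.instances.remove(instance)
            target.instances.append(instance)
            moves.append((instance, dead.name, target.name))
    return moves


# Example: kvm1 has not sent a heartbeat for 60 seconds.
hosts = [
    Host("kvm1", last_heartbeat=100.0, instances=["vm-a", "vm-b"]),
    Host("kvm2", last_heartbeat=158.0, instances=["vm-c"]),
    Host("kvm3", last_heartbeat=159.0),
]
failed = find_failed_hosts(hosts, now=160.0, timeout=30.0)
moves = recover_instances(failed, hosts)
```

In this example kvm1 is marked offline and its two instances are spread
across kvm2 and kvm3.  The sketch deliberately ignores the hard parts
(fencing, split-brain, scheduler constraints), which is where most of
the discussion referenced below is focused.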

Some references:

Latest thread from openstack-dev about this:


http://lists.openstack.org/pipermail/openstack-dev/2014-October/048338.html

A blog post of mine about how I think it could be done from outside of
OpenStack today:


http://blog.russellbryant.net/2014/10/15/openstack-instance-ha-proposal/

Unfortunately I haven't been able to follow that up with next steps of
prototyping and testing to prove it out in more detail.

-- 
Russell Bryant


