[openstack-dev] deliver the vm-level HA to improve the business continuity with openstack

Jiangying (Jenny) jenny.jiangying at huawei.com
Mon Apr 14 13:45:53 UTC 2014


Pacemaker provides high availability for the OpenStack infrastructure.
We'd like to deliver VM-level HA to improve business continuity with OpenStack.

Besides host failure, our HA mechanism can detect and report host isolation, network partition, and an HA agent going down.

An HA agent is placed on every node of the system. A master agent is elected automatically at system startup, and all other agents are treated as slaves.
For host state detection, the HA agents communicate through the storage subsystem as well as over the management network. Having multiple communication paths allows a better assessment of host health. On host failure, the master agent selects candidate hosts and calls the slave HA agents. The slave HA agents then ask nova-compute to restart the virtual machines. When an HA agent goes down, the master agent reports it to the administrator. During a network partition or host isolation event, the HA mechanism does not interrupt the virtual machines; it simply keeps them running.
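
To make the reactions above concrete, here is a minimal Python sketch. The reactions themselves follow the description above, but the mapping from heartbeat observations to host states is only an assumption for illustration (the proposal does not fix it), and every name here (`HostState`, `classify_host`, `react`, the `host_reachable` probe) is hypothetical.

```python
from enum import Enum


class HostState(Enum):
    HEALTHY = "healthy"
    FAILED = "failed"
    ISOLATED_OR_PARTITIONED = "isolated_or_partitioned"
    AGENT_DOWN = "agent_down"


def classify_host(network_heartbeat, storage_heartbeat, host_reachable):
    """Classify a host from the master's point of view.

    Assumed rules (not taken from the proposal):
    - a heartbeat over the management network means the agent is alive;
    - a heartbeat only through storage means the host is up but cut off
      from the master (isolation or partition);
    - no heartbeat on either path but the host still answers a probe
      means only the agent is down;
    - nothing at all means the host has failed.
    """
    if network_heartbeat:
        return HostState.HEALTHY
    if storage_heartbeat:
        return HostState.ISOLATED_OR_PARTITIONED
    if host_reachable:
        return HostState.AGENT_DOWN
    return HostState.FAILED


def react(state):
    """Return the master's reaction as described in the proposal."""
    if state is HostState.FAILED:
        return "restart_vms_on_candidate_hosts"
    if state is HostState.AGENT_DOWN:
        return "report_to_administrator"
    if state is HostState.ISOLATED_OR_PARTITIONED:
        return "keep_vms_running"
    return "no_action"
```

The point of using the storage path as a second channel is visible in `classify_host`: without it, an isolated host and a failed host would look the same to the master, and running virtual machines might be restarted elsewhere unnecessarily.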

For virtual machine detection, the HA agents rely on nova-compute for information about the virtual machines.
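
As a rough sketch of what an agent could do with that information, the Python snippet below compares the virtual machines that are expected to be running with the state reported for them. The record layout (`name`, `host`, `status`) is a simplified, hypothetical stand-in for whatever nova-compute reports, and `find_failed_vms` is an illustrative name, not an existing API.

```python
def find_failed_vms(vm_records, expected_running):
    """Return names of VMs that should be running but are not ACTIVE.

    vm_records: list of dicts with hypothetical keys 'name', 'host', 'status'.
    expected_running: set of VM names the administrator expects to be up.
    A VM that is expected but missing from the report is also flagged.
    """
    status_by_name = {vm["name"]: vm["status"] for vm in vm_records}
    failed = []
    for name in sorted(expected_running):
        # Missing VMs have no status at all, so .get() returns None.
        if status_by_name.get(name) != "ACTIVE":
            failed.append(name)
    return failed


# Example with made-up records:
records = [
    {"name": "vm-a", "host": "node1", "status": "ACTIVE"},
    {"name": "vm-b", "host": "node1", "status": "ERROR"},
]
print(find_failed_vms(records, {"vm-a", "vm-b", "vm-c"}))
```

VMs that were stopped on purpose are left alone here because they are simply not listed in `expected_running`; how that set is maintained is outside the scope of this sketch.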

Please let me know your comments on this.

Regards,
Jenny