[Openstack-operators] Live Migration issues

David Medberry openstack at medberry.net
Tue Jun 30 19:42:51 UTC 2015


If we have more than one live-migration occurring at a time, some will
fail. Basically the concluding handshake won't complete. It's not clear if
this is a contention issue, and issue in Juno (which has some
well-documented messaging problems), or something else.

These issues occur when trying to use l-m to evac a node (of say 8 VMs).
There really isn't a load consideration, it seems to solely be a messaging
issue.

Workarounds: we were able to work-around this issue by doing a single
live-migration at a time and ensuring it completes (is no longer marked as
migrating) prior to initiating the next one. Resolved our issue completely
but has the side-effect of taking a very long time. We have not bisected
the problem to see if we can do more than that or do each migration for a
given node singly but actually do more than one node at once. These are
exercises we will undertake very soon.

On Tue, Jun 30, 2015 at 11:50 AM, Auld, Will <will.auld at intel.com> wrote:

>  We’d like to get a better handle on issues around live migration and I
> thought that all of you may be the most knowledgeable people.
>
>
>
> 1.      In your own work do you see issues with live migration? What
> issues and how critical are they?
>
> 2.      How frequently do these issues occur?
>
> 3.      What fixes or workarounds have you employed for these issues?
>
>
>
> Thanks,
>
>
>
> Will
>
> _______________________________________________
> OpenStack-operators mailing list
> OpenStack-operators at lists.openstack.org
> http://lists.openstack.org/cgi-bin/mailman/listinfo/openstack-operators
>
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.openstack.org/pipermail/openstack-operators/attachments/20150630/8f700027/attachment.html>


More information about the OpenStack-operators mailing list