Ok Jay, let me to describe my environment. I have an openstack made up of 3 controllers nodes ad several compute nodes. The controller nodes services are controlled by pacemaker and the compute nodes services are controlled by remote pacemaker. My hardware is Dell so I am using ipmi fencing device . I wrote a service controlled by pacemaker: this service controls if a compude node fails and for avoiding split brains if a compute node does nod respond on the management network and on storage network the stonith poweroff the node and then execute a nova host-evacuate. Anycase to have a simulation before writing the service I described above you can do as follows: connect on one compute node where some virtual machines are running run the command: echo 'c' > /proc/sysrq-trigger (it stops immediately the node like in case of failure) On a controller node run: nova host-evacuate "name of failed compute node" Instances running on the failed compute node should be restarted on another compute node Ignazio Il giorno gio 11 lug 2019 alle ore 11:57 Jay See <jayachander.it@gmail.com> ha scritto:
Hi ,
I have tried on a failed compute node which is in power off state now. I have tried on a running compute node, no errors. But nothing happens. On running compute node - Disabled the compute service and tried migration also.
May be I might have not followed proper steps. Just wanted to know the steps you have followed. Otherwise, I was planning to manual migration also if possible. ~Jay.
On Thu, Jul 11, 2019 at 11:52 AM Ignazio Cassano <ignaziocassano@gmail.com> wrote:
Hi Jay, would you like to evacuate a failed compute node or evacuate a running compute node ?
Ignazio
Il giorno gio 11 lug 2019 alle ore 11:48 Jay See < jayachander.it@gmail.com> ha scritto:
Hi Ignazio,
I am trying to evacuate the compute host on older version (mitaka). Could please share the process you followed. I am not able to succeed with openstack live-migration fails with error message (this is known issue in older versions) and nova live-ligration - nothing happens even after initiating VM migration. It is almost 4 days.
~Jay.
On Thu, Jul 11, 2019 at 11:31 AM Ignazio Cassano < ignaziocassano@gmail.com> wrote:
I am sorry. For simulating an host crash I used a wrong procedure. Using "echo 'c' > /proc/sysrq-trigger" all work fine
Il giorno gio 11 lug 2019 alle ore 11:01 Ignazio Cassano < ignaziocassano@gmail.com> ha scritto:
Hello All, on ocata when I poweroff a node with active instance , doing a nova host-evacuate works fine and instances are restartd on an active node. On queens it does non evacuate instances but nova-api reports for each instance the following:
2019-07-11 10:19:54.745 13811 INFO nova.api.openstack.wsgi [req-daad0a7d-87ce-41bf-b096-a70fc306db5c 0c7a2d6006614fe2b3e81e47377dd2a9 c26f8d35f85547c4add392a221af1aab - default default] HTTP exception thrown: Cannot 'evacuate' instance e8485a5e-3623-4184-bcce-cafd56fa60b3 while it is in task_state powering-off
So it poweroff all instance on the failed node but does not start them on active nodes
What is changed ? Ignazio
-- P *SAVE PAPER – Please do not print this e-mail unless absolutely necessary.*
-- P *SAVE PAPER – Please do not print this e-mail unless absolutely necessary.*