[Openstack-operators] [Essex] compute node hard reboot, can't create domain.

Lorin Hochstein lorin at nimbisservices.com
Mon Jul 8 02:03:08 UTC 2013


Hi Samuel:

It sounds like your VMs are configured to plug into a Linux bridge that
doesn't exist on compute-01 anymore. You could create it manually, although
I would expect that it would have been created automatically by the
relevant nova service when they came back up.

You can check if the bridge is there by doing "ip a" and looking for the
"br14" network device.

Are you running networking in multihost mode? If so, I think restarting the
nova-network service on compute-01 should do it. If you aren't running in
multihost mode, then it should come back by restarting the nova-compute
service on compute-01.

Otherwise, you'll need to create the bridge manually, and how you do that
will depend on whether you're running flat or vlan. If it was called br14,
I'm assuming you're running in vlan mode with vlan tag 14 associated with
this project?

Lorin


On Sun, Jul 7, 2013 at 9:21 PM, Samuel Winchenbach <swinchen at gmail.com>wrote:

> Hi All,
>
> I have an old Essex cluster that we are getting ready to phase out for
> grizzly.  Unfortunately over the weekend one of the compute nodes powered
> off (power supply failure it looks like).  When I tried a "nova reboot
> <UUID>"
>
> I got:
>
> 2013-07-07 21:17:34 ERROR nova.rpc.amqp
> [req-d2ea5f46-9dc2-4788-9951-07d985a1f8dc 6986639ba3c84ab5b05fdd2e122101f0
> 3806a811d2d34542bdfc5d7f31ce7b89] Exception during message handling
> 2013-07-07 21:17:34 TRACE nova.rpc.amqp Traceback (most recent call last):
> 2013-07-07 21:17:34 TRACE nova.rpc.amqp   File
> "/usr/lib/python2.7/dist-packages/nova/rpc/amqp.py", line 253, in
> _process_data
> 2013-07-07 21:17:34 TRACE nova.rpc.amqp     rval = node_func(context=ctxt,
> **node_args)
> 2013-07-07 21:17:34 TRACE nova.rpc.amqp   File
> "/usr/lib/python2.7/dist-packages/nova/exception.py", line 114, in wrapped
> 2013-07-07 21:17:34 TRACE nova.rpc.amqp     return f(*args, **kw)
> 2013-07-07 21:17:34 TRACE nova.rpc.amqp   File
> "/usr/lib/python2.7/dist-packages/nova/compute/manager.py", line 159, in
> decorated_function
> 2013-07-07 21:17:34 TRACE nova.rpc.amqp     function(self, context,
> instance_uuid, *args, **kwargs)
> 2013-07-07 21:17:34 TRACE nova.rpc.amqp   File
> "/usr/lib/python2.7/dist-packages/nova/compute/manager.py", line 183, in
> decorated_function
> 2013-07-07 21:17:34 TRACE nova.rpc.amqp     sys.exc_info())
> 2013-07-07 21:17:34 TRACE nova.rpc.amqp   File
> "/usr/lib/python2.7/contextlib.py", line 24, in __exit__
> 2013-07-07 21:17:34 TRACE nova.rpc.amqp     self.gen.next()
> 2013-07-07 21:17:34 TRACE nova.rpc.amqp   File
> "/usr/lib/python2.7/dist-packages/nova/compute/manager.py", line 177, in
> decorated_function
> 2013-07-07 21:17:34 TRACE nova.rpc.amqp     return function(self, context,
> instance_uuid, *args, **kwargs)
> 2013-07-07 21:17:34 TRACE nova.rpc.amqp   File
> "/usr/lib/python2.7/dist-packages/nova/compute/manager.py", line 904, in
> reboot_instance
> 2013-07-07 21:17:34 TRACE nova.rpc.amqp     reboot_type)
> 2013-07-07 21:17:34 TRACE nova.rpc.amqp   File
> "/usr/lib/python2.7/dist-packages/nova/exception.py", line 114, in wrapped
> 2013-07-07 21:17:34 TRACE nova.rpc.amqp     return f(*args, **kw)
> 2013-07-07 21:17:34 TRACE nova.rpc.amqp   File
> "/usr/lib/python2.7/dist-packages/nova/virt/libvirt/connection.py", line
> 721, in reboot
> 2013-07-07 21:17:34 TRACE nova.rpc.amqp     if self._soft_reboot(instance):
> 2013-07-07 21:17:34 TRACE nova.rpc.amqp   File
> "/usr/lib/python2.7/dist-packages/nova/virt/libvirt/connection.py", line
> 757, in _soft_reboot
> 2013-07-07 21:17:34 TRACE nova.rpc.amqp     dom.create()
> 2013-07-07 21:17:34 TRACE nova.rpc.amqp   File
> "/usr/lib/python2.7/dist-packages/libvirt.py", line 551, in create
> 2013-07-07 21:17:34 TRACE nova.rpc.amqp     if ret == -1: raise
> libvirtError ('virDomainCreate() failed', dom=self)
> 2013-07-07 21:17:34 TRACE nova.rpc.amqp libvirtError: Cannot get interface
> MTU on 'br14': No such device
> 2013-07-07 21:17:34 TRACE nova.rpc.amqp
>
>
> So I tried starting it manually:
>
> root at compute-01:/etc/libvirt/qemu# virsh create instance-00000035.xml
> error: Failed to create domain from instance-00000035.xml
> error: Cannot get interface MTU on 'br14': No such device
>
>
> Any idea what I might be doing wrong?  All the services show :-) with
> nova-manage
>
>
> Thanks for your help...
>
> Sam
>
> _______________________________________________
> OpenStack-operators mailing list
> OpenStack-operators at lists.openstack.org
> http://lists.openstack.org/cgi-bin/mailman/listinfo/openstack-operators
>
>


-- 
Lorin Hochstein
Lead Architect - Cloud Services
Nimbis Services, Inc.
www.nimbisservices.com
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.openstack.org/pipermail/openstack-operators/attachments/20130707/aa7870ef/attachment.html>


More information about the OpenStack-operators mailing list