[Openstack-operators] [Essex] compute node hard reboot, can't create domain.

Lorin Hochstein lorin at nimbisservices.com
Mon Jul 8 13:03:57 UTC 2013


I think Razique's advice would have work if you were running multihost
(with multihost, nova-network runs on each compute host).

The nova-compute service *should* be recreating those bridges on startup
when not running in multihost. The issue you're running into may very well
have been an essex  bug.

Lorin

On Mon, Jul 8, 2013 at 7:26 AM, Samuel Winchenbach <swinchen at gmail.com>wrote:

> Hi Razique,
>
> That did not seem to work.  I had to manually recreate the vlan, add it to
> the bridge and add an IP.
>
> Thanks!
> Sam
>
>
> On Mon, Jul 8, 2013 at 3:39 AM, Razique Mahroua <razique.mahroua at gmail.com
> > wrote:
>
>> Hey Samuel,
>> run :
>> brctl addbr br14
>> and
>> service nova-network restart
>>
>> and voila :)
>>
>>
>> *Razique Mahroua** - **Nuage & Co*
>> razique.mahroua at gmail.com
>> Tel : +33 9 72 37 94 15
>>
>>
>> Le 8 juil. 2013 à 03:21, Samuel Winchenbach <swinchen at gmail.com> a écrit
>> :
>>
>> Hi All,
>>
>> I have an old Essex cluster that we are getting ready to phase out for
>> grizzly.  Unfortunately over the weekend one of the compute nodes powered
>> off (power supply failure it looks like).  When I tried a "nova reboot
>> <UUID>"
>>
>> I got:
>>
>> 2013-07-07 21:17:34 ERROR nova.rpc.amqp
>> [req-d2ea5f46-9dc2-4788-9951-07d985a1f8dc 6986639ba3c84ab5b05fdd2e122101f0
>> 3806a811d2d34542bdfc5d7f31ce7b89] Exception during message handling
>> 2013-07-07 21:17:34 TRACE nova.rpc.amqp Traceback (most recent call last):
>> 2013-07-07 21:17:34 TRACE nova.rpc.amqp   File
>> "/usr/lib/python2.7/dist-packages/nova/rpc/amqp.py", line 253, in
>> _process_data
>> 2013-07-07 21:17:34 TRACE nova.rpc.amqp     rval =
>> node_func(context=ctxt, **node_args)
>> 2013-07-07 21:17:34 TRACE nova.rpc.amqp   File
>> "/usr/lib/python2.7/dist-packages/nova/exception.py", line 114, in wrapped
>> 2013-07-07 21:17:34 TRACE nova.rpc.amqp     return f(*args, **kw)
>> 2013-07-07 21:17:34 TRACE nova.rpc.amqp   File
>> "/usr/lib/python2.7/dist-packages/nova/compute/manager.py", line 159, in
>> decorated_function
>> 2013-07-07 21:17:34 TRACE nova.rpc.amqp     function(self, context,
>> instance_uuid, *args, **kwargs)
>> 2013-07-07 21:17:34 TRACE nova.rpc.amqp   File
>> "/usr/lib/python2.7/dist-packages/nova/compute/manager.py", line 183, in
>> decorated_function
>> 2013-07-07 21:17:34 TRACE nova.rpc.amqp     sys.exc_info())
>> 2013-07-07 21:17:34 TRACE nova.rpc.amqp   File
>> "/usr/lib/python2.7/contextlib.py", line 24, in __exit__
>> 2013-07-07 21:17:34 TRACE nova.rpc.amqp     self.gen.next()
>> 2013-07-07 21:17:34 TRACE nova.rpc.amqp   File
>> "/usr/lib/python2.7/dist-packages/nova/compute/manager.py", line 177, in
>> decorated_function
>> 2013-07-07 21:17:34 TRACE nova.rpc.amqp     return function(self,
>> context, instance_uuid, *args, **kwargs)
>> 2013-07-07 21:17:34 TRACE nova.rpc.amqp   File
>> "/usr/lib/python2.7/dist-packages/nova/compute/manager.py", line 904, in
>> reboot_instance
>> 2013-07-07 21:17:34 TRACE nova.rpc.amqp     reboot_type)
>> 2013-07-07 21:17:34 TRACE nova.rpc.amqp   File
>> "/usr/lib/python2.7/dist-packages/nova/exception.py", line 114, in wrapped
>> 2013-07-07 21:17:34 TRACE nova.rpc.amqp     return f(*args, **kw)
>> 2013-07-07 21:17:34 TRACE nova.rpc.amqp   File
>> "/usr/lib/python2.7/dist-packages/nova/virt/libvirt/connection.py", line
>> 721, in reboot
>> 2013-07-07 21:17:34 TRACE nova.rpc.amqp     if
>> self._soft_reboot(instance):
>> 2013-07-07 21:17:34 TRACE nova.rpc.amqp   File
>> "/usr/lib/python2.7/dist-packages/nova/virt/libvirt/connection.py", line
>> 757, in _soft_reboot
>> 2013-07-07 21:17:34 TRACE nova.rpc.amqp     dom.create()
>> 2013-07-07 21:17:34 TRACE nova.rpc.amqp   File
>> "/usr/lib/python2.7/dist-packages/libvirt.py", line 551, in create
>> 2013-07-07 21:17:34 TRACE nova.rpc.amqp     if ret == -1: raise
>> libvirtError ('virDomainCreate() failed', dom=self)
>> 2013-07-07 21:17:34 TRACE nova.rpc.amqp libvirtError: Cannot get
>> interface MTU on 'br14': No such device
>> 2013-07-07 21:17:34 TRACE nova.rpc.amqp
>>
>>
>> So I tried starting it manually:
>>
>> root at compute-01:/etc/libvirt/qemu# virsh create instance-00000035.xml
>> error: Failed to create domain from instance-00000035.xml
>> error: Cannot get interface MTU on 'br14': No such device
>>
>>
>> Any idea what I might be doing wrong?  All the services show :-) with
>> nova-manage
>>
>>
>> Thanks for your help...
>>
>> Sam
>> _______________________________________________
>> OpenStack-operators mailing list
>> OpenStack-operators at lists.openstack.org
>> http://lists.openstack.org/cgi-bin/mailman/listinfo/openstack-operators
>>
>>
>>
>
> _______________________________________________
> OpenStack-operators mailing list
> OpenStack-operators at lists.openstack.org
> http://lists.openstack.org/cgi-bin/mailman/listinfo/openstack-operators
>
>


-- 
Lorin Hochstein
Lead Architect - Cloud Services
Nimbis Services, Inc.
www.nimbisservices.com
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.openstack.org/pipermail/openstack-operators/attachments/20130708/1f3b4ae0/attachment.html>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: NUAGECO-LOGO-Fblan_petit.jpg
Type: image/jpeg
Size: 10122 bytes
Desc: not available
URL: <http://lists.openstack.org/pipermail/openstack-operators/attachments/20130708/1f3b4ae0/attachment.jpg>


More information about the OpenStack-operators mailing list