[Openstack] strange problem when reboot nova-compute node: domain not found: no domain with matching name

Razique Mahroua razique.mahroua at gmail.com
Tue Sep 18 12:16:23 UTC 2012


Interesting...
Do you have an NFS share between several nodes?
Does the nova user have the rights to write to all the directories within /var/lib/nova/instances?
It may be a permissions issue, but I am not sure.
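
A rough way to check the permissions question above is sketched here. It only looks at ownership and the owner write bit, so it is a proxy and not a full access test (it ignores group permissions and NFS options such as root squashing). The function name list_suspect_dirs is made up for this example; the path and the nova user are the ones mentioned above.

```shell
#!/bin/sh
# List directories under <root> that are either not owned by the given
# user (name or numeric uid) or lack the owner write bit.
# Prints nothing when every directory passes both checks.
# Usage: list_suspect_dirs <root> <user-or-uid>
list_suspect_dirs() {
    root=$1
    owner=$2
    find "$root" -type d \( ! -user "$owner" -o ! -perm -u+w \) -print
}

# Example (assumes the default instances path and a "nova" user):
#   list_suspect_dirs /var/lib/nova/instances nova
```

Any directory it prints is worth a closer look with ls -ld before changing anything.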

Nuage & Co - Razique Mahroua 
razique.mahroua at gmail.com



On 18 Sep 2012, at 11:26, romizhang1968 <romizhang1968 at 163.com> wrote:

> Additional information:
> When I boot a new instance with a CentOS image, the system shows:
> 
> libvir: QEMU error : Domain not found: no domain with matching name 'instance-00000019'
> .......
> -4a73-b147-0c4f02191d2c] Ignoring error injecting data into image 9f595093-d52f-4b0f-8ff8-19ed98fc09f4 (Unexpected error while running command.
> Command: sudo kpartx -d /dev/nbd15
> Exit code: 1
> Stdout: ''
> Stderr: 'device-mapper: remove ioctl failed: Device or resource busy\n')
> 2012-09-18 17:21:56 INFO nova.virt.libvirt.connection [-] [instance: 25b2e627-f37b-4a73-b147-0c4f02191d2c] Instance spawned successfully.
> 
> At 2012-09-17 21:18:27,"Razique Mahroua" <razique.mahroua at gmail.com> wrote:
> Hey Romi,
> It looks like the image is not fully spawned, meaning it does not get added to virsh during spawning. Look at /var/log/nova/nova-compute.log and check for errors.
> Can you send the file?
> 
> Nuage & Co - Razique Mahroua 
> razique.mahroua at gmail.com
> 
> On 17 Sep 2012, at 14:56, romi zhang <romizhang1968 at 163.com> wrote:
> 
>> Razique,
>>  
>> Additional information:
>> My other test system has not gone through the full set of tests. I ran into this problem before on my small system. If I use an Ubuntu image downloaded from the network, the problem does not occur. If I create an instance from a CentOS or Windows image, the problem does occur: while the CentOS or Windows instance is being built, nova-compute always reports: libvirt qemu error, domain not found. In the end the instance runs normally, but if I then reboot the nova-compute host and start nova-compute, it stops at: CRITICAL error: libvirt qemu error, domain not found,…..
>>  
>> Romi
>>  
>> From: romi zhang [mailto:romizhang1968 at 163.com] 
>> Sent: Monday, September 17, 2012 8:48 PM
>> To: 'Razique Mahroua'
>> Cc: 'Ritesh Nanda'; 'openstack at lists.launchpad.net'
>> Subject: RE: [Openstack] strange problem when reboot nova-compute node: domain not found: no domain with matching name
>>  
>> Razique,
>>  
>> Thank you for your attention.
>>  
>> # ls /etc/libvirt/qemu/networks/autostart
>> default.xml
>>  
>> As far as I remember, right after I first completed the system installation, rebooting a compute node caused no problem. Then I created CentOS and Windows instances (during boot the system showed: libvirt qemu error: domain not found) and rebooted one of the compute nodes. nova-compute stopped and showed: CRITICAL error, Libvirt: QEMU error: Domain not found: no domain with matching name ‘instance-0000000a’. The other nodes then had the same problem.
>>  
>> On my side, the behavior is systematic.
>>  
>> But on my other test system, this problem has not happened.
>>  
>> Regards,
>>  
>> Romi
>>  
>> From: Razique Mahroua [mailto:razique.mahroua at gmail.com] 
>> Sent: Sunday, September 16, 2012 5:04 PM
>> To: romi zhang
>> Cc: 'Ritesh Nanda'; openstack at lists.launchpad.net
>> Subject: Re: [Openstack] strange problem when reboot nova-compute node: domain not found: no domain with matching name
>>  
>> Hey, 
>> could you remove the flag:
>> --resume_guests_state_on_host_boot=true
>> and restart the nova-compute service?
>>  
>> Can you run
>> $ ls  /etc/libvirt/qemu/autostart 
>>  
>> Is that behaviour systematic? Does it happen every time you restart the nova-compute service, or do you also need to restart your compute node?
>>  
>>  
>> Nuage & Co - Razique Mahroua 
>> razique.mahroua at gmail.com
>>  
>> On 15 Sep 2012, at 12:00, romi zhang <romizhang1968 at 163.com> wrote:
>>  
>> Razique,
>>  
>> Thank you very much!
>>  
>> Here is the paste:
>> http://paste.openstack.org/show/20938/
>>  
>> This config was fine on the first test system.
>>  
>> The current problem occurred on my second test system, where I had previously created a raw format image and converted it to qcow2.
>>  
>> I am going to recreate the image directly in qcow2 format; that is in progress now,……
>>  
>> Is there anything I should take care of with the image? It looks like it is related to the glance image, I guess.
>>  
>> Romi
>>  
>> From: Razique Mahroua [mailto:razique.mahroua at gmail.com] 
>> Sent: Saturday, September 15, 2012 5:43 PM
>> To: romi zhang
>> Cc: 'Ritesh Nanda'; openstack at lists.launchpad.net
>> Subject: Re: [Openstack] strange problem when reboot nova-compute node: domain not found: no domain with matching name
>>  
>> Hey, I just wanted to see if the domain was still declared in the virsh database.
>> If needed, you can also go to your instances directory and run $ virsh define $instance-name/domain.xml
>> Can you post your nova.conf please?
>>  
>> Nuage & Co - Razique Mahroua 
>> razique.mahroua at gmail.com
>>  
>> On 15 Sep 2012, at 08:26, romi zhang <romizhang1968 at 163.com> wrote:
>> 
>> 
>> 
>> Hi Razique,
>>  
>> Is there anything that needs a deeper look?
>> The virsh list output looks fine, the same as the instances I created through nova boot.
>>  
>> Romi
>>  
>> From: Razique Mahroua [mailto:razique.mahroua at gmail.com] 
>> Sent: Friday, September 14, 2012 4:10 AM
>> To: Ritesh Nanda
>> Cc: romi zhang; openstack at lists.launchpad.net
>> Subject: Re: [Openstack] strange problem when reboot nova-compute node: domain not found: no domain with matching name
>>  
>> Hi Romi, 
>> What does $ virsh list --all give you?
>> Nuage & Co - Razique Mahroua 
>> razique.mahroua at gmail.com
>>  
>> On 13 Sep 2012, at 21:11, Ritesh Nanda <riteshnanda09 at gmail.com> wrote:
>> 
>> 
>> 
>> 
>> Hello romi,
>> 
>>     The image got into a stuck state. The only solution is to hack your database for this particular instance and mark it as deleted, then restart the nova-* services; it should start working.
>> 
>> On Thu, Sep 13, 2012 at 9:45 PM, romi zhang <romizhang1968 at 163.com> wrote:
>> Hi,
>>  
>> I’ve installed Essex. When I reboot one of the nova-compute nodes and then start the nova-compute service again, the system reports:
>> Libvirt: QEMU error: Domain not found: no domain with matching name ‘instance-0000000a’
>>  
>> Then I found:  
>> #ls /var/lib/nova/instances
>> _base instance-0000000a instance-0000000b instance-0000000c
>> #ls /etc/libvirt/qemu
>> instance-0000000b.xml instance-0000000c.xml Network
>>  
>> So:
>> /var/lib/nova/instances/instance-0000000a# virsh define libvirt.xml
>> Domain instance-0000000a defined from libvirt.xml
>> And:
>> #ls /etc/libvirt/qemu
>> instance-0000000a.xml instance-0000000b.xml instance-0000000c.xml network
>>  
>> Then I restarted nova-compute (nova-compute &), and it started normally.
>> I do not understand why, after the compute node is rebooted, the first instance’s XML file is usually deleted and nova-compute terminates.
>>  
>> By the way, when I create a new instance, the first line in the log is also: Libvirt: QEMU error: Domain not found: no domain with matching name ‘instance-0000000a’
>>  
>> Has anyone run into this problem, and does anyone know why it happens?
>>  
>> Regards,
>>  
>> romi
>> 
>> _______________________________________________
>> Mailing list: https://launchpad.net/~openstack
>> Post to     : openstack at lists.launchpad.net
>> Unsubscribe : https://launchpad.net/~openstack
>> More help   : https://help.launchpad.net/ListHelp
>> 
>> 
>> 
>> 
>> --
>>  With Regards 
>>  Ritesh Nanda
>>  
>>  
> 
> 
> 
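
The manual comparison in the original report quoted above (ls /var/lib/nova/instances against ls /etc/libvirt/qemu, then virsh define on libvirt.xml) can be sketched as a small helper. This is only an illustration under the assumption that each instance directory is named instance-* and that a defined domain has a matching <name>.xml file, as in the listings quoted above. The function name list_undefined_instances is made up for this example.

```shell
#!/bin/sh
# Print the names of instance directories that have no matching
# <name>.xml domain definition in the libvirt qemu directory.
# Usage: list_undefined_instances <instances_dir> <libvirt_qemu_dir>
list_undefined_instances() {
    instances_dir=$1
    qemu_dir=$2
    for dir in "$instances_dir"/instance-*; do
        # Skip an unexpanded glob and anything that is not a directory.
        [ -d "$dir" ] || continue
        name=$(basename "$dir")
        [ -f "$qemu_dir/$name.xml" ] || printf '%s\n' "$name"
    done
}

# Example: print the define commands instead of running them.
#   list_undefined_instances /var/lib/nova/instances /etc/libvirt/qemu |
#   while read -r name; do
#       echo "virsh define /var/lib/nova/instances/$name/libvirt.xml"
#   done
```

Printing the commands first leaves the decision to run virsh define with the operator, since this does not explain why the definition went missing.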
