[Openstack] nova-compute won't restart (on some nodes) after Grizzly upgrade
Michael Still
mikal at stillhq.com
Thu Aug 8 01:02:20 UTC 2013
Jonathan,
This would be easier to debug with a nova-compute log. Are you willing
to post one somewhere that people could take a look at?
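A minimal sketch of one way to trim the log down to the interesting part before posting it. The helper name, the context sizes, and the log path in the usage comment are assumptions for illustration (the path is the usual Ubuntu package default, but check your own nova.conf):

```python
from collections import deque


def extract_critical(lines, before=20, after=40):
    """Return every line containing CRITICAL plus surrounding context.

    `before` lines preceding the match and `after` lines following it
    are kept; everything else is dropped.
    """
    context = deque(maxlen=before)  # rolling window of recent lines
    out = []
    remaining = 0  # trailing-context lines still to emit
    for line in lines:
        line = line.rstrip("\n")
        if "CRITICAL" in line:
            out.extend(context)
            context.clear()
            out.append(line)
            remaining = after
        elif remaining > 0:
            out.append(line)
            remaining -= 1
        else:
            context.append(line)
    return out


# Hypothetical usage (path is an assumption):
# with open("/var/log/nova/nova-compute.log") as f:
#     print("\n".join(extract_critical(f)))
```

Running the service once with debug logging turned on and then extracting the context around the CRITICAL line should give people enough to see where the KeyError is raised.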
Thanks,
Michael
On Thu, Aug 8, 2013 at 7:35 AM, Jonathan Proulx <jon at jonproulx.com> wrote:
> Hi All,
>
> Apologies to those who saw this on the operators list earlier, there is a
> bit of new info here & having gotten no response there thought I'd take it
> to a wider audience...
>
>
> I'm almost through my Grizzly upgrade. I'd upgraded everything except
> nova-compute before upgrading that (Ubuntu 12.04 Cloud Archive packages).
>
> On most nodes the nova-compute service upgraded and restarted properly, but
> on some it immediately exits with:
>
> CRITICAL nova [-] 'instance_type_memory_mb'
>
> It would seem like this is https://code.launchpad.net/bugs/1161022 but the
> fix for that was released in March and I've verified it is in the packaged
> version I'm using.
>
> The referenced bug involves the DB migration only updating non-deleted
> instances in the instance_system_metadata table, and the patch skips the
> lookups that are broken (and irrelevant) for deleted instances.
>
> Tracing the DB calls from the host shows it is trying to do lookups for
> instances that were deleted last October, which is a bit surprising as it's
> run thousands of instances since & it's not looking those up.
>
> It is noteworthy that this is around the time I upgraded from Essex ->
> Folsom, so it's possible their state is weirder than most, having run through
> that update.
>
> There were directories for the instances in question in
> /var/lib/nova/instances, so I thought "Aha!" and moved them, but on restart
> I still get the same failure and same DB query for the old instances. Where
> is nova getting the idea it should look these up & how can I stop it?
>
> I've gone so far as to generate instance_type_<foo> entries in the
> instance_system_metadata table for all instances ever on my deployment
> (about 500k), but I still only have the cryptic "CRITICAL nova [-]
> 'instance_type_memory_mb'" error and a failure to start, so clearly I'm
> chasing the wrong problem somehow.
>
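One way to double-check whether any instance tied to a failing host still lacks the key is a LEFT JOIN that looks for missing rows. The following is only a sketch: it runs against an in-memory SQLite database with a heavily simplified stand-in schema, and the table and column names (instances.uuid, instances.host, instances.deleted, instance_system_metadata.instance_uuid/key) are assumed to match the real deployment. The function names and sample rows are made up for illustration.

```python
import sqlite3

# Instances on a host that have no metadata row for the given key.
# Deleted instances are deliberately NOT filtered out, since those are
# the ones being looked up here. "key" is quoted for SQLite; a MySQL
# backend would want backticks instead.
QUERY = """
SELECT i.uuid, i.deleted
FROM instances AS i
LEFT JOIN instance_system_metadata AS m
       ON m.instance_uuid = i.uuid AND m."key" = ?
WHERE i.host = ? AND m.id IS NULL
ORDER BY i.uuid
"""


def instances_missing_key(conn, host, key="instance_type_memory_mb"):
    """Return (uuid, deleted) for instances on `host` lacking `key`."""
    return conn.execute(QUERY, (key, host)).fetchall()


def demo_connection():
    """Build a tiny stand-in database (not the real nova schema)."""
    conn = sqlite3.connect(":memory:")
    conn.executescript("""
        CREATE TABLE instances (
            uuid TEXT PRIMARY KEY, host TEXT, deleted INTEGER);
        CREATE TABLE instance_system_metadata (
            id INTEGER PRIMARY KEY, instance_uuid TEXT,
            "key" TEXT, value TEXT);
        INSERT INTO instances VALUES
            ('aaa', 'node1', 0), ('bbb', 'node1', 1), ('ccc', 'node2', 1);
        INSERT INTO instance_system_metadata (instance_uuid, "key", value)
            VALUES ('aaa', 'instance_type_memory_mb', '2048');
    """)
    return conn


if __name__ == "__main__":
    print(instances_missing_key(demo_connection(), "node1"))
```

If a query like this comes back empty for the failing hosts, the missing key is probably being read from somewhere other than the rows that were backfilled, which is another reason the full traceback matters.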
> Help?
> -Jon
>
--
Rackspace Australia