Scheduler sends VM to HV that lacks resources
    fsbiz at yahoo.com 
       
    Mon Nov 18 16:14:59 UTC 2019
    
    
  
 Thanks Matt for the excellent suggestions in this email and the prior one. I am currently working through them one by one and will update.
Yes, by forced host I do mean creating the server with an availability zone in the ZONE:NODE format. Yes, I understand the scheduler filters aren't run, but why should that be an issue?
For now, I am tracing all the logs from the PaaS layer all the way to the OpenStack nova placement API to see if there is anything unusual.
Thanks,
Fred.
    On Thursday, November 14, 2019, 10:07:15 AM PST, Matt Riedemann <mriedemos at gmail.com> wrote:  
 
 On 11/14/2019 10:09 AM, fsbiz at yahoo.com wrote:
> The requests coming in are "forced host" requests.  The PaaS layer 
> maintains
> an inventory of actual bare-metal available nodes and a user has to 
> explicitly select
> a baremetal node.  The PaaS layer then makes a nova api call for an 
> instance to be created
> on that specific baremetal node.
To be clear, by forced host you mean creating the server with an 
availability zone in the format ZONE:HOST:NODE or ZONE:NODE where NODE 
is the ironic node UUID, correct?
https://docs.openstack.org/nova/latest/admin/availability-zones.html#using-availability-zones-to-select-hosts
Yeah that's a problem because then the scheduler filters aren't run.
A potential alternative is to create the server using a 
hypervisor_hostname query hint that will run through the JsonFilter:
https://docs.openstack.org/nova/latest/admin/configuration/schedulers.html#jsonfilter
Then at least you're not forcing the node, and the scheduler filters are still run.
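To make the difference between the two approaches concrete, here is a minimal sketch of the two server-create request bodies. It only builds the JSON and does not call the API. The function names and the sample values are hypothetical, and it assumes JsonFilter is enabled in the scheduler's filter list and that the ironic node UUID is what the compute node reports as hypervisor_hostname.

```python
import json


def forced_node_body(name, image, flavor, zone, host, node):
    """Body that forces the node via the availability zone field
    (ZONE:HOST:NODE). The scheduler filters are not run for this request."""
    return {
        "server": {
            "name": name,
            "imageRef": image,
            "flavorRef": flavor,
            "availability_zone": f"{zone}:{host}:{node}",
        }
    }


def hinted_node_body(name, image, flavor, zone, node):
    """Body that asks for the node through a JsonFilter query hint.
    The node is not forced, so the scheduler filters still run."""
    # JsonFilter expects the query as a JSON-encoded string.
    query = json.dumps(["=", "$hypervisor_hostname", node])
    return {
        "server": {
            "name": name,
            "imageRef": image,
            "flavorRef": flavor,
            "availability_zone": zone,
        },
        "os:scheduler_hints": {"query": query},
    }


if __name__ == "__main__":
    # Placeholder values for illustration only.
    body = hinted_node_body("bm-1", "IMAGE_UUID", "FLAVOR_ID", "nova", "NODE_UUID")
    print(json.dumps(body, indent=2))
```

With the hint form, a request for a node that is already consumed should fail scheduling rather than being sent to that node, which is the point of not forcing it.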
I forget exactly how the scheduler code works in Queens with respect to 
forced hosts/nodes on server create but the scheduler still has to 
allocate resources in placement. It looks like we work around that in 
Queens by disabling the limit we place on getting allocation candidates 
from placement:
https://review.opendev.org/#/c/584616/
My guess is your PaaS layer has bugs in it since it's allowing users to 
select hosts that are already consumed, or it's just racy. Anyway, this 
is why nova uses placement since Pike for atomic consumption of 
resources during scheduling.
-- 
Thanks,
Matt
  