<p dir="ltr"></p>
<p dir="ltr">Damn, there are multiple discussions on scheduler split while I'm on vacation, thus I can't correctly respond the best I would...</p>
<p dir="ltr">Le 24 oct. 2014 03:11, "Jay Pipes" <<a href="mailto:jaypipes@gmail.com">jaypipes@gmail.com</a>> a écrit :<br>
><br>
> On 10/23/2014 07:57 PM, Elzur, Uri wrote:<br>
>><br>
>> Today, OpenStack makes placement decision mainly based on Compute<br>
>> demands (Scheduler is part of Nova). It also uses some info provided<br>
>> about platform’s Compute capabilities. But for a given application<br>
>> (consists of some VMs, some Network appliances, some storage etc),<br>
>> Nova/Scheduler has no way to figure out relative placement of Network<br>
>> devices (virtual appliances, SFC) and/or Storage devices (which is also<br>
>> network born in many cases) in reference to the Compute elements. This<br>
>> makes it harder to provide SLA, support certain policies (e.g. HA or<br>
>> keeping all of these elements within a physical boundary of your choice,<br>
>> or within a given network physical boundary and guarantee storage<br>
>> proximity, for example. It also makes it harder to optimize resource<br>
>> utilization level, which increases the cost and may cause Openstack to<br>
>> be less competitive on TCO.<br>
>><br>
>> Another aspect of the issue, is that in order, to lower the cost per<br>
>> unit of compute (or said better per unit of Application), it is<br>
>> essential to pack tighter. This increases infrastructure utilization but<br>
>> also makes interference a more important phenomenon (aka Nosy neighbor).<br>
>> SLA requests, SLA guarantees and placement based on ability to provide<br>
>> desired SLA are required.<br>
>><br>
>> We’d like to suggest moving a bit faster on making OpenStack a more<br>
>> compelling stack for Compute/Network/Storage, capable of supporting<br>
>> Telco/NFV and other usage models, and creating the foundation for<br>
>> providing very low cost platform, more competitive with large cloud<br>
>> deployment.<br>
><br>
><br>
> How do you suggest moving faster?<br>
><br>
> Also, when you say things like "more competitive with large cloud deployment" you need to tell us what you are comparing OpenStack to, and what cost factors you are using. Otherwise, it's just a statement with no context.<br>
><br>
><br>
>> The concern is that any scheduler change will take long time. Folks<br>
>> closer to the Scheduler work, have already pointed out we first need to<br>
>> stabilize the API between Nova and the Scheduler, before we can talk<br>
>> about a split (e.g. Gantt). So it may take till late in 2016 (best<br>
>> case?), to get this kind of broader Application level functionality in<br>
>> the OpenStack scheduler .<br>
><br>
><br>
> I'm not entirely sure where late in 2016 comes from? Could you elaborate?<br>
><br>
><br>
>> We’d like to bring it up in the coming design summit. Where do you think<br>
>> it needs to be discussed: cross project tack? Scheduler discussion? Other?<br>
>><br>
>> I’ve just added a proposed item 17.1 to the<br>
>> <a href="https://etherpad.openstack.org/p/kilo-crossproject-summit-topics">https://etherpad.openstack.org/p/kilo-crossproject-summit-topics</a><br>
>><br>
>> 1.<br>
>><br>
>> 2.“present Application’s Network and Storage requirements, coupled with<br>
>> infrastructure capabilities and status (e.g. up/dn<br>
><br>
><br>
> This is the kind of thing that was nixed as an idea last go around with the "nic-state-aware-scheduler":<br>
><br>
> <a href="https://review.openstack.org/#/c/87978/">https://review.openstack.org/#/c/87978/</a><br>
><br>
> You are coupling service state monitoring with placement decisions, and by doing so, you will limit the scale of the system considerably. We need improvements to our service state monitoring, for sure, including the ability to have much more fine-grained definition of what a service is. But I am 100% against adding the concept of checking service state *during* placement decisions.<br>
><br>
> Service state monitoring (it's called the servicegroup API in Nova) can and should notify the scheduler of important changes to the state of resource providers, but I'm opposed to making changes to the scheduler that would essentially make a placement decision and then immediately go and check a link for UP/DOWN state before "finalizing" the claim of resources on the resource provider.<br>
><br>
><br>
>> , utilization levels) and placement policy (e.g. proximity, HA)<br>
><br>
><br>
> I understand proximity (affinity/anti-affinity), but what does HA have to do with placement policy? Could you elaborate a bit more on that?<br>
><br>
><br>
> > to get optimized placement<br>
>><br>
>> decisions accounting for all application elements (VMs, virt Network<br>
>> appliances, Storage) vs. Compute only”<br>
><br>
><br>
> Yep. These are all simply inputs to the scheduler's placement decision engine. We need:<br>
><br>
> a) A way of providing these inputs to the launch request without polluting a cloud user's view of the cloud -- remember we do NOT want users of the Nova API to essentially need to understand the exact layout of the cloud provider's datacenter. That's definitively anti-cloudy :) So, we need a way of providing generic inputs to the scheduler that the scheduler can translate into specific inputs because the scheduler would know the layout of the datacenter...<br>
><br>
> b) Simple condition engine that would be able to understand the inputs (requested proximity to a storage cluster used by applications running in the instance, for example) with information the scheduler can query for about the topology of the datacenter's network and storage.<br>
><br>
> Work on b) involves the following foundational blueprints:<br>
><br>
> <a href="https://review.openstack.org/#/c/127609/">https://review.openstack.org/#/c/127609/</a><br>
> <a href="https://review.openstack.org/#/c/127610/">https://review.openstack.org/#/c/127610/</a><br>
> <a href="https://review.openstack.org/#/c/127612/">https://review.openstack.org/#/c/127612/</a><br>
><br>
> Looking forward to a cross-project or Nova session at the summit. Either would work for me, and I encourage you and others to response to this ML thread with thoughts about the scheduler and resource tracker space so I may summarize the thoughts on an etherpad before the summit.<br>
><br>
> All the best,<br>
> -jay<br>
><br>
><br></p>
<p dir="ltr">I just saw that Scheduler and Resource Tracker got 2 slots for discussion at the Design Summit so we can kick-off a discussion around heterogenous resources that Scheduler would have to play with.<br>
Anyway there is a blueprint called isolate-scheduler-db [1] aiming to define a clear way to provide such metrics thanks to the reviews proposed above.</p>
<p dir="ltr">[1] <a href="https://review.openstack.org/89843">https://review.openstack.org/89843</a> _______________________________________________<br>
> OpenStack-dev mailing list<br>
> <a href="mailto:OpenStack-dev@lists.openstack.org">OpenStack-dev@lists.openstack.org</a><br>
> <a href="http://lists.openstack.org/cgi-bin/mailman/listinfo/openstack-dev">http://lists.openstack.org/cgi-bin/mailman/listinfo/openstack-dev</a><br>
</p>