<font size=2 face="sans-serif">Let me elaborate a little on my thoughts
about software orchestration, and respond to the recent mails from Zane
and Debo. I have expanded my picture at </font><a href=https://docs.google.com/drawings/d/1Y_yyIpql5_cdC8116XrBHzn6GfP_g0NHTTG_W4o0R9U><font size=2 face="sans-serif">https://docs.google.com/drawings/d/1Y_yyIpql5_cdC8116XrBHzn6GfP_g0NHTTG_W4o0R9U</font></a><font size=2 face="sans-serif">
and added a companion picture at </font><a href="https://docs.google.com/drawings/d/1TCfNwzH_NBnx3bNz-GQQ1bRVgBpJdstpu0lH_TONw6g"><font size=2 face="sans-serif">https://docs.google.com/drawings/d/1TCfNwzH_NBnx3bNz-GQQ1bRVgBpJdstpu0lH_TONw6g</font></a><font size=2 face="sans-serif">
that shows an alternative.</font>
<br>
<br><font size=2 face="sans-serif">One of the things I see going on is
discussion about better techniques for software orchestration than are
supported in plain CFN. Plain CFN allows any script you want in userdata,
and prescription of certain additional setup elsewhere in cfn metadata.
But it is all mixed together and very concrete. I think many
contributors would like to see something with more abstraction boundaries,
not only within one template but also the ability to have modular sources.</font>
<br>
<br><font size=2 face="sans-serif">I work closely with some colleagues
who have a particular software orchestration technology they call Weaver.
It takes as input for one deployment not a single monolithic template
but rather a collection of modules. Like higher level constructs
in programming languages, these have some independence and can be re-used
in various combinations and ways. Weaver has a compiler that weaves
together the given modules to form a monolithic model. In fact, the
input is a modular Ruby program, and the Weaver compiler is essentially
running that Ruby program; this program produces the monolithic model as
a side effect. Ruby is a pretty good language in which to embed a
domain-specific language, and my colleagues have done this. The modular
Weaver input mostly looks declarative, but you can use Ruby to reduce the
verboseness of, e.g., repetitive stuff --- as well as plain old modularity
with abstraction. We think the modular Weaver input is much more
compact and better for human reading and writing than plain old CFN. This
might not be obvious when you are doing the "hello world" example,
but when you get to realistic examples it becomes clear.</font>
<br>
<br><font size=2 face="sans-serif">The Weaver input discusses infrastructure
issues, in the rich way Debo and I have been advocating, as well as software.
For this reason I describe it as an integrated model (integrating
software and infrastructure issues). I hope for HOT to evolve to
be similarly expressive to the monolithic integrated model produced by
the Weaver compiler.</font>
<br>
<br><font size=2 face="sans-serif">In Weaver, as well as in some of the
other software orchestration technologies being discussed, there is a need
for some preparatory work before the infrastructure (e.g., VMs) is created.
This preparatory stage begins the implementation of the software
orchestration abstractions. Here is the translation from something
more abstract into flat userdata and other cfn metadata. For Weaver,
this stage also involves some stack-specific setup in a distinct coordination
service. When the VMs finally run their userdata, the Weaver-generated
scripts there use that pre-configured part of the coordination service
to interact properly with each other.</font>
<br>
<br><font size=2 face="sans-serif">I think that, to a first-order approximation,
the software orchestration preparatory stage commutes with holistic infrastructure
scheduling. They address independent issues, and can be done in either
order. That is why I have added a companion picture; the two pictures
show the two orders.</font>
<br>
<br><font size=2 face="sans-serif">My claim of commutativity is limited,
as I and colleagues have demonstrated only one of the two orderings; the
other is just a matter of recent thought. There could be gotchas
lurking in there.</font>
<br>
<br><font size=2 face="sans-serif">Between the two orderings, I have a
preference for the one I first mentioned and have experience with actually
running. It has the virtue of keeping related things closer together:
the software orchestration compiler is next to the software orchestration
preparatory stage, and the holistic infrastructure scheduling is next to
the infrastructure orchestration.</font>
<br>
<br><font size=2 face="sans-serif">In response to Debo's remark about flexibility:
I am happy to see an architecture that allows either ordering if it turns
out that they are both viable and the community really wants that flexibility.
I am not so sure we can totally give up on architecting where things
go, but this level of flexibility I can understand and get behind (provided
it works).</font>
<br>
<br><font size=2 face="sans-serif">Just as a LP solver is a general utility
whose uses do not require architecting, I can imagine a higher level utility
that solves abstract placement problems. Actually, this is not a
matter of imagination. My group has been evolving such a thing for
years. It is now based, as Debo recommends, on a very flexible and
general optimization algorithm. But the plumbing between it and the
rest of the system is significant; I would not expect many users to take
on that magnitude of task.</font>
<br>
<br><font size=2 face="sans-serif">I do not really want to get into dogmatic
fights over what gets labelled "heat". I will leave the
questions about which piece goes where in the OpenStack programs and projects
to those more informed and anointed. What I am trying to accomplish
is an agreement about how various bits of functionality will relate.</font>
<br>
<br><font size=2 face="sans-serif">Zane wrote:</font>
<br><tt><font size=2>> To take the first example, wouldn't your holistic
scheduler effectively have<br>
> to reserve a compute instance and some directly attached block storage
prior<br>
> to actually creating them? Have you considered Climate rather than
Heat as<br>
> an integration point?</font></tt>
<br>
<br><font size=2 face="sans-serif">I had not considered Climate. Based
on recent ML traffic, I see that Climate is about scheduling into the future,
whereas I am only trying to talk about scheduling for the present. OTOH,
perhaps you are concerned about concurrency issues. I am too. Doing
a better job on that is a big part of the revision my group is working
on now. I think it can be done. I plan to post a pointer to
some details soon.</font>
<br>
<br><font size=2 face="sans-serif">Perhaps the concern is about competition
between two managers trying to manage the same resources. I think
that is (a) something that can not be completely avoided and (b) impossible
to do well. My preference is to focus on one manager, and make sure
it tolerates surprises in a way that is not terrible. Even without
competing managers, bugs and other unexpected failures will cause nasty
surprises.</font>
<br>
<br><font size=2 face="sans-serif">In response to my remark about infrastructure
orchestration being downstream from holistic infrastructure scheduling,
Zane wrote:</font>
<br><tt><font size=2>> I agree that it's necessarily 'downstream' (in
the sense of happening<br>
> afterwards). I'd hesitate to use the word 'logically', since I think
by it's<br>
> very nature a holistic scheduler introduces dependencies between services<br>
> that were intended to be _logically_ independent.</font></tt>
<br>
<br><font size=2 face="sans-serif">I am not sure I understand the objection
here. All I am saying is that a holistic infrastructure scheduler
is one of the things that needs the services of something that does infrastructure
orchestration, and I see no need to introduce a new such thing. Surely
it is not bad for various things to use the existing mechanism. I
see plans for Trove to also do this, and for the revised autoscaling service
to also do this.</font>
<br>
<br><font size=2 face="sans-serif">Zane later wrote:</font>
<br><tt><font size=2>> As proposed, the software configs contain directives
like 'hosted_on:<br>
> server_name'. (I don't know that I'm a huge fan of this design, but
I don't<br>
> think the exact details are relevant in this context.) There's no<br>
> non-trivial processing in the preparatory stage of software orchestration<br>
> that would require it to be performed before scheduling could occur.</font></tt>
<br>
<br><font size=2 face="sans-serif">I hope I have addressed that with my
remarks above about software orchestration.</font>
<br>
<br><font size=2 face="sans-serif">Zane also wrote:</font>
<br><tt><font size=2>> Let's make sure we distinguish between doing
holistic scheduling, which<br>
> requires a priori knowledge of the resources to be created, and automatic<br>
> scheduling, which requires psychic knowledge of the user's mind. (Did
the<br>
> user want to optimise for performance or availability? How would you
infer<br>
> that from the template?)</font></tt>
<br>
<br><font size=2 face="sans-serif">One reason I favor holistic infrastructure
scheduling is that I want its input to be richer than today's CFN templates.
Like Debo, I think the input can contain the kind of information
that would otherwise require mind-reading. My group has been working
examples involving multiple levels of anti-co-location statements, network
reachability and proximity statements, disk exclusivity statements, and
statements about the presence of licensed products.</font>
<br>
<br><font size=2 face="sans-serif">Zane ended with:</font>
<br><tt><font size=2>> .. There's nothing that happens while preparing
the<br>
> software configurations that's necessary for the former nor sufficient
for<br>
> the latter.</font></tt>
<br>
<br><font size=2 face="sans-serif">If I understand correctly, I agree,
as mentioned in my above remarks about how software orchestration preparation
commutes with holistic infrastructure scheduling.</font>
<br>
<br><font size=2 face="sans-serif">Regards,</font>
<br><font size=2 face="sans-serif">Mike</font>