[nova][placement] 2023.1 Antelope PTG summary

Sean Mooney smooney at redhat.com
Mon Oct 24 18:40:59 UTC 2022


On Mon, 2022-10-24 at 18:49 +0200, Sylvain Bauza wrote:
> We're on a wet Monday here and I guess it's time for me to take my pen
> again and write a PTG recap email for the current release, which is 2023.1
> Antelope.
> Yet again, I *beg* you to *not* use any translation tool when reading any
> etherpad, including Google Chrome's embedded translation, or it would fully
> translate the whole etherpad content for *all* readers.
> In order to prevent any accidental misuse, here is a readonly copy of the
> Nova etherpad we had for the week
> https://etherpad.opendev.org/p/r.8ae4e0ef997aebfe626b2b272ff23f1b
> 
> Again, I'm human and while I can write bugs (actually I write a lot of
> them), I can also write wrong things in this email and I can
> misinterpret something specific. Apologies if so, and please don't
> refrain from correcting me by replying to this email.
> 
> Last but not least, I'm more than open to any questions or remarks that
> would come from the reading of this long email.
> 
> Good luck with this very long thread, you should maybe grab a coffee before
> starting to read it (and I promise to keep it as short as I can).
> 
> ### Operator hours ###
> 
> After the success of the productive Nova meet-and-greet session we held in
> Berlin (packed room w/ lots of feedback), we were eager to again have a
> discussion with operators, this time virtually, which would hopefully have
> lowered the entry barrier by not requiring an in-person event in order to
> attend the session.
> Consequently, we allocated three timeslots of one hour, two back-to-back on
> Tuesday and one on Wednesday (at different times to accommodate time
> differences between operators).
> Unfortunately, I have to write here that we only had a very small
> attendance on Tuesday with only three operators joining (but providing
> great feedback, thanks btw. !) and *none* on the 1-hour Wednesday slot.
> 
> As a result, I won't provide statistics about release and project usage
> here as numbers are too low to be representative, but here is what we
> discussed :
> (again, a readonly etherpad is available here
> https://etherpad.opendev.org/p/r.aa8b12b385297b455138d35172698d48 for
> further context)
> 
> # Upcoming features and general discussions
> 
> - the possibility to mount Manila shares in Nova seems promising.
> - As nova doesn't support multiple physnets per neutron network today, this
> prevents the use of routed networks in some cases.
To be precise, nova has never supported the Multi Provider Network extension
https://github.com/openstack/neutron-lib/blob/master/neutron_lib/api/definitions/multiprovidernet.py

which is the extension that allows a neutron network to span multiple physical networks.

Multiple network segments on the same physnet (e.g. using multiple VLANs all on the same physnet, or multi-segment networks that have a single physnet
+ tunneled segments) can be supported today.

https://github.com/openstack/nova/commit/b9d9d96a407db5a2adde3aed81e61cc9589c291a added support for multi-segment networks with a single
physnet in Pike, but there was never a nova sibling spec to add support for the multiprovidernet extension.
As such we only ever use the first physnet for all segments, and it is a non-trivial amount of work to change that in nova, especially in
the hardware-offloaded OVS or generic SR-IOV case. It would require the PCI manager to accept a list of possible physnets, and it will require additional
work when using placement in the future.

It's not impossible to do, but it would require careful implementation to get right, and it is a new feature that would require a spec.

> - modifying our mod_wsgi usage to allow passing some arguments is
> requested
This, however, is more of a bug or oversight:
we expected that you would just pass --config-dir to the WSGI script as a command-line argument, which is possible with uwsgi
but not with mod_wsgi. I have filed a tracking bug for this here https://bugs.launchpad.net/nova/+bug/1994056
and assigned it to myself for now.
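For illustration, here is a minimal sketch of the workaround space: since mod_wsgi cannot pass argv to the
WSGI script, the script itself would have to pick the config location up from the process environment.
OS_NOVA_CONFIG_DIR is an illustrative name, not a committed interface:

    import os
    from oslo_config import cfg

    CONF = cfg.CONF
    # mod_wsgi gives us no command line, so fall back to the environment
    config_dir = os.environ.get('OS_NOVA_CONFIG_DIR', '/etc/nova')
    CONF([], project='nova', default_config_dirs=[config_dir])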

> - operators seem interested in having PCI devices tracked in Placement
> 
> # Pain points
> 
> - getting inventories or used/available resources in Placement becomes
> costly as it requires N calls, with N being the number of Resource Providers
> (instead of a single HTTP call). Problem has been acknowledged by the team
> and some design discussion occurred.
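For context, a minimal sketch of the N+1 call pattern described in that bullet, assuming an
already-authenticated keystoneauth1 session named sess; the placement endpoints are real,
everything else is illustrative:

    from keystoneauth1 import adapter

    placement = adapter.Adapter(
        session=sess, service_type='placement',
        default_microversion='1.39')

    rps = placement.get('/resource_providers').json()['resource_providers']
    usages = {}
    for rp in rps:  # one extra HTTP round trip per resource provider
        usages[rp['uuid']] = placement.get(
            '/resource_providers/%s/usages' % rp['uuid']).json()['usages']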
>  - We should paginate the flavors API and we need to fix the
> private-to-public flavor bug.
>  - mediated devices disappearing at reboot is a problem. This has been
> discussed during the contributor session later, see below.
>  - routing metrics are hard to manage when you have a long list of multiple
> ports attached. We eventually agreed that the proposed fix could be
> an interesting feature for the Neutron team to develop.
> 
> That's it for the operators discussions. Let's now move on to what we
> discussed in the contributors PTG agenda :
> 
> 
> ### Cross-project discussions
> 
> Two projects held discussions with the Nova community this time :
> 
> # Ironic-Nova
> 
> The whole session was about the inconsistencies that can happen if a
> nova-compute service goes down while Ironic rebalances the nodes.
> 
> - we agreed on the fact this wasn't an easy problem to solve
> - consensus was reached that Ironic should support a sharding key so
> the Nova compute service could use it instead of using hash rings.
> - JayF and johnthetubaguy agreed on codifying this idea into a spec
> - we'd like to hear feedback from operators about how they feel about the
> above migration (sharding their cloud's nodes into pieces and providing the
> sharding details to nova)
> - in parallel with the above, documentation has to be amended in order to
> recommend that *existing deployments* set up active/passive failover mode
> for Nova computes (instead of relying on hash ring rebalances)
> 
> # Neutron-Nova
> 
> (I'll briefly cover the details here, as it will also be covered by the
> Neutron PTG recap)
> 
> - while Nova is finishing up tracking of PCI devices in Placement this
> cycle, we agreed on deferring the modeling of Neutron-based PCI requests
> until next cycle. This won't have any impact in terms of existing features,
> which will continue to operate seamlessly.
> - the way we define switchdev capabilities as of now is incorrect. We
> agreed on modifying Nova to allow it to report such capabilities to Neutron.
> This may be a specless blueprint or just a bugfix (and then potential
> backportability), to be determined later.
I will work on this this cycle. I can either use the old bug
https://bugs.launchpad.net/neutron/+bug/1713590, which was incorrectly "fixed"
in neutron, and add the nova project to it, or I can file a new one.
I have a slight preference for reusing the old one for the added context, but it's probably
cleaner to file a new one, so I don't really mind either way.
> - MTUs are unfortunately immutable. If you change the MTU value via Neutron,
> you need to restart the instance (or reattach the port). We consequently
> agreed on documenting this caveat, which is due to libvirt/qemu not being
> able to change the MTU while the guest is running.
We also agreed to move forward with https://review.opendev.org/c/openstack/nova/+/855664
to not provide an MTU if the network is configured with DHCP enabled,
because neutron will provide the MTU via DHCP in that case.
I have abandoned https://review.opendev.org/c/openstack/nova/+/852367
> - We eventually agreed on letting --hostname (a parameter of an instance
> creation request) represent an FQDN through a new nova microversion. This
> value won't be sanitized by nova and will be passed directly to Neutron as
> the hostname value. We also agreed on the fact Nova WON'T ever touch the
> port domain value in metadata, as managing name domains is not in the
> project scope.
Just to clarify the last point: we will pass it once, when the port is created by nova,
as we do today, using the value of instance.hostname, and it will keep the behaviour it
has today where it is never changed after first boot.
I proposed that if designate wants to automatically update the hostname, it can do so by extending
the designate-sink component, which listens for notifications in order to automatically register
fixed/floating IPs as DNS records.

If the designate project feels it is in scope, it could respond to the instance update notification
when instance.hostname is updated and modify the relevant neutron port or DNS record, but we strongly
advise against ever propagating such a change to floating IPs. Even for the fixed IP records, without
a new neutron extension on the port to opt into the dns_name update, it is likely that updating the
DNS records on an instance hostname update would break existing usage, so I advise caution if designate
chooses to add this feature in the future.
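For illustration, a sketch of what the agreed behaviour could look like on the wire; the exact
microversion is not decided yet and all field values below are made up:

    # server create request body under the hypothetical new microversion;
    # nova stores the hostname verbatim (no sanitization) and hands it to
    # neutron exactly once, at port creation, never updating it afterwards
    body = {
        "server": {
            "name": "web0",
            "hostname": "web0.prod.example.com",  # an FQDN is now allowed
            "imageRef": "...",
            "flavorRef": "...",
        }
    }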
> 
> 
> ### Procedural discussions (skip it if you're not interacting with the Nova
> contributors) ###
> 
> # Zed retrospective
> 
> - We had a productive Zed release and we did a good job reducing bugs
> and bug reports. Kudos to the team again.
> - The microversions etherpad we had was nice, but planning microversion
> usage is hard. We agreed on rather providing a document to contributors
> explaining how to write an API change that adds a new microversion and
> how to easily rebase this change if a merge conflict occurs (due to the
> microversion being taken by another patch that merged)
> - We agreed on keeping an etherpad for tracking all API changes during
> milestone-3
> - We agreed on filing blueprints for TC goals that impact Nova
> - bauzas will promote again the use of review-priority label in Gerrit
> during weekly meetings in order for cores to show their interest in
> reviewing a particular patch.
> 
> # Promoting backlog features to new contributors and enhance mentoring
> 
> - Starting this cycle, we'll put our list of small and easily actionable
> blueprints into Launchpad bug reports that have a "Wishlist" status and
> both a 'low-hanging-fruit' and a 'rfe' tag. New contributors or people
> wanting to mentor a new upstream member are more than welcome to consume
> that list of 'rfe' bugs and identify the ones they're willing to work on. A
> detailed process will be documented for helping newcomers to join.
> - We'll also draft the pile of work we defer into 'backlog specs' if it
> requires further design (and is less actionable from a newcomer perspective)
> 
> # Other procedural and release discussions
> 
> - we'll stop using Storyboard for tracking Placement features and bugs and
> we'll pivot back to Launchpad for the 'placement' sub-project.
> - we agreed on a *feature* review day, possibly 4 weeks before the feature
> freeze, in order to catch any late design issues we may have missed when
> reviewing the spec.
I would also suggest we maybe have one at the start of December, before we hit the PTO season,
to see if we can land any features that are ready early.
We can figure that out on IRC or in the team meeting.
> - we will EOL stein and older nova branches with elodilles proposing the
> deletion patch. We agreed on discussing EOL'ing train at next PTG.
> - gibi will work on providing a static requirements file for nova that
> would use minimum versions of our dependencies (non transitively) and
> modify unit and functional test jobs to rather use this capped requirements
> file for testing.
> - we discussed the User Survey and we agreed on discussing any question we
> may want to add in the next survey during the next weekly meetings.
> - "2023.1 Antelope" naming is a bit confusing, but we agreed on we should
> continue to either use "2023.1" or "2023.1 Antelope" for naming our docs.
Did we? Because I thought we agreed to use "27.0.0 (antelope)" in all docs, or
gmann's suggestion of "2023.1 Antelope (nova 27.0.0)", so that we keep continuity in our
docs and release notes. '"2023.1" or "2023.1 Antelope" or 2023.1 (27.0.0)' were what I proposed
we should *not* use in our docs, as they are confusing based on our past naming.
> We also wait for guidelines in order to consistently name our next stable
> branch (2023.1 possibly).
For the branch name I do think stable/2023.1 makes sense, and the docs URL should be
https://docs.openstack.org/nova/2023.1/ to match the branch name.
The other option is https://docs.openstack.org/nova/antelope/
> 
> 
> (That's the end of procedural and release discussions, please resume here
> if you skipped the above)
> 
> ### Technical bits ###
> 
> 
> # VMware driver status
> 
> - As no job runs have happened since April, we agreed on communicating
> in the nova-compute logs at periodic times that the code isn't tested, so
> operators running it would know its upstream status.
> - we'll update the supported matrix documentation to reflect this
> 'UNTESTED' state
> 
> # TC goals this cycle
> 
> - for the FIPS goal, we agreed on the fact the current FIPS job (running on
> centos 9 stream) shouldn't be running on gate and check pipelines. As the
> current job is busted (all periodic runs go into TIMEOUT state), we also
> want to modify the job timeout to allow 30 mins more time for running (due
> to a reboot in the job definition)
> - for the oslo.privsep goal, no effort is required from a nova perspective
> (all deliverables are already using privsep). sean-k-mooney can propose a
> 'rfe' bug (see the note above on new contributors) for modifying our
> privsep usage in nova (using different privsep context by callers)
I can file an RFE bug, although it might be better to also file a backlog spec for this,
or a contributor doc describing the correct way to use privsep going forward:

- one context per class of permissions (networking, file, etc.)
- do not import privsep functions; define them only in the modules where they are used
- include "privileged" in the name
- have a narrow contract taking a fixed number of named parameters, never commands as strings
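To make those guidelines concrete, here is a minimal sketch using the real oslo.privsep API;
the module layout, context name, and function are hypothetical examples, not existing nova code:

    # hypothetical module, e.g. nova/privsep/net.py: one context for the
    # networking class of permissions, with "privileged" in the name
    from oslo_privsep import capabilities
    from oslo_privsep import priv_context

    privileged_net_ctx = priv_context.PrivContext(
        'nova',
        cfg_section='nova_privileged_net',
        pypath=__name__ + '.privileged_net_ctx',
        capabilities=[capabilities.CAP_NET_ADMIN],
    )

    # defined in the module where it is used, not imported from elsewhere;
    # narrow contract: one typed, named parameter, never a command string
    @privileged_net_ctx.entrypoint
    def privileged_set_ipv4_forwarding(enabled: bool) -> None:
        with open('/proc/sys/net/ipv4/ip_forward', 'w') as f:
            f.write('1' if enabled else '0')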

> - for the Ubuntu 22.04 goal, gmann already provided changes. We need to
> review them.
> - for the RBAC goal, let's discuss it in a proper section just below
> 
> # Next steps on RBAC
> 
> - we need to test new policy defaults in a full integrated tempest
> testsuite (meaning with other projects), ideally before the first milestone.
> - once we check everything works fine as expected, we can flip the default
> (enabling new policies) before milestone-2
> - we'd like to start drafting the new service role usage by two new specs
> (one for nova, the other one for placement)
> 
> # Power management in Nova
> 
> A pile of work I'm proud we're going to start this cycle. This is about
> disabling/enabling cores on purpose, so power savings occur.
>  - we agreed on supporting it directly in Nova (rather than designing and
> supporting an external tool, which would have required drafting some heavy
> interface for an easy quick win). This would just be a config flag to turn on
> that would enable CPU cores on demand.
https://review.opendev.org/c/openstack/nova/+/821228
introduced cpu_external_management as a proposed config name; based on the
change of direction, I would suggest we add cpu_power_state_management instead.
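A minimal sketch of how such an option might be registered; the option name comes from the
suggestion above, while the default, help text and [libvirt] group are my assumptions:

    from oslo_config import cfg

    power_mgmt_opts = [
        cfg.BoolOpt('cpu_power_state_management',
                    default=False,
                    help='If enabled, nova-compute powers dedicated CPU '
                         'cores up and down as the instances pinned to '
                         'them are started and stopped.'),
    ]
    cfg.CONF.register_opts(power_mgmt_opts, group='libvirt')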
>  - a potential follow-up *may* be to use a different CPU governor depending
> on flavors or images but this won't be scoped for this release.
Yes, we agreed to not do this this cycle and to evaluate it in the future.
> 
> # Power monitoring in Nova
> 
> I don't really like this name, as monitoring isn't part of the Nova mission
> statement so I'll clarify. Here, this is about providing an internal
> readonly interface for power monitoring tools running on guests that would
> be able to capture host consumption metrics. One example of such monitoring
> tools is Scaphandre, demonstrated during the last OpenInfraSummit at a
> keynote.
> - we agreed on reusing the virtiofs support we're going to introduce in this
> release for the Manila share attachment usecase
>  - this share would be readonly and would be unique per instance (ie. we
> wouldn't be supporting multiple guest agents reading different directories)
>  - this share would be enabled thru a configuration flag per compute, and
> would only be mounted per instance by specific flavor or image
> extraspec/metadata.
> 
> # Database soft-deleted records and the archive/purge commands
> 
> - We don't want to deprecate DB soft-deleted records as some APIs continue
> to rely on those records.
> - We prefer instead to add a new parameter to the purge command that will
> directly delete the 'soft-deleted' records from the main DBs and not only
> the shadow tables (skipping the need to archive, if the operator wants to
> use this parameter)
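In other words, a hedged sketch of what the new parameter would do, shown against one main
table; in nova's schema a soft-deleted row has deleted != 0, and the connection URL is made up:

    from sqlalchemy import create_engine, text

    engine = create_engine('mysql+pymysql://nova:secret@db/nova')
    with engine.begin() as conn:
        # hard-delete soft-deleted rows directly, skipping the archive step
        conn.execute(text('DELETE FROM instances WHERE deleted != 0'))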
> 
> # Nova-compute failing at reboot due to vGPU missing devices
> 
> Talking of bug #1900800, the problem is that mediated devices aren't
> persistent so nova-compute isn't able to respawn the instances after a
> reboot if those are vGPU-flavored.
> - we agreed on the fact that, like any other device, Nova shouldn't try to
> create them and we should rather ask the operator to pre-create the
> mediated devices, exactly like we do for SR-IOV VFs.
> - we'll accordingly deprecate the mdev creation in the libvirt driver (but
> we'll continue to support it) and we'll log a warning if Nova has to create
> one mdev.
> - we'll change the nova-compute init script to raise a better exception
> explaining which mdev is missing
> - we'll document a procedure for explaining how to get existing mdev
> information and persist them by udev rules (for upgrade cases)
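As an illustration of that documentation step, a small sketch of how the existing mdev
information could be collected from sysfs before writing the udev rules; the sysfs layout is
standard, the script itself is just an example:

    import os

    MDEV_ROOT = '/sys/bus/mdev/devices'

    # print each mdev's UUID, type and parent device so they can be
    # recreated (e.g. by a udev rule) after a host reboot
    for mdev_uuid in os.listdir(MDEV_ROOT):
        dev_path = os.path.join(MDEV_ROOT, mdev_uuid)
        mdev_type = os.path.basename(
            os.readlink(os.path.join(dev_path, 'mdev_type')))
        parent = os.path.basename(os.path.dirname(os.path.realpath(dev_path)))
        print(mdev_uuid, mdev_type, parent)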
> 
> # Robustify our compute hostname changes
> 
> To be clear, at first step, we will continue to *NOT* support nova-compute
> hostname changes but we'll better detect the hostname change to prevent
> later issues.
> - first step to persist the compute UUID on disk seems a good candidate so
> a spec is targeted this cycle.
Specifically, we need to persist the compute service UUID in the compute manager.
We can discuss the details in the spec, as we may also want to persist the compute node
UUID, but that would have different behaviour depending on the virt driver in use, e.g. vmware/ironic (1:n) vs libvirt (1:1)
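A rough sketch of the first step; /var/lib/nova/compute_id is a hypothetical location, and the
exact semantics are what the spec needs to nail down:

    import os
    import uuid

    STATE_FILE = '/var/lib/nova/compute_id'

    def get_or_create_service_uuid() -> str:
        # reuse the persisted UUID if present, otherwise generate and
        # persist one on first start
        if os.path.exists(STATE_FILE):
            with open(STATE_FILE) as f:
                return f.read().strip()
        new_uuid = str(uuid.uuid4())
        with open(STATE_FILE, 'w') as f:
            f.write(new_uuid)
        return new_uuid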
> - next steps could be to robustify our relationships between instance,
> compute node and service object records but this design will be deferred
> for later in a backlog spec.
> 
> 
> # Move to OpenStackClient and SDK
> 
> OSC is already fully supported in Zed but it continues to rely on
> novaclient python bindings for calling the Nova API.
> - we agreed on modifying OSC to rather use openstacksdk (instead of
> novaclient) for communicating to the Nova APIs. Contributors welcome on
> this move.
> - we agreed on stopping to use project client libraries in nova services
> (eg. cinderclient used by nova-compute) and rather use openstacksdk
> directly. A 'rfe' bug per project client will be issued, anyone willing to
> work on it is welcome.
> - we also agreed on continuing to support novaclient for a couple of
> releases, as operators or other consumers of this package could require
> substantial efforts to move to the sdk.
> - we agreed on changing the release model for novaclient to be independent
> so we can release anytime we need.
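For anyone curious what the SDK-based call sites look like, a minimal example using the real
openstacksdk API; the cloud name comes from clouds.yaml and is an assumption:

    import openstack

    # the openstacksdk equivalent of novaclient's servers.list()
    conn = openstack.connect(cloud='mycloud')
    for server in conn.compute.servers():
        print(server.id, server.name, server.status)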
> 
> # Evacuate to target state
> 
> We understand the usecase (evacuating an instance shouldn't always turn the
> evacuated instance on)
> - that said, we don't want to amend the API for passing a parameter, as this
> would carry tech debt indefinitely
> - we prefer to propose a new microversion that would eventually stop the
> instance instead of starting it
We also agreed to file a bug for the existing behaviour when evacuating a stopped instance:
we currently start it and then stop it, which is what will initially happen for the new microversion
too, but long term we want to avoid starting the VM entirely.

This is safer, as the admin does not know what workload is in the guest, and it may not be safe
(disk corruption) or possible (encrypted volumes) to start the guest when we evacuate.

We can and should catch the exception raised in the latter case to allow the VM to be evacuated
and powered off in the short term. Medium to long term, we agreed to not start the VM with the new
microversion, and also to address this for resize verify, where we should not attempt to start the
VM if it was stopped.

This will correct the current odd behaviour in the instance action log where the VM is started and
then stopped.
> - operators wanting to keep the original behaviour would need to negotiate
> an older microversion to the Nova API, as we don't intend to make spawning
> on evacuate optionally chosen by the API.
Specifically, if the old microversion is used then the behaviour and limitations we have today will
be supported indefinitely. With the new microversion, the operator is opting in to evacuating to a
shutoff state.
> 
> (aaaaaaaaaaaaaaaand that's it for the PTG recap)
> 
> Kudos, you were brave, you reached that point. Hope your coffee was good
> and now you feel rejuvenated. Anyway, time for me now to rest my fingers
> and to enjoy a deserved time off.
> 
> As said, I'm all up for any questions or remarks that would come from the
> reading of this enormous thread.
> 
> -Sylvain



