[openstack-dev][PCI passthrough] How to use PCI passthrough feature correctly? And is this BUG in update_devices_from_hypervisor_resources?

Sean Mooney smooney at redhat.com
Thu Mar 2 10:54:38 UTC 2023


adding Dmitrii who was the primary developer of the openstack integration so
they can provide more insight.

Dmitrii, did you ever give a presentation on the DPU support and how it's configured/integrated
that might help fill in the gaps for simon?

more inline.

On Thu, 2023-03-02 at 11:05 +0800, Simon Jones wrote:
> E...
> 
> But there are these things:
> 
> 1) Show some real happened in my test:
> 
> - Let me clear that, I use DPU in compute node:
> The graph in
> https://docs.openstack.org/neutron/latest/admin/ovn/smartnic_dpu.html .
> 
> - I configure exactly follow
> https://docs.openstack.org/neutron/latest/admin/ovn/smartnic_dpu.html,
> which is said below in "3) Let me post all what I do follow this link".
> 
> - In my test, I found after first three command (which is "openstack
> network create ...", "openstack subnet create", "openstack port create ..."),
> there are network topology exist in DPU side, and there are rules exist in
> OVN north DB, south DB of controller, like this:
> 
> > ```
> > root@c1:~# ovn-nbctl show
> > switch 9bdacdd4-ca2a-4e35-82ca-0b5fbd3a5976
> > (neutron-066c8dc2-c98b-4fb8-a541-8b367e8f6e69) (aka selfservice)
> >     port 01a68701-0e6a-4c30-bfba-904d1b9813e1
> >         addresses: ["unknown"]
> >     port 18a44c6f-af50-4830-ba86-54865abb60a1 (aka pf0vf1)
> >         addresses: ["fa:16:3e:13:36:e2 172.1.1.228"]
> > 
> > gyw@c1:~$ sudo ovn-sbctl list Port_Binding
> > _uuid               : 61dc8bc0-ab33-4d67-ac13-0781f89c905a
> > chassis             : []
> > datapath            : 91d3509c-d794-496a-ba11-3706ebf143c8
> > encap               : []
> > external_ids        : {name=pf0vf1, "neutron:cidrs"="172.1.1.241/24",
> > "neutron:device_id"="", "neutron:device_owner"="",
> > "neutron:network_name"=neutron-066c8dc2-c98b-4fb8-a541-8b367e8f6e69,
> > "neutron:port_name"=pf0vf1,
> > "neutron:project_id"="512866f9994f4ad8916d8539a7cdeec9",
> > "neutron:revision_number"="1",
> > "neutron:security_group_ids"="de8883e8-ccac-4be2-9bb2-95e732b0c114"}
> > 
> > root@c1c2dpu:~# sudo ovs-vsctl show
> > 62cf78e5-2c02-471e-927e-1d69c2c22195
> >     Bridge br-int
> >         fail_mode: secure
> >         datapath_type: system
> >         Port br-int
> >             Interface br-int
> >                 type: internal
> >         Port ovn--1
> >             Interface ovn--1
> >                 type: geneve
> >                 options: {csum="true", key=flow, remote_ip="172.168.2.98"}
> >         Port pf0vf1
> >             Interface pf0vf1
> >     ovs_version: "2.17.2-24a81c8"
> > ```
> > 
> That's why I guess "first three command" has already create network
> topology, and "openstack server create" command only need to plug VF into
> VM in HOST SIDE, DO NOT CALL NEUTRON. As network has already done.
no, that just looks like the standard bridge topology that gets created when you provision
the dpu to be used with openstack via ovn.

that looks unrelated to the neutron commands you ran.
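
one thing you can read off the output you pasted: the Port_Binding row has `chassis : []`, i.e. the port is not bound to any chassis yet. here is a minimal python sketch of that check. it assumes the plain `key : value` layout with blank lines between records that `ovn-sbctl list` printed above, and `unbound_port_bindings` is just a name i made up for illustration, not something ovn or neutron ships.

```python
from typing import Dict, List


def unbound_port_bindings(sbctl_output: str) -> List[str]:
    """Return the _uuid of every record whose chassis column is empty.

    Illustrative helper only. It assumes "key : value" lines with a blank
    line between records, as in the pasted `ovn-sbctl list Port_Binding`.
    """
    unbound: List[str] = []
    record: Dict[str, str] = {}
    # The extra "" flushes the last record even without a trailing blank line.
    for line in sbctl_output.splitlines() + [""]:
        if not line.strip():
            if "_uuid" in record and record.get("chassis") == "[]":
                unbound.append(record["_uuid"])
            record = {}
            continue
        # Split on the first ":" only, so values containing ":" stay intact.
        key, sep, value = line.partition(":")
        if sep:
            record[key.strip()] = value.strip()
    return unbound


# Columns copied from the Port_Binding row quoted above.
SAMPLE = """\
_uuid               : 61dc8bc0-ab33-4d67-ac13-0781f89c905a
chassis             : []
datapath            : 91d3509c-d794-496a-ba11-3706ebf143c8
encap               : []
"""

print(unbound_port_bindings(SAMPLE))
```

for the row you pasted this reports that row's _uuid as unbound, which is what i would expect before any vm has been scheduled.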
> 
> - In my test, then I run "openstack server create" command, I got ERROR
> which said "No valid host...", which is what the email said above.
> The reason has already said, it's nova-scheduler's PCI filter module report
> no valid host. The reason "nova-scheduler's PCI filter module report no
> valid host" is nova-scheduler could NOT see PCI information of compute
> node. The reason "nova-scheduler could NOT see PCI information of compute
> node" is compute node's /etc/nova/nova.conf configure remote_managed tag
> like this:
> 
> > ```
> > [pci]
> > passthrough_whitelist = {"vendor_id": "15b3", "product_id": "101e",
> > "physical_network": null, "remote_managed": "true"}
> > alias = { "vendor_id":"15b3", "product_id":"101e",
> > "device_type":"type-VF", "name":"a1" }
> > ```
> > 
> 
> 2) Discuss some detail design of "remote_managed" tag, I don't know if this
> is right in the design of openstack with DPU:
> 
> - In neutron-server side, use remote_managed tag in "openstack port create
> ..." command.
> This command will make neutron-server / OVN / ovn-controller / ovs to make
> the network topology done, like above said.
> I think this is right, because test shows that.
that is not correct.
your tests do not show what you think they do. they show the basic bridge
topology and flow configuration that ovn installs by default when it manages
an ovs.

please read the design docs for this feature for both nova and neutron to understand how the interaction works.
https://specs.openstack.org/openstack/nova-specs/specs/yoga/implemented/integration-with-off-path-network-backends.html
https://specs.openstack.org/openstack/neutron-specs/specs/yoga/off-path-smartnic-dpu-port-binding-with-ovn.html
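
to restate the matching rule from my earlier reply quoted further down in this thread, here is a toy python model. it is only a sketch of the rule as described in this thread, not nova's actual code: `vf_matches_port` is a hypothetical name, only the two vnic types discussed here are modelled, and the hyphenated spelling of the vnic type is how i believe the openstack client writes it, so treat that spelling as an assumption.

```python
from typing import Optional, Union

# Assumed spellings of the two vnic types discussed in this thread.
DIRECT = "direct"
REMOTE_MANAGED = "remote-managed"


def vf_matches_port(remote_managed_tag: Optional[Union[bool, str]],
                    vnic_type: str) -> bool:
    """Toy model: can a VF with this device-spec tag back this port?

    remote_managed_tag is the "remote_managed" value from the [pci] device
    spec: "true"/"false" as written in nova.conf, a bool, or None if unset.
    """
    if vnic_type not in (DIRECT, REMOTE_MANAGED):
        raise ValueError("only direct and remote-managed are modelled here")
    # Treat anything other than true (bool or string) as false/unset.
    tagged = str(remote_managed_tag).lower() == "true"
    # From the thread: a tagged VF is only usable by a remote-managed port,
    # and an untagged VF is usable via vnic-type=direct.
    # Assumption: an untagged VF cannot back a remote-managed port.
    return tagged == (vnic_type == REMOTE_MANAGED)


print(vf_matches_port("true", DIRECT))
print(vf_matches_port("true", REMOTE_MANAGED))
```

with the device spec from your nova.conf, the first call models a direct port (no match) and the second models the remote-managed port from the dpu guide (match).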
> 
> - In nova side, there are 2 things should process, first is PCI passthrough
> filter, second is nova-compute to plug VF into VM.
> 
> If the link above is right, which remote_managed tag exists in
> /etc/nova/nova.conf of controller node and exists in /etc/nova/nova.conf of
> compute node.
> As above ("- In my test, then I run "openstack server create" command")
> said, got ERROR in this step.
> So what should do in "PCI passthrough filter" ? How to configure ?
> 
> Then, if "PCI passthrough filter" stage pass, what will do of  nova-compute
> in compute node?
> 
> 3) Post all what I do follow this link:
> https://docs.openstack.org/neutron/latest/admin/ovn/smartnic_dpu.html.
> - build openstack physical env, like plug DPU into compute node, use VM as
> controller ... etc.
> - build openstack nova, neutron, ovn, ovn-vif, ovs follow that link.
> - configure DPU side /etc/neutron/neutron.conf
> - configure host side /etc/nova/nova.conf
> - configure host side /etc/nova/nova-compute.conf
> - run first 3 command
> - last, run this command, got ERROR
> 
> ----
> Simon Jones
> 
> 
> Sean Mooney <smooney at redhat.com> wrote on Wed, Mar 1, 2023 at 18:35:
> 
> > On Wed, 2023-03-01 at 18:12 +0800, Simon Jones wrote:
> > > Thanks a lot !!!
> > > 
> > > As you say, I follow
> > > https://docs.openstack.org/neutron/latest/admin/ovn/smartnic_dpu.html.
> > > And I want to use DPU mode. Not "disable DPU mode".
> > > So I think I should follow the link above exactly, so I use
> > > vnic-type=remote_managed.
> > > In my opinion, after I run first three command (which is "openstack network
> > > create ...", "openstack subnet create", "openstack port create ..."), the
> > > VF rep port and OVN and OVS rules are all ready.
> > not at that point. nothing will have been done on ovn/ovs.
> > 
> > that will only happen after the port is bound to a vm and host.
> > 
> > > What I should do in "openstack server create ..." is to JUST add PCI device
> > > into VM, do NOT call neutron-server in nova-compute of compute node (like
> > > call port_binding or something).
> > this is incorrect.
> > > 
> > > But as the log and steps said in the emails above, nova-compute call
> > > port_binding to neutron-server while running the command "openstack server
> > > create ...".
> > > 
> > > So I still have questions:
> > > 1) Is my opinion right? Which is "JUST add PCI device into VM, do NOT call
> > > neutron-server in nova-compute of compute node (like call port_binding or
> > > something)".
> > no, this is not how it's designed.
> > until you attach the logical port to a vm (either at runtime or as part of
> > vm create)
> > the logical port is not associated with any host or physical dpu/vf.
> > 
> > so it's not possible to instantiate the openflow rules in ovs from the
> > logical switch model in the ovn north db, as no chassis info has been
> > populated and we do not have the dpu serial info in the port binding details.
> > > 2) If it's right, how to deal with this? Which is how to JUST add PCI
> > > device into VM, do NOT call neutron-server? By command or by configure? Is
> > > there some document ?
> > no, this happens automatically when nova does the port binding, which cannot
> > happen until after the vm is scheduled to a host.
> > > 
> > > ----
> > > Simon Jones
> > > 
> > > 
> > > Sean Mooney <smooney at redhat.com> wrote on Wed, Mar 1, 2023 at 16:15:
> > > 
> > > > On Wed, 2023-03-01 at 15:20 +0800, Simon Jones wrote:
> > > > > BTW, this link
> > > > > (https://docs.openstack.org/neutron/latest/admin/ovn/smartnic_dpu.html)
> > > > > said I SHOULD add "remote_managed" in /etc/nova/nova.conf, is that WRONG ?
> > > > 
> > > > no it's not wrong, but for dpu smart nics you have to make a choice when you
> > > > deploy.
> > > > either they can be used in dpu mode, in which case remote_managed should be
> > > > set to true
> > > > and you can only use them via neutron ports with vnic-type=remote_managed
> > > > as described in that doc
> > > > 
> > > > https://docs.openstack.org/neutron/latest/admin/ovn/smartnic_dpu.html#launch-an-instance-with-remote-managed-port
> > > > 
> > > > 
> > > > or if you disable dpu mode in the nic firmware then you should remove
> > > > remote_managed from the pci device list and
> > > > then it can be used like a normal vf, either for neutron sriov ports
> > > > vnic-type=direct or via flavor based pci passthrough.
> > > > 
> > > > the issue you were having is you configured the pci device list to contain
> > > > "remote_managed: true", which means
> > > > the vf can only be consumed by a neutron port with
> > > > vnic-type=remote_managed. when you have "remote_managed: false" or unset
> > > > you can use it via vnic-type=direct. i forgot that slight detail that
> > > > vnic-type=remote_managed is required for "remote_managed: true".
> > > > 
> > > > 
> > > > in either case you found the correct doc
> > > > https://docs.openstack.org/neutron/latest/admin/ovn/smartnic_dpu.html
> > > > neutron sriov port configuration is documented here
> > > > https://docs.openstack.org/neutron/latest/admin/config-sriov.html
> > > > and nova flavor based pci passthrough is documented here
> > > > https://docs.openstack.org/nova/latest/admin/pci-passthrough.html
> > > > 
> > > > all three serve slightly different uses. both neutron procedures are
> > > > exclusively for network interfaces.
> > > > https://docs.openstack.org/neutron/latest/admin/ovn/smartnic_dpu.html
> > > > requires the use of ovn deployed on the dpu
> > > > to configure the VF control plane.
> > > > https://docs.openstack.org/neutron/latest/admin/config-sriov.html uses
> > > > the sriov nic agent
> > > > to manage the VF with ip tools.
> > > > https://docs.openstack.org/nova/latest/admin/pci-passthrough.html is
> > > > intended for pci passthrough
> > > > of stateless accelerators like qat devices. while the nova flavor approach
> > > > can be used with nics, it's not how it's generally
> > > > meant to be used, and when used to pass through a nic the expectation is
> > > > that it's not related to a neutron network.
> > > > 
> > > > 
> > 
> > 



