Openstack Graphical processors virtualisation
Sean Mooney
smooney at redhat.com
Wed Aug 10 12:48:37 UTC 2022
On Wed, Aug 10, 2022 at 1:23 PM Danny Webb <Danny.Webb at thehutgroup.com> wrote:
>
> worth also mentioning that MIG isn't currently supported in Openstack. We just finished a POC with some a100 and a40 cards and the vgpu setup wasn't too hard to do, but you definitely needed to read a combination of the NVIDIA docs and the openstack docs to get a working setup.
Actually, from a Nova perspective MIG is supported; however, you have to
preconfigure the devices.
MIG just moves the mdevs to the VFs, and you need to pre-create the VFs
and GPU compute instances, but we don't plan to change that going
forward.
https://bugs.launchpad.net/nova/+bug/1900800 is a valid bug but it's
not related to MIG.
Recreating the mdevs on host reboot is a complex task currently, but
it's being worked on.
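
Since the VFs and mdev types have to exist on the host before Nova can use them, it can help to check what the kernel actually exposes. A minimal sketch, assuming the standard Linux mdev sysfs layout under /sys/class/mdev_bus; the function name and the sysfs_root parameter are mine for illustration and are not part of Nova:

```python
from pathlib import Path


def list_mdev_types(sysfs_root="/sys"):
    """Return {parent_device: {mdev_type: available_instances}} read from sysfs.

    sysfs_root is a hypothetical parameter so the function can be pointed
    at a fake tree when testing; on a real host leave it as "/sys".
    """
    result = {}
    bus = Path(sysfs_root) / "class" / "mdev_bus"
    if not bus.is_dir():
        # No mdev-capable parent devices registered (or driver not loaded).
        return result
    for dev in sorted(bus.iterdir()):
        types_dir = dev / "mdev_supported_types"
        if not types_dir.is_dir():
            continue
        types = {}
        for mdev_type in sorted(types_dir.iterdir()):
            avail = mdev_type / "available_instances"
            if avail.is_file():
                types[mdev_type.name] = int(avail.read_text().strip())
        result[dev.name] = types
    return result


if __name__ == "__main__":
    for parent, types in list_mdev_types().items():
        for name, available in types.items():
            print(f"{parent} {name} available_instances={available}")
```

With MIG the parent devices listed should be the VFs rather than the physical function, which is a quick way to confirm the pre-creation step worked.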
>
> There is one open bug for mediated devices that you should be aware of though it looks like a fix is in the works:
>
> https://bugs.launchpad.net/nova/+bug/1900800
>
> For straight PCI passthrough I found some of the tooling around it in openstack lacking compared to for the vgpu devices but that may have just been me missing something. Eg, I could only really see the PCI devices in the DB but couldn't find a way to see them using the SDK / cli.
Yes, that is by design rather than an oversight. We are changing that in
the Zed cycle, as we will start tracking PCI devices in placement going
forward,
but our intent is not to expose them via the Nova API.
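
In the meantime, one way to see the devices is to look on the compute host itself rather than through the API. A rough sketch, assuming the usual Linux PCI sysfs layout under /sys/bus/pci/devices; the function name and the sysfs_root parameter are made up for illustration, and this only shows what the kernel sees, not what Nova has recorded:

```python
from pathlib import Path

NVIDIA_VENDOR_ID = "0x10de"  # PCI vendor ID assigned to NVIDIA


def list_pci_devices(vendor_id=NVIDIA_VENDOR_ID, sysfs_root="/sys"):
    """List PCI devices for one vendor, with their SR-IOV VF count if any.

    sysfs_root is a hypothetical parameter for testing against a fake
    tree; on a real host leave it as "/sys".
    """
    devices = []
    base = Path(sysfs_root) / "bus" / "pci" / "devices"
    if not base.is_dir():
        return devices
    for dev in sorted(base.iterdir()):
        vendor_file = dev / "vendor"
        if not vendor_file.is_file():
            continue
        if vendor_file.read_text().strip().lower() != vendor_id.lower():
            continue
        product = (dev / "device").read_text().strip()
        # sriov_numvfs only exists on SR-IOV capable physical functions.
        numvfs_file = dev / "sriov_numvfs"
        numvfs = int(numvfs_file.read_text()) if numvfs_file.is_file() else None
        devices.append(
            {"address": dev.name, "product_id": product, "sriov_numvfs": numvfs}
        )
    return devices


if __name__ == "__main__":
    for entry in list_pci_devices():
        print(entry)
```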
> ________________________________
> From: Sean Mooney <smooney at redhat.com>
> Sent: 10 August 2022 11:23
> To: Alvaro Soto <alsotoes at gmail.com>
> Cc: KK CHN <kkchn.in at gmail.com>; openstack-discuss <openstack-discuss at lists.openstack.org>
> Subject: Re: Openstack Graphical processors virtualisation
>
> Our main vGPU expert is on PTO for a few weeks, but I'll try and respond inline.
>
> On Wed, Aug 10, 2022 at 8:07 AM Alvaro Soto <alsotoes at gmail.com> wrote:
> >
> > Hey, just a little help with one question.
> >
> > 1-
> > https://docs.openstack.org/nova/yoga/admin/virtual-gpu.html
> > https://access.redhat.com/documentation/en-us/red_hat_openstack_platform/13/html/instances_and_images_guide/ch-virtual_gpu
> Yep, those are the docs for Nova's vGPU support. We also support PCI
> passthrough of a full GPU to a VM for use cases that need that.
> >
> > On Wed, Aug 10, 2022 at 1:17 AM KK CHN <kkchn.in at gmail.com> wrote:
> >>
> >> 1. Does Openstack support GPU virtualization ? We are running a cloud using ussuri with KVM hypervisors. Is creation of vGPUs possible?
>
> Yes, if you have an NVIDIA GPU and pay for their license server etc.,
> you can configure Nova to expose those vGPUs to the guests.
> If you have AMD GPUs and those support their MxGPU feature, then that
> just exposes the vGPUs as standard SR-IOV VFs instead of vfio-mediated
> devices,
> so you use PCI passthrough instead of the vGPU mdev feature in that
> case to consume them.
> >>
> >> Does KVM support GPU virtualization or any limitations?
> Yes; however, NVIDIA have not yet upstreamed support for vGPU/mdev live
> migration. Red Hat, NVIDIA and others are currently actively working on
> upstreaming that to the kernel,
> QEMU and libvirt, but that is the main limitation. Depending on your
> release, Nova also does not support move operations like cold migration;
> however, those have been
> added in more recent releases.
> >>
> >> 2. What are the supported GPU types? (GRID GPU or GPUs attached to physical blades )
> both
> >>
> >> 3. If it supports GPUs attached to physical blades, will live migration of the VM be supported? Will OpenStack be able to identify the next host which has an attached GPU and perform live migration, in case of the failure of one blade with an attached GPU and a resident VM?
>
> No, live migration is not possible with mdev or SR-IOV VF attachment currently.
> Cold migration and evacuate are supported in more recent releases of
> OpenStack in the case of maintenance or hardware failure.
> >>
> >> 4. What are the virtualization options if vGPU options are not supported by a particular GPU model?
> PCI passthrough of the full GPU, or SR-IOV if the card supports that.
> >>
> >>
> >> Thanks in advance,
> >> Krish
> >
> >
> >
> > --
> >
> > Alvaro Soto
> >
> > Note: My work hours may not be your work hours. Please do not feel the need to respond during a time that is not convenient for you.
> > ----------------------------------------------------------
> > Great people talk about ideas,
> > ordinary people talk about things,
> > small people talk... about other people.
>
> Danny Webb
> Principal OpenStack Engineer
> The Hut Group
>
> Tel:
> Email: Danny.Webb at thehutgroup.com
>
>