[You don't often get email from
tyler@ghosty.pw. Learn why this is important at
https://aka.ms/LearnAboutSenderIdentification ]
Hello All,
I'm trying to get vGPU's to work with openstack and I believe I am
almost there, however I can't seem to get them to register to
placement.
Here is what I have so far
OS: Ubuntu 24.04
Kernel: 6.8.0-62-generic
Nvidia GRID Version: 570.158.02-570.158.01-573.39
Openstack Deployment: Kolla w/ Docker
Openstack Version: 2025.1
I have the SRIOV devices enabled with the systemd service, and can see
the vgpu types;
# cat /sys/bus/pci/devices/0000:25:00.0/virtfn0/nvidia/creatable_vgpu_types
ID : vGPU Name
908 : NVIDIA L4-1B
909 : NVIDIA L4-2B
910 : NVIDIA L4-1Q
911 : NVIDIA L4-2Q
912 : NVIDIA L4-3Q
913 : NVIDIA L4-4Q
914 : NVIDIA L4-6Q
915 : NVIDIA L4-8Q
916 : NVIDIA L4-12Q
917 : NVIDIA L4-24Q
918 : NVIDIA L4-1A
919 : NVIDIA L4-2A
920 : NVIDIA L4-3A
921 : NVIDIA L4-4A
922 : NVIDIA L4-6A
923 : NVIDIA L4-8A
924 : NVIDIA L4-12A
925 : NVIDIA L4-24A
My kolla node nova custom config (for nova-compute) is
[DEFAULT]
debug = true
verbose = true
[devices]
enabled_mdev_types = nvidia-918
[mdev_nvidia-918]
max_instances = 8
[libvirt]
live_migration_downtime = 500000
live_migration_downtime_steps = 3
live_migration_downtime_delay = 3
I've also created flavors, classes, and traits with;
openstack flavor create l4-1a --ram 8192 --disk 40 --vcpus 4
openstack flavor set l4-1a --property "resources:VGPU=1" --property
"trait:CUSTOM_NVIDIA_918=required"
openstack resource class create CUSTOM_NVIDIA_918
openstack trait create CUSTOM_NVIDIA_918
However I can't seem to get placement to show any vGPU's or any of the
traits I registered:
This just has an empty line
# openstack allocation candidate list --resource VGPU=1
and this will just show the standard vCPU/Memory/Disk
# openstack resource provider inventory list <Host UUID>
Have I missed a step somewhere, do I need to prepare the devices
further before nova can pick them up?
Thanks for any and all help!