[Openstack-operators] PCI passthrough trying to use busy resource?

Jonathan D. Proulx jon at csail.mit.edu
Tue Oct 18 18:50:11 UTC 2016


Hi all,

I have a test GPU system that seemed to be working properly under Kilo
running 1 and 2 GPU instnace types on an 8GPU server.

After Mitaka upgrade it seems to alway try and assing the same Device
which is alredy in use rather than pick one of the 5 currently
available.


 Build of instance 9542cc63-793c-440e-9a57-cc06eb401839 was
 re-scheduled: Requested operation is not valid: PCI device
 0000:09:00.0 is in use by driver QEMU, domain instance-000abefa
 _do_build_and_run_instance
 /usr/lib/python2.7/dist-packages/nova/compute/manager.py:1945

it tries to schedule 5 times, but each time uses the same busy
device.  Since there are currently 3 in use if it had just picked a
new one each time

In trying to debug this I realize I have no idea how devices are
selected. Does OpenStack track which PCI devices are claimed or is
that a libvirt function and in either case where woudl I look to find
out what it thinks the current state is?

Thanks,
-Jon
-- 



More information about the OpenStack-operators mailing list