On Thu, 25 Feb 2021 at 15:22, Mohammed Naser <mnaser@vexxhost.com> wrote:
On Thu, Feb 25, 2021 at 4:24 AM Mark Goddard <mark@stackhpc.com> wrote:
Hi,
Some of the Kayobe CI jobs are failing when run on the vexxhost cloud. The particular part that fails is when testing 'bare metal' deployment using libvirt/qemu VMs. As far as I can tell, the test passes fairly reliably on other clouds, and fails reliably on vexxhost.
Can you please provide logs or any exact error outcomes of what's happening so I can try and help?
If we want to dig in, we'll need a little bit more information. I suspect it may be nested virtualization at play here which is always a tricky thing. Are you using that or are you using plain qemu?
Hi Mohammed, thanks for the offer of help. You are quite right about nested virt - I tracked the issue down to a change to our test configuration that inadvertently stopped us forcing the use of qemu. With that change reverted, the job does now pass on vexxhost infrastructure.
Does anyone have any suggestion as to what might be causing this? priteau suggested entropy could be an issue, but we have not investigated yet.
Is there any way to block a particular cloud provider for certain CI jobs until this is resolved?
Thanks, Mark
-- Mohammed Naser VEXXHOST, Inc.