<html><head><meta http-equiv="Content-Type" content="text/html; charset=utf-8"></head><body style="word-wrap: break-word; -webkit-nbsp-mode: space; line-break: after-white-space;" class=""><div class="">Hello all,</div><div class=""><br class=""></div><div class="">While troubleshooting this, another observation I see is that when I run put the node in state provide:</div><div class="">'openstack baremetal node provide 97b9a603-f64f-47c1-9fb4-6c68a5b38ff6’</div><div class="">It starts the cleaning process, then the node boots into PXE but the undercloud ignores it.</div><div class="">When I tap the port I see that requests reach its interface:</div><div class=""><br class=""></div><div class="">(undercloud) [stack@interop010 ~]$ sudo tcpdump -i br-ctlplane</div><div class="">10:43:10.600421 IP 0.0.0.0.bootpc > 255.255.255.255.bootps: BOOTP/DHCP, Request from a0:36:9f:95:dd:e2 (oui Unknown), length 548</div><div class=""><br class=""></div><div class="">But on the same time the dnsmasq ignores it:</div><div class="">(undercloud) [stack@interop010 ~]$ sudo tail -f /var/log/containers/ironic-inspector/dnsmasq.log</div><div class="">Mar 24 10:39:43 dnsmasq-dhcp[7]: DHCPDISCOVER(br-ctlplane) 6c:ae:8b:69:ee:80 ignored</div><div class="">Mar 24 10:40:36 dnsmasq-dhcp[7]: DHCPDISCOVER(br-ctlplane) a0:36:9f:95:dd:e2 ignored</div><div class="">Mar 24 10:40:39 dnsmasq-dhcp[7]: DHCPDISCOVER(br-ctlplane) a0:36:9f:95:dd:e2 ignored</div><div class="">Mar 24 10:40:48 dnsmasq-dhcp[7]: DHCPDISCOVER(br-ctlplane) 6c:ae:8b:69:ee:80 ignored</div><div class="">Mar 24 10:41:52 dnsmasq-dhcp[7]: DHCPDISCOVER(br-ctlplane) 6c:ae:8b:69:ee:80 ignored</div><div class="">Mar 24 10:42:57 dnsmasq-dhcp[7]: DHCPDISCOVER(br-ctlplane) 6c:ae:8b:69:ee:80 ignored</div><div class="">Mar 24 10:43:06 dnsmasq-dhcp[7]: DHCPDISCOVER(br-ctlplane) a0:36:9f:95:dd:e2 ignored</div><div class="">Mar 24 10:43:10 dnsmasq-dhcp[7]: DHCPDISCOVER(br-ctlplane) a0:36:9f:95:dd:e2 ignored</div><div class="">Mar 24 10:43:14 dnsmasq-dhcp[7]: DHCPDISCOVER(br-ctlplane) a0:36:9f:95:dd:e2 ignored</div><div class=""><br class=""></div><div class="">Why is that?</div><div class="">What is needed for the cleanup to start?</div><div class=""><br class=""></div><div class="">Thanks,</div><div class="">Igal</div><div class=""><br class=""></div><div><blockquote type="cite" class=""><div class="">On 24 Mar 2021, at 0:09, Igal Katzir <<a href="mailto:ikatzir@infinidat.com" class="">ikatzir@infinidat.com</a>> wrote:</div><br class="Apple-interchange-newline"><div class=""><meta http-equiv="Content-Type" content="text/html; charset=utf-8" class=""><div style="word-wrap: break-word; -webkit-nbsp-mode: space; line-break: after-white-space;" class="">Hello Team,<div class=""><br class=""></div><div class=""><div class="">I had a situation where my <i class="" style="font-weight: bold;">undercloud-node </i>had a problem with it’s disk and has disconnected from overcloud.</div><div class="">I couldn’t restore the undercloud controller and ended up re-installing it (running 'openstack undercloud install’).</div><div class="">The installation ended successfully but now I’m in a situation where Cleanup of the overcloud deployed nodes fails:</div><div class=""><br class=""></div><div class=""><div class="">(undercloud) [stack@interop010 ~]$ openstack baremetal node list</div><div class="">+--------------------------------------+------------+---------------+-------------+--------------------+-------------+</div><div class="">| UUID | Name | Instance UUID | Power State | Provisioning State | Maintenance |</div><div class="">+--------------------------------------+------------+---------------+-------------+--------------------+-------------+</div><div class="">| 97b9a603-f64f-47c1-9fb4-6c68a5b38ff6 | interop025 | None | power on | clean failed | True |</div><div class="">| 4b02703a-f765-4ebb-85ed-75e88b4cbea5 | interop026 | None | power on | clean failed | True |</div><div class="">+--------------------------------------+------------+---------------+-------------+--------------------+-------------+</div><div class=""><br class=""></div><div class="">I’ve tried to move node to available state but cannot:</div><div class=""><div class="">(undercloud) [stack@interop010 ~]$ openstack baremetal node provide 97b9a603-f64f-47c1-9fb4-6c68a5b38ff6</div><div class="">The requested action "provide" can not be performed on node "97b9a603-f64f-47c1-9fb4-6c68a5b38ff6" while it is in state "clean failed". (HTTP 400)</div><div class=""><br class=""></div><div class="">My question is:</div><div class=""><div class=""><b class="">How do I make the nodes available again?</b> </div><div class="">as the deployment of overcloud fails with:</div><div class="">ERROR due to "Message: No valid host was found. , Code: 500” </div><div class=""><br class=""></div><div class="">Thanks,</div><div class="">Igal</div></div></div></div></div></div></div></blockquote></div><br class=""></body></html>