Hello Jay,

I have another question regarding managing nodes.
The undercloud node had a problem with its disk and disconnected from the overcloud.
I couldn’t restore the undercloud controller and ended up re-installing the undercloud (running 'openstack undercloud install').
The installation completed successfully, but now cleaning of the nodes fails:

(undercloud) [stack@interop010 ~]$ openstack baremetal node list
+--------------------------------------+------------+---------------+-------------+--------------------+-------------+
| UUID                                 | Name       | Instance UUID | Power State | Provisioning State | Maintenance |
+--------------------------------------+------------+---------------+-------------+--------------------+-------------+
| 97b9a603-f64f-47c1-9fb4-6c68a5b38ff6 | interop025 | None          | power on    | clean failed       | True        |
| 4b02703a-f765-4ebb-85ed-75e88b4cbea5 | interop026 | None          | power on    | clean failed       | True        |
+--------------------------------------+------------+---------------+-------------+--------------------+-------------+

I’ve tried to move a node to the available state, but cannot:
(undercloud) [stack@interop010 ~]$ openstack baremetal node provide 97b9a603-f64f-47c1-9fb4-6c68a5b38ff6
The requested action "provide" can not be performed on node "97b9a603-f64f-47c1-9fb4-6c68a5b38ff6" while it is in state "clean failed". (HTTP 400)

How do I make the nodes available again? The overcloud deployment currently fails with:
ERROR due to "Message: No valid host was found. , Code: 500"

Thanks,
Igal
On 4 Mar 2021, at 20:12, Jay Faulkner <jay.faulkner@verizonmedia.com> wrote:

When a node is active with an instance UUID set, that generally indicates a nova instance (with that UUID) is provisioned onto the node. Nodes that are provisioned (active) are not able to be moved to manageable state.

If you want to reprovision these nodes, you'll want to delete the associated instances from Nova (openstack server delete instanceUUID), and after they complete a cleaning cycle they'll return to available.
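
As a rough sketch of that sequence, assuming the instance UUIDs shown in your node list are still known to Nova on the undercloud. The DRY_RUN switch and the helper function names are only illustrative, not part of any OpenStack tooling; by default the script just prints the commands it would run:

```shell
#!/usr/bin/env bash
# Sketch only. UUIDs are copied from the node list quoted in this thread.
# DRY_RUN, run, release_node and node_state are hypothetical helpers.
DRY_RUN="${DRY_RUN:-1}"   # 1 = print commands, anything else = execute them

run() {
    if [ "$DRY_RUN" = "1" ]; then
        echo "+ $*"
    else
        "$@"
    fi
}

# Delete the Nova instance that is holding a node ($1 = instance UUID).
release_node() {
    run openstack server delete "$1"
}

# Show only the provisioning state of a node ($1 = node UUID or name).
node_state() {
    run openstack baremetal node show "$1" -f value -c provision_state
}

release_node c7bf16b7-eb3c-4022-88de-7c5a78cda174
release_node 99223f65-6985-4815-92ff-e19a28c2aab1

# Re-run these until cleaning has finished and the state reads "available".
node_state 97b9a603-f64f-47c1-9fb4-6c68a5b38ff6
node_state 4b02703a-f765-4ebb-85ed-75e88b4cbea5
```

Running it with DRY_RUN=0 executes the commands for real; checking the state is something you would repeat by hand, since cleaning can take a while.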

Good luck,
Jay Faulkner


On Thu, Mar 4, 2021 at 10:01 AM Igal Katzir <ikatzir@infinidat.com> wrote:
Hello Forum,

I have an overcloud that has gone bad and I am trying to re-deploy it. I am running RHOS 16.1 with one director and two overcloud nodes (compute and controller).
I have re-installed the undercloud, and both nodes are in an active provisioning state.
Do I need to run introspection again?
Here is the output of baremetal node list:
(undercloud) [stack@interop010 ~]$ openstack baremetal node list
+--------------------------------------+------------+--------------------------------------+-------------+--------------------+-------------+
| UUID                                 | Name       | Instance UUID                        | Power State | Provisioning State | Maintenance |
+--------------------------------------+------------+--------------------------------------+-------------+--------------------+-------------+
| 97b9a603-f64f-47c1-9fb4-6c68a5b38ff6 | interop025 | c7bf16b7-eb3c-4022-88de-7c5a78cda174 | power on    | active             | False       |
| 4b02703a-f765-4ebb-85ed-75e88b4cbea5 | interop026 | 99223f65-6985-4815-92ff-e19a28c2aab1 | power on    | active             | False       |
+--------------------------------------+------------+--------------------------------------+-------------+--------------------+-------------+
When I try to move each node from active to manageable, I get an error:
(undercloud) [stack@interop010 ~]$ openstack baremetal node manage 4b02703a-f765-4ebb-85ed-75e88b4cbea5
The requested action "manage" can not be performed on node "4b02703a-f765-4ebb-85ed-75e88b4cbea5" while it is in state "active". (HTTP 400)

How do I get the nodes to a state that is ready for deployment (available)?

Thanks,
Igal