Hi,
I would like to know if there's any interest in having an 'automatic
evacuation' feature for when a compute node goes down.
I found three blueprints related to this topic:
[1] Adding a periodic task and using ServiceGroup API for
compute-node status
[2] Using ceilometer to trigger the evacuate api.
[3] Including some kind of HA plugin by using a 'resource
optimization service'
Most of those blueprints have comments like 'this logic should not reside
in nova', so that's why I am asking what the best approach would be to
get something like that.
Should this be ignored, and just rely on external monitoring tools to
trigger the evacuation?
There are complex scenarios that require a lot of logic that won't fit
into nova or any other OpenStack component. (For instance: sometimes it
will be faster to reboot the node or restart nova-compute than to start
the evacuation, but if that fails X times, then trigger an evacuation, etc.)
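To make that scenario a bit more concrete, here is a rough sketch of the kind of policy an external monitor might apply. Everything in it is an assumption for illustration: `recover_host`, `RecoveryPolicy` and the three callables (`is_healthy`, `restart`, `evacuate`) are hypothetical names, not part of nova or any existing tool. The callables would have to be wired to whatever health check, restart mechanism and evacuate call the deployment actually uses.

```python
from dataclasses import dataclass
from typing import Callable


@dataclass
class RecoveryPolicy:
    # The "X" from the scenario: how many restarts to try before evacuating.
    # Hypothetical setting, not an existing nova option.
    max_restart_attempts: int = 3


def recover_host(
    host: str,
    is_healthy: Callable[[str], bool],
    restart: Callable[[str], None],
    evacuate: Callable[[str], None],
    policy: RecoveryPolicy = RecoveryPolicy(),
) -> str:
    """Try the cheap fix first; evacuate only if it keeps failing.

    Returns "healthy", "restarted" or "evacuated" so the caller can log
    or alert on the outcome.
    """
    if is_healthy(host):
        return "healthy"

    for _ in range(policy.max_restart_attempts):
        restart(host)  # e.g. reboot the node or restart nova-compute
        if is_healthy(host):
            return "restarted"

    # Every restart attempt failed: fall back to evacuating the host.
    evacuate(host)
    return "evacuated"
```

A real monitor would also need fencing, back-off between attempts and protection against evacuating a host that is only partitioned from the monitor, which is part of why this logic is hard to fit into a single component.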
Any thoughts or comments about this?
Regards
Leandro
[1] https://blueprints.launchpad.net/nova/+spec/vm-auto-ha-when-host-broken
[2] https://blueprints.launchpad.net/nova/+spec/evacuate-instance-automatically
[3] https://blueprints.launchpad.net/nova/+spec/resource-optimization-service