[neutron][ops] API for viewing HA router states

Mohammed Naser mnaser at vexxhost.com
Mon Aug 17 12:01:55 UTC 2020


Hi all,

Over the past few days, we were troubleshooting an issue that ended up
having a root cause where keepalived has somehow ended up active in
two different L3 agents.  We've yet to find the root cause of how this
happened but removing it and adding it resolved the issue for us.

As we work on improving our monitoring, we wanted to implement
something that gets us the info of # of active routers to check if
there's a router that has >1 active L3 agent but it's hard because
hitting the /l3-agents endpoint on _every_ single router hurts a lot
on performance.

Is there something else that we can watch which might be more
productive?  FYI -- this all goes in the open and will end up inside
the openstack-exporter:
https://github.com/openstack-exporter/openstack-exporter and the Helm
charts will end up with the alerts:
https://github.com/openstack-exporter/helm-charts

Thanks!
Mohammed

-- 
Mohammed Naser
VEXXHOST, Inc.



More information about the openstack-discuss mailing list