Hello Timothé
Since you are already using Grafana, you can add Openstack's mariadb Database connection as a data source which will help you collect basic information on the capacity of your cloud ecosystem.
Another add on you might like to have is benchmarking your openstack services as a part of regular health checks. For this Rally Openstack project can easily be utilised, it will help you capture service uptime and response time.
I hope this information might help you with your use case.
Hello Stackers,
I would like to know what are you all using to monitor and supervise your Openstack clusters ?
We are in the process of setting up our own private cloud based on RedHat OpenStack Platform 17.1, we choose not to go with the RedHat Service Telemetry Platform, and were strongly advise against Ceilometer and aodh.
At the moment, we built our own stack based on Prometheus (with the plenty of exporters) for the metrics, Graylog + OpenSearch for the logs, and Grafana for the visualisation.
For now, we are only looking to retrieve basic information in order to know if a dysfunction occurs, but the end goal might be to go as far as to be able to count the cpu/mem/disk usage per hour per vm per project, etc.
So, what are you all using and how did you implement it ?
Best regards,