[Openstack-operators] Openstack telemetry stuff

Kris G. Lindgren klindgren at godaddy.com
Wed Dec 17 20:55:15 UTC 2014


Hell allo,

Coming back around to this.  I re-did our logstash.conf that we shared out ~7months ago [1] to now do deeper parsing of wgsi logs from: Nova, neutron, keystone, glance and nova-metadata services.  In our config we also have logstash outputting timing metrics (response time and number of request) to statsd, which is then stored in graphite.  I have included a kibana dashboard that is specific to just the API requests [2].  I have also included a grafana dashboard [3] for the graphite outputs.  This dashboard breaks out response time to upper/mean values for each service via get/post/delete ect ect and http status code (200-500).  This should let you see max/mean api response times for each service as well as get a count on the number of API requests during a time interval.

[1] https://github.com/godaddy/openstack-logstash
[2] https://github.com/godaddy/openstack-logstash/blob/master/API-Stats.json
[3] https://github.com/godaddy/openstack-logstash/blob/master/grafana.json

If you use this and have some comments/improvements - please feel free to send a pull request and/or contact me or Mike Dorman directly.

We also ended up using diamond to gather compute node statistics along with per vm stats running on each compute node.
____________________________________________

Kris Lindgren
Senior Linux Systems Engineer
GoDaddy, LLC.



On Mon, Nov 10, 2014 at 10:23 PM, Kris G. Lindgren <klindgren at godaddy.com<mailto:klindgren at godaddy.com>> wrote:
Hello Operators,

Was wondering what you are using to gather Openstack telemetry metrics?

Was looking at things around Openstack serivce api requests/s, response times (if possible), errors/s, Rabbitmq metrics, if possible pending or tasks that are in progress, ect ect.   Basically your more advanced and yet basic monitoring around the openstack services.  We run an ELK (elastic search, logstash, kibana) infrastructure and was wondering if anyone was a statsd/graphite output config for openstack log data -> graphite to gather some of the metrics?  If not how are you currently doing it?

Additionally, can anyone share what you are doing to get hypervisor health/vm statistics?  We are running ceilometer, but I haven't been very happy with the results.  I am also thinking something like for statsd/graphite here as well.  But if you have something that works for you and can share it.  Please DO!!  ANYTHING is welcome.
____________________________________________

Kris Lindgren
Senior Linux Systems Engineer
GoDaddy, LLC.

This email message and any attachment(s) hereto are intended for use only by its intended recipient(s) and may contain confidential information. If you have received this email in error, please immediately notify the sender and permanently delete the original and any copy of this message and its attachments.


_______________________________________________
OpenStack-operators mailing list
OpenStack-operators at lists.openstack.org<mailto:OpenStack-operators at lists.openstack.org>
http://lists.openstack.org/cgi-bin/mailman/listinfo/openstack-operators


-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.openstack.org/pipermail/openstack-operators/attachments/20141217/eab54194/attachment.html>


More information about the OpenStack-operators mailing list