[Openstack-operators] [Ceilometer] Real world experience with Ceilometer deployments - Feedback requested
Matt Joyce
matt at nycresistor.com
Thu Feb 12 16:17:48 UTC 2015
I thought stacktach was more in the vein of diagnostic. Not billable resources.
On Feb 12, 2015 10:47 AM, Tim Bell <Tim.Bell at cern.ch> wrote:
>
> Does anyone have any proposals regarding
>
> > - Possible replacements for Ceilometer that you have used instead
>
> It seems that many sites have written their own systems. The stacktach/monasca teams are due to demo to the operators meetup in Philadelphia in March.
>
> Does anyone have experience to share comparing ceilometer with stacktach ?
>
> Tim
>
> > -----Original Message-----
> > From: Daniele Venzano [mailto:daniele.venzano at eurecom.fr]
> > Sent: 12 February 2015 12:24
> > To: openstack-operators at lists.openstack.org
> > Subject: Re: [Openstack-operators] [Ceilometer] Real world experience with
> > Ceilometer deployments - Feedback requested
> >
> > Unfortunately, I can only confirm the sorry state of Ceilometer.
> > We tried it on a very small setup (6 compute nodes) and run in so many issues,
> > we dropped it and created our own solution based on a mix of scripts that read
> > from the nova/neutron DB, iptables and collectd data. No need for more
> > collection agents than what we are already running for the systems monitoring.
> >
> > We tried the version in Havana and, later, in Icehouse. For starters the
> > documentation was suggesting MySQL as default backend. MySQL will last just a
> > few days and then break down under the size of the tables. We tried MongoDB,
> > but were still not satisfied with performance on such a small cluster.
> > Then there is the metering agent. It is yet another daemon, not integrated in
> > Neutron and there is no documentation about what it is actually measuring.
> > What if I have multiple routers? Ingress and Egress? From which point of view?
> > The same applies to Cinder, it requires and external agent (to be run via cron!).
> >
> > Some metrics were not recorded, we couldn't understand why and, again, no
> > documentation and no tooling to help us understand whether we were just
> > missing some config options somewhere in nova-compute or there was some
> > other problem with KVM/libvirt versions.
> > And even when we had some data and wanted to generate just a proof-of-
> > concept report with some information about tenant resource usage, we found
> > problems with the API. The fact that no one had bothered to write a simple
> > proof of concept script that uses the API to actually do something useful was
> > really off-putting.
> >
> > We had to dig in libvirt to understand what some of the metrics actually mean.
> > We found that we could read those same metrics from our (more efficient, well-
> > known) monitoring system.
> >
> > For some time we run just the agents and aggregated the data in an
> > elasticsearch instance through the UDP msgpack pipeline (more bugs, message
> > format is inconsistent, different agents generate different fields, in slightly
> > different formats).
> > It works. But for our needs it was just too much work. Most of the data is
> > already available from other sources with well-known APIs.
> >
> > Ah, also there is a long standing bug open: Sahara and Ceilometer cannot be
> > used together. And we use Sahara.
> >
> > I opened bugs for some of these issues, but since then I lost interest.
> >
> > In the end, I think it really depends on what kind of data you need and what
> > (developer) resources you can throw at the problem.
> > Unless in Juno things changed dramatically, Ceilometer will not work out of the
> > box. You will have to lose time because of the non-existent documentation, you
> > will have to develop code and scripts anyway and finally you will have to create
> > something between your billing system and the ceilometer API, because to the
> > best of my knowledge there is nothing that uses it.
> >
> > eBay has the resources to do all that. We don't.
> >
> >
> >
> > -----Original Message-----
> > From: George Shuklin [mailto:george.shuklin at gmail.com]
> > Sent: Thursday 12 February 2015 02:59
> > To: openstack-operators at lists.openstack.org
> > Subject: Re: [Openstack-operators] [Ceilometer] Real world experience with
> > Ceilometer deployments - Feedback requested
> >
> > Ceilometer is in sad state.
> >
> > 1. Collector leaks memory. We ran it on same host with mongo, and it grab
> > 29Gb out of 32, leaving mongo with less than gig memory available.
> > 2. Metering agent cause huge load on neutron-server. o(n) of metering rules and
> > tenants. Few bugs reported, one bugfix in review.
> > 3. Metering agent simply do no work on multi-network-nodes installation.
> > It exepects all routers be on same host. Fixed or not - I don't know, we have our
> > own crude fix.
> > 4. Many rough edges. Ceilometer much less tested than nova. Sometimes it
> > traces and skip counting. Fresh example: if metadata has '.' in the name,
> > ceilometer trace on it and did not count in glance usage.
> > 5. Very slow on reports (using mongo's mapreduce).
> >
> > Overall feeling: barely usable, but with my experience with cloud billings, not the
> > worst thing I saw in my life.
> >
> > About load: except reporting and memory leaks, it use rather small amount of
> > resources.
> >
> > On 02/11/2015 09:37 PM, Maish Saidel-Keesing wrote:
> > > Is Ceilometer ready for prime time?
> > >
> > > I would be interested in hearing from people who have deployed
> > > OpenStack clouds with Ceilometer, and their experience. Some of the
> > > topics I am looking for feedback on are:
> > >
> > > - Database Size
> > > - MongoDB management, Sharding, replica sets etc.
> > > - Replication strategies
> > > - Database backup/restore
> > > - Overall useability
> > > - Gripes, pains and problems (things to look out for)
> > > - Possible replacements for Ceilometer that you have used instead
> > >
> > >
> > > If you are willing to share - I am sure it will be beneficial to the
> > > whole community.
> > >
> > > Thanks in Advance
> > >
> > >
> > > With best regards,
> > >
> > >
> > > Maish Saidel-Keesing
> > > Platform Architect
> > > Cisco
> > >
> > >
> > >
> > >
> > > _______________________________________________
> > > OpenStack-operators mailing list
> > > OpenStack-operators at lists.openstack.org
> > > http://lists.openstack.org/cgi-bin/mailman/listinfo/openstack-operator
> > > s
> >
> >
> > _______________________________________________
> > OpenStack-operators mailing list
> > OpenStack-operators at lists.openstack.org
> > http://lists.openstack.org/cgi-bin/mailman/listinfo/openstack-operators
> >
> >
> > _______________________________________________
> > OpenStack-operators mailing list
> > OpenStack-operators at lists.openstack.org
> > http://lists.openstack.org/cgi-bin/mailman/listinfo/openstack-operators
>
> _______________________________________________
> OpenStack-operators mailing list
> OpenStack-operators at lists.openstack.org
> http://lists.openstack.org/cgi-bin/mailman/listinfo/openstack-operators
More information about the OpenStack-operators
mailing list