[openstack-dev] [ceilometer] The reset on the cumulative counter

Doug Hellmann doug.hellmann at dreamhost.com
Wed Nov 28 12:18:43 UTC 2012


On Wed, Nov 28, 2012 at 7:04 AM, Julien Danjou <julien at danjou.info> wrote:

> On Wed, Nov 28 2012, Doug Hellmann wrote:
>
> > There's a race condition, though. If you and I are processing events at
> the
> > same time, we get the same "last value" from the storage but then try to
> > update it to our current value.
>
> Right. Now that I think about it, that problems with multiple collector
> would also exists if we try to transform cumulative to delta on
> notifications.
>

Right. That's why it has to be done at the source, where the data is being
collected.

I talked about this a bit with the folks from Yahoo at the summit. They're
using this approach in their own monitoring, and seeing good results. They
use a little sqlite db on the compute node to persist the last few samples
from the meters so they can detect resets and the agent emits delta values
instead of cumulative values.

We have a further advantage in that we're already plugging into nova to
trigger stats collection. If we add a few more notification hooks for cases
where these counters get reset, we can be sure of detecting the reset no
matter what the polling interval is.

I think applying both techniques would give us the results we want.

Doug


>
> --
> Julien Danjou
> /* Free Software hacker & freelance
>    http://julien.danjou.info */
>
> _______________________________________________
> OpenStack-dev mailing list
> OpenStack-dev at lists.openstack.org
> http://lists.openstack.org/cgi-bin/mailman/listinfo/openstack-dev
>
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.openstack.org/pipermail/openstack-dev/attachments/20121128/1f0a10b3/attachment.html>


More information about the OpenStack-dev mailing list