[openstack-dev] [Ceilometer] Sharing the load test result

Deok-June Yi june.yi at samsung.com
Thu Jan 9 04:13:33 UTC 2014


Hi, guys.

> jay wrote:
> > So you are saying that the Synaps server is storing 14,400,000 samples
> > in memory (2 days of 5000 samples per minute)? Or are you saying that
> > Synaps is storing just the 5000 alarm records in memory and then
> > processing (determining if the alarm condition was met) the samples as
> > they pass through to a backend data store? I think it is the latter but
> > I just want to make sure :)
> 
> Swann wrote:
> > @jay : the first case seems to be impossible, no scalable .. I bet for 
> > the last :)

Jay and Swann, your guess is right.

Synaps holds samples in memory rolled up by 1 minute resolution in its 
sliding windows per stream. The size of sliding window is 5 minutes 
by default. It helps rolling samples up without DB read operation.

So, if there was no alarm, Synaps would hold 25,000 samples (5 
minutes of 5,000 samples per minute) in memory.

When a stream has alarms, its sliding window grows according to the 
longgest 'periods * evaluation periods + default window size' of its
alarms.

In the load test case, Synaps held 5,000 alarms and 70,000 samples 
(the recent 14 minutes of 5,000 samples) in memory as they pass 
through to a backend data store. Because the alarms had 3 minutes 
periods and 3 times of evaluation periods and default window size is 
5 minutes. (3 * 3 + 5 = 14)

Swann wrote:
> The Ceilo team will work on the improvements IIUC.
> I found two relevant links [1] [2]
> [1] https://wiki.openstack.org/wiki/Ceilometer/AlarmImprovements
> [2] https://etherpad.openstack.org/p/icehouse-summit-ceilometer-future-of-alarming

Thank you for the useful links. But I just want to point out that Synaps 
has already implemented some important things in the blueprint.

Swann wrote:
> @June Yi
> I am curious to know how have you generate load to Ceilometer with 
> Ganglia ?
> 
> what was the system usage of your servers during the 2 tests  ? cpu,
> mem, io..

Ganglia was just for collecting performance data. I used my own load 
generator script. Here I attach performance data collected by ganglia. 
Please keep in mind that evaluation throughput of Ceilometer was lower 
than Synaps.

> what are response time for alarm evaluations for Ceilometer, 50 seconds 
> in mean  ?

Mean(or average) is important. But in the aspect of real-time constraint, 
I think predictability is also important. I think that there are too many variable 
factors in alarm evaluation in current Ceilometer to adapt it as a solution of 
'monitoring as a service'.

Best regards,
June Yi

-------------- next part --------------
A non-text attachment was scrubbed...
Name: loadtest_result.png
Type: application/octet-stream
Size: 32768 bytes
Desc: not available
URL: <http://lists.openstack.org/pipermail/openstack-dev/attachments/20140109/0eb5842e/attachment-0001.obj>


More information about the OpenStack-dev mailing list