[openstack-dev] [Fuel] fuel master monitoring

Przemyslaw Kaminski pkaminski at mirantis.com
Thu Nov 6 11:20:36 UTC 2014


I didn't mean a robust monitoring system, just something simpler. 
Notifications is a good idea for FuelWeb.

P.

On 11/06/2014 09:59 AM, Anton Zemlyanov wrote:
> We can add a notification to FuelWeb, no additional software or user 
> actions are required. I would not overestimate this method though, it 
> is in no way the robust monitoring system. Forcing user to do 
> something on a regular basis is unlikely to work.
>
> Anton
>
> On Thu, Nov 6, 2014 at 11:55 AM, Przemyslaw Kaminski 
> <pkaminski at mirantis.com <mailto:pkaminski at mirantis.com>> wrote:
>
>     I think we're missing the point here. What I meant adding a simple
>     monitoring system that informed the user via UI/CLI/email/whatever
>     of low resources on fuel master node. That's it. HA here is not an
>     option -- if, despite of warnings, the user still continues to use
>     fuel and disk becomes full, it's the user's fault. By adding these
>     warnings we have a way of saying "We told you so!" Without
>     warnings we get bugs like [1] I mentioned in the first post.
>
>     Of course user can check disk space by hand but since we do have a
>     full-blown UI telling the user to periodically log in to the
>     console and check disks by hand seems a bit of a burden.
>
>     We can even implement such monitoring functionality as a Nailgun
>     plugin -- installing it would be optional and at the same time we
>     would grow our plugin ecosystem.
>
>     P.
>
>
>     On 11/05/2014 08:42 PM, Dmitry Borodaenko wrote:
>>     Even one additional hardware node required to host the Fuel
>>     master is seen by many users as excessive. Unless you can come up
>>     with an architecture that adds HA capability to Fuel without
>>     increasing its hardware footprint by 2 more nodes, it's just not
>>     worth it.
>>
>>     The only operational aspect of the Fuel master node that you
>>     don't want to lose even for a short while is logging. You'd be
>>     better off redirecting OpenStack environments' logs to a
>>     dedicated highly available logging server (which, of course, you
>>     already have in your environment), and deal with Fuel master node
>>     failures by restoring it from backups.
>>
>>     On Wed, Nov 5, 2014 at 8:26 AM, Anton Zemlyanov
>>     <azemlyanov at mirantis.com <mailto:azemlyanov at mirantis.com>> wrote:
>>
>>         Monitoring of the Fuel master's disk space is the special
>>         case. I really wonder why Fuel master have no HA option, disk
>>         overflow can be predicted but many other failures cannot. HA
>>         is a solution of the 'single point of failure' problem.
>>
>>         The current monitoring recommendations
>>         (http://docs.openstack.org/openstack-ops/content/logging_monitoring.html)
>>         are based on analyzing logs and manual checks, that are
>>         rather reactive way of fixing problems. Zabbix is quite good
>>         for preventing failures that are predictable but for the
>>         abrupt problems Zabbix just reports them 'post mortem'.
>>
>>         The only way to remove the single failure point is to
>>         implement redundancy/HA
>>
>>         Anton
>>
>>         On Tue, Nov 4, 2014 at 6:26 PM, Przemyslaw Kaminski
>>         <pkaminski at mirantis.com <mailto:pkaminski at mirantis.com>> wrote:
>>
>>             Hello,
>>
>>             In extension to my comment in this bug [1] I'd like to
>>             discuss the possibility of adding Fuel master node
>>             monitoring. As I wrote in the comment, when disk is full
>>             it might be already too late to perform any action since
>>             for example Nailgun could be down because DB shut itself
>>             down. So we should somehow warn the user that disk is
>>             running low (in the UI and fuel CLI on stderr for
>>             example) before it actually happens.
>>
>>             For now the only meaningful value to monitor would be
>>             disk usage -- do you have other suggestions? If not then
>>             probably a simple API endpoint with statvfs calls would
>>             suffice. If you see other usages of this then maybe it
>>             would be better to have some daemon collecting the stats
>>             we want.
>>
>>             If we opted for a daemon, then I'm aware that the user
>>             can optionally install Zabbix server although looking at
>>             blueprints in [2] I don't see anything about monitoring
>>             Fuel master itself -- is it possible to do? Though the
>>             installation of Zabbix though is not mandatory so it
>>             still doesn't completely solve the problem.
>>
>>             [1] https://bugs.launchpad.net/fuel/+bug/1371757
>>             [2]
>>             https://blueprints.launchpad.net/fuel/+spec/monitoring-system
>>
>>             Przemek
>>
>>             _______________________________________________
>>             OpenStack-dev mailing list
>>             OpenStack-dev at lists.openstack.org
>>             <mailto:OpenStack-dev at lists.openstack.org>
>>             http://lists.openstack.org/cgi-bin/mailman/listinfo/openstack-dev
>>
>>
>>
>>         _______________________________________________
>>         OpenStack-dev mailing list
>>         OpenStack-dev at lists.openstack.org
>>         <mailto:OpenStack-dev at lists.openstack.org>
>>         http://lists.openstack.org/cgi-bin/mailman/listinfo/openstack-dev
>>
>>
>>
>>
>>     -- 
>>     Dmitry Borodaenko
>>
>>
>>     _______________________________________________
>>     OpenStack-dev mailing list
>>     OpenStack-dev at lists.openstack.org  <mailto:OpenStack-dev at lists.openstack.org>
>>     http://lists.openstack.org/cgi-bin/mailman/listinfo/openstack-dev
>
>
>     _______________________________________________
>     OpenStack-dev mailing list
>     OpenStack-dev at lists.openstack.org
>     <mailto:OpenStack-dev at lists.openstack.org>
>     http://lists.openstack.org/cgi-bin/mailman/listinfo/openstack-dev
>
>
>
>
> _______________________________________________
> OpenStack-dev mailing list
> OpenStack-dev at lists.openstack.org
> http://lists.openstack.org/cgi-bin/mailman/listinfo/openstack-dev

-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.openstack.org/pipermail/openstack-dev/attachments/20141106/8e300908/attachment.html>


More information about the OpenStack-dev mailing list