[openstack-dev] [Swift] failure node muting not working

Zhou, Yuan yuan.zhou at intel.com
Thu Jul 4 01:18:08 UTC 2013


Got it. So with the default settings, Swift will try a muted node again after 60 seconds. Thanks.

-yuanz

-----Original Message-----
From: John Dickinson [mailto:me at not.mn] 
Sent: Thursday, July 04, 2013 1:23 AM
To: OpenStack Development Mailing List
Subject: Re: [openstack-dev] [Swift] failure node muting not working

Take a look at the proxy config, starting here: https://github.com/openstack/swift/blob/master/etc/proxy-server.conf-sample#L70

The error_suppression_interval and error_suppression_limit settings control the window you are looking for. With the default values (error_suppression_limit = 10, error_suppression_interval = 60), 10 errors in 60 seconds will prevent the proxy from using that particular storage node for another 60 seconds.
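
To make the window concrete, here is a minimal Python sketch of the behavior described above. It is not Swift's actual implementation: the ErrorLimiter class, its method names, and the injectable clock are hypothetical names chosen for illustration, and only the two default values (10 and 60) come from the proxy config.

```python
import time


class ErrorLimiter:
    """Hypothetical, simplified model of per-node error suppression.

    Not Swift's code: it only mirrors the description that `limit`
    errors within `interval` seconds mute a node for `interval` seconds.
    """

    def __init__(self, limit=10, interval=60.0, clock=time.monotonic):
        self.limit = limit          # stands in for error_suppression_limit
        self.interval = interval    # stands in for error_suppression_interval
        self.clock = clock          # injectable so the window can be tested
        self._state = {}            # node -> (error_count, last_error_time)

    def record_error(self, node):
        """Count one error, starting over if the last error is too old."""
        now = self.clock()
        count, last = self._state.get(node, (0, now))
        if now - last > self.interval:
            count = 0
        self._state[node] = (count + 1, now)

    def is_muted(self, node):
        """Return True while the node should be skipped."""
        entry = self._state.get(node)
        if entry is None:
            return False
        count, last = entry
        if self.clock() - last > self.interval:
            # The window has expired: forget the errors and retry the node.
            del self._state[node]
            return False
        return count >= self.limit


if __name__ == "__main__":
    fake_now = [0.0]
    limiter = ErrorLimiter(clock=lambda: fake_now[0])
    for _ in range(10):
        limiter.record_error("SN1")
    print(limiter.is_muted("SN1"))   # muted right after the 10th error
    fake_now[0] = 61.0
    print(limiter.is_muted("SN1"))   # eligible again once the window expires
```

Under this model a node that stays down is retried once the window expires, so a fresh burst of timeout errors would be expected in the proxy log roughly every interval rather than never again.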

--John



On Jul 2, 2013, at 8:57 PM, "Zhou, Yuan" <yuan.zhou at intel.com> wrote:

> Hi lists,
>  
> We're trying to evaluate how Swift performs when a node fails.
> According to the docs, Swift should be able to mute failed nodes:
> 'if a storage node does not respond in a reasonable amount of time, the proxy considers it to be unavailable and will not attempt to communicate with it for a while.'
>  
> We ran a simple test on a 5-node cluster:
> 1. Use COSBench to keep downloading files from the cluster.
> 2. Stop networking on SN1. Lots of 'connection timeout 0.5s' errors then appear in the proxy's log.
> 3. Keep the workload running and wait for about 1 hour.
> 4. The same error still occurs in the proxy, which means the node is not muted. We expected SN1 to be muted on the proxy side, with no more 'connection timeout' errors in the proxy log.
>  
> Is there any special setup needed to use this feature?
>  
> Regards, -yuanz
>  



