[OpenStack-Infra] Ask.o.o down

Joshua Hesketh joshua.hesketh at gmail.com
Tue Feb 14 08:11:20 UTC 2017


Hey Tom,

Where is that script being fired from (a quick grep doesn't find it), or is
it a tool people are using?

If it's a tool we'd need to make sure whoever is using it gets a new
version to rule it out.

 - Josh

On Tue, Feb 14, 2017 at 7:07 PM, Tom Fifield <tom at openstack.org> wrote:

> On 14/02/17 16:06, Joshua Hesketh wrote:
>
>> Hey,
>>
>> I've brought the service back up, but have no new clues as to why.
>>
>
> Cheers.
>
> Going to try: https://review.openstack.org/#/c/433478/
> to see if this script is culprit.
>
>
> - Josh
>>
>> On Tue, Feb 14, 2017 at 6:50 PM, Tom Fifield <tom at openstack.org
>> <mailto:tom at openstack.org>> wrote:
>>
>>     On 10/02/17 22:39, Jeremy Stanley wrote:
>>
>>         On 2017-02-10 16:08:51 +0800 (+0800), Tom Fifield wrote:
>>         [...]
>>
>>             Down again, this time with "Network is unreachable".
>>
>>         [...]
>>
>>         I'm not finding any obvious errors on the server nor relevant
>>         maintenance notices/trouble tickets from the service provider to
>>         explain this. I do see conspicuous gaps in network traffic volume
>>         and system load from ~06:45 to ~08:10 UTC according to cacti:
>>
>>             http://cacti.openstack.org/?tree_id=1&leaf_id=156
>>         <http://cacti.openstack.org/?tree_id=1&leaf_id=156>
>>
>>         Skipping back through previous days I find some similar gaps
>>         starting anywhere from 06:30 to 07:00 and ending between 07:00 and
>>         08:00 but they don't seem to occur every day and I'm not having
>> much
>>         luck finding a pattern. It _is_ conspicuously close to when
>>         /etc/cron.daily scripts get fired from the crontab so might
>> coincide
>>         with log rotation/service restarts? The graphs don't show these
>> gaps
>>         correlating with any spikes in CPU, memory or disk activity so it
>>         doesn't seem to be resource starvation (at least not for any
>> common
>>         resources we're tracking).
>>
>>
>>     Indeed. It's down again today during the same timeslot.
>>
>>     Another idea for the cron-based theory:
>>
>>     https://github.com/openstack/uc-recognition/blob/master/tool
>> s/get_active_moderator.py
>>     <https://github.com/openstack/uc-recognition/blob/master/too
>> ls/get_active_moderator.py>
>>
>>     loops through the list of Ask OpenStack users via the API on a cron
>>     running on www.openstack.org <http://www.openstack.org>. Not sure
>>     when that cron runs, but if it's similar, this could potentially be
>>     a high-load generator.
>>
>>
>>
>>
>>     Regards,
>>
>>
>>     Tom
>>
>>
>>     _______________________________________________
>>     OpenStack-Infra mailing list
>>     OpenStack-Infra at lists.openstack.org
>>     <mailto:OpenStack-Infra at lists.openstack.org>
>>     http://lists.openstack.org/cgi-bin/mailman/listinfo/openstack-infra
>>     <http://lists.openstack.org/cgi-bin/mailman/listinfo/openstack-infra>
>>
>>
>>
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.openstack.org/pipermail/openstack-infra/attachments/20170214/faa3f1fd/attachment.html>


More information about the OpenStack-Infra mailing list