[Openstack] [swift] Storage node failure modes

Mark Kirkwood mark.kirkwood at catalyst.net.nz
Tue Jun 28 00:36:01 UTC 2016


Hi,

I'm in the process of documenting failure modes (for ops documentation 
etc). Now I think I understand the intent:

- swift tries to ensure you always have the configured number of replicas

In the case of missing or unmounted devices I'm seeing the expected 
behaviour, i.e.:

- new object creation results in the configured number of replicas (some 
stored on handoff nodes)
- existing objects are replicated to handoff nodes to restore the correct 
replica count

In the case of a node (or a region) being down I'm *not* seeing analogous 
behaviour for *existing* objects, i.e. I am a replica down after shutting 
down one of my nodes and waiting a while.
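
To make "a replica down" concrete, here is a minimal sketch of how the 
copies of a single object could be tallied during such a test. It assumes 
the status codes come from HEAD requests sent directly to each object 
server; the helper name count_replicas, the node names and the status 
values are all hypothetical, for illustration only, and are not measured 
output from my cluster.

```python
# Hypothetical helper: tally which nodes hold a copy of one object.
# In a real test the status codes would come from HEAD requests sent
# directly to each object server (primaries and handoffs); here they
# are hard-coded example values.

def count_replicas(head_status_by_node):
    """Return the sorted names of nodes whose HEAD status is 2xx."""
    return sorted(
        node
        for node, status in head_status_by_node.items()
        if status is not None and 200 <= status < 300
    )


# Example values only (not measured). None means the node did not answer.
observed = {
    "node1": 200,
    "node2": 200,
    "node3": None,  # shut down
    "node4": 404,   # reachable, but holds no copy
}

holders = count_replicas(observed)
print(len(holders), holders)
```

Comparing the length of that list with the configured replica count, 
before and after a replication pass, shows whether the missing copy was 
ever recreated elsewhere.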

I am testing using swift 2.7 on a small cluster of VMs (4 nodes, 4 
devices, 2 regions) - now it may be that my setup is just too trivial 
(or maybe I haven't waited long enough for swift to decide my storage node 
is really down). Any thoughts? I'd like to understand precisely what is 
supposed to happen when a node (and also an entire region) is unavailable.

Cheers

Mark



