[Openstack] Problems with drbd + pacemaker in HA

Razique Mahroua razique.mahroua at gmail.com
Tue Feb 19 08:54:51 UTC 2013


+1
From the HA guide : 

> 4 Steps to solve the Split-Brain
> 
> Manually choose a node which data modifications will be discarded. We call it the split brain victim. 
> Choose wisely, all modifications will be lost! When in doubt run a backup of the victim's data before you continue.
> 
> When running a Pacemaker cluster, you can enable maintenance mode. If the split brain victim is in Primary role, bring down all applications using this resource. 
> Now switch the victim to Secondary role:
> 
> victim# drbdadm secondary resource
> victim# drbdadm disconnect resource
> victim# drbdadm -- --discard-my-data connect resource
> 
> for DRBD 8.4.x:
> victim# drbdadm connect --discard-my-data resource
> 
> Resync will start automatically if the survivor was in WFConnection network state. 
> If the split brain survivor is still in Standalone connection state, reconnect it:
> survivor# drbdadm conn
 :)

Razique Mahroua - Nuage & Co
razique.mahroua at gmail.com
Tel : +33 9 72 37 94 15



Le 19 févr. 2013 à 09:51, Sébastien Han <han.sebastien at gmail.com> a écrit :

> Hi,
> 
> If you stop the resource this is perfectly normal to get a resource diskless and unconfigured since you asked pacemaker to stop it. You just need to reconnect properly the resource 0 (guess it's the mysql one). For this, it's more or less the same operation as this one: http://www.drbd.org/users-guide-8.3/s-resolve-split-brain.html
> 
> Cheers 
> 
> --
> Regards,
> Sébastien Han.
> 
> 
> On Tue, Feb 19, 2013 at 2:38 AM, Samuel Winchenbach <swinchen at gmail.com> wrote:
> Hi All,
> 
> I recently switched from CentOS 6.3 to Ubuntu LTS server and have started encountering some really odd problems with drbd and pacemaker.   everything seems to work well until I stop a resource or migrate itl at which point drbd either becomes "Diskless" or "Unconfigured".  I am not sure what is happening but I never had this problem in CentOS.   
> 
> pacemaker configuration:  http://paste2.org/p/2905943
> drbd configuration: http://paste2.org/p/2905965
> 
> syslog output: http://paste2.org/p/2905988
> drbd state problems: http://paste2.org/p/2905951
> 
> 
> If someone could take a quick look at this I would be very grateful, I have been fighting this for a couple days now.
> 
> Thanks,
> Sam  
> 
> 
> _______________________________________________
> Mailing list: https://launchpad.net/~openstack
> Post to     : openstack at lists.launchpad.net
> Unsubscribe : https://launchpad.net/~openstack
> More help   : https://help.launchpad.net/ListHelp
> 
> 
> _______________________________________________
> Mailing list: https://launchpad.net/~openstack
> Post to     : openstack at lists.launchpad.net
> Unsubscribe : https://launchpad.net/~openstack
> More help   : https://help.launchpad.net/ListHelp

-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.openstack.org/pipermail/openstack/attachments/20130219/c214d607/attachment.html>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: NUAGECO-LOGO-Fblan_petit.jpg
Type: image/jpeg
Size: 10122 bytes
Desc: not available
URL: <http://lists.openstack.org/pipermail/openstack/attachments/20130219/c214d607/attachment.jpg>


More information about the Openstack mailing list