[pgpool-general: 2080] Re: Suggestion about two datacenters connected through WAN

Tatsuo Ishii ishii at postgresql.org
Tue Aug 27 07:41:04 JST 2013


> Hi Tatsuo.
> I resolved the issue with streaming replication and the corrupted
> postmaster.pid by formatting the disk mounted as the PostgreSQL data
> directory. There was no other way.
> Hopefully it is now resolved.
> 
> The only explanation for the pid file issue was file system corruption, as
> you suggested.
> 
> Nevertheless I would like to continue setting up Pgpool-II.
> The streaming replication from TC1 to TC2 is working now.
> Pgpool is installed on node 1 together with PostgreSQL in TC1, and on
> node 1 together with PostgreSQL in TC2.
> 
> From the Pgpool manual pages I understand how it would work when TC1 goes down
> and Pgpool sits in front of TC1, so Pgpool would be available at all times:
> 1. TC1 goes down.
> 2. Pgpool determines that the PostgreSQL master is unavailable and creates a
> trigger file, which makes PostgreSQL in TC2 R/W.
> 
> But in my case PostgreSQL and Pgpool are installed on the same server, so when
> TC1 goes down, the Pgpool in TC1 goes down with it.

No problem. The pgpool in TC2 should find that PostgreSQL in TC1 is not
available and promote PostgreSQL in TC2.
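
As a rough sketch of how this is usually wired up (the paths, host names and
passwordless-SSH setup below are illustrative assumptions, not taken from your
environment): failover_command in pgpool.conf runs a script whenever a backend
fails, and that script creates the trigger file named by trigger_file in the
standby's recovery.conf:

    # pgpool.conf (same on both pgpool nodes)
    failover_command = '/etc/pgpool-II/failover.sh %d %h %P %H'

    #!/bin/sh
    # /etc/pgpool-II/failover.sh -- minimal sketch, called by pgpool on failure
    failed_node_id=$1     # %d: id of the backend that failed
    failed_host=$2        # %h: host name of the failed backend
    old_primary_id=$3     # %P: id of the old primary node
    new_master_host=$4    # %H: host pgpool chose as the new master
    # Promote the standby only if it was the primary that failed
    if [ "$failed_node_id" = "$old_primary_id" ]; then
        # the standby's recovery.conf must contain e.g.
        #   trigger_file = '/var/lib/pgsql/data/trigger'
        ssh postgres@"$new_master_host" "touch /var/lib/pgsql/data/trigger"
    fi
    exit 0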

> I don't understand the fail-over procedure when using the secondary Pgpool
> installed in TC2. Does the Pgpool in TC2 have the same configuration?

Yes.
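
Concretely, both pgpool.conf files would carry the same backend definitions,
something along these lines (host names and directories are placeholders):

    backend_hostname0       = 'node1-tc1'    # current primary
    backend_port0           = 5432
    backend_weight0         = 1
    backend_data_directory0 = '/var/lib/pgsql/data'

    backend_hostname1       = 'node1-tc2'    # streaming-replication standby
    backend_port1           = 5432
    backend_weight1         = 1
    backend_data_directory1 = '/var/lib/pgsql/data'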

> Isn't this configuration prone to split-brain situations when a network
> failure occurs on the WAN link? In that case the Pgpool in TC2 promotes its
> PostgreSQL to master, and then there will be an R/W master in TC1 and also in
> TC2. Or am I wrong?

Pgpool cannot deal with the split-brain case, because the network between
TC1 and TC2 is completely down. The same can be said of any HA software I
know of.

One thing you might want to use for this is "trusted_servers". Usually you
register router IPs you trust for this (pgpool should be able to reach the
routers). In the split-brain case, if the pgpool in TC1 or TC2 finds that it
can no longer reach those IPs, it commits suicide to avoid entering a
split-brain state.
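
For example, in pgpool.conf on each pgpool node (the addresses are placeholders
for your own routers, and the watchdog is assumed to be enabled since
trusted_servers belongs to it):

    # pgpool watchdog: hosts pinged to decide whether this pgpool's own
    # network is still alive (typically the upstream routers in each TC)
    use_watchdog    = on
    trusted_servers = '192.0.2.1,198.51.100.1'

    # Health checking, so that a dead PostgreSQL in the other TC is detected
    health_check_period  = 10
    health_check_timeout = 20
    health_check_user    = 'postgres'

A pgpool that can no longer reach any trusted server assumes its own network is
broken and stops itself instead of promoting its local standby.
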
--
Tatsuo Ishii
SRA OSS, Inc. Japan
English: http://www.sraoss.co.jp/index_en.php
Japanese: http://www.sraoss.co.jp

> Best regards,
> Michal Mistina
> -----Original Message-----
> From: pgpool-general-bounces at pgpool.net
> [mailto:pgpool-general-bounces at pgpool.net] On Behalf Of Mistina Michal
> Sent: Monday, August 19, 2013 9:14 AM
> To: Tatsuo Ishii
> Cc: pgpool-general at pgpool.net
> Subject: [pgpool-general: 2051] Re: Suggestion about two datacenters
> connected through WAN
> 
> Hi Tatsuo.
> Thank you for the reply.
>>> Yes, I am using DRBD. But there is a layer on top, Pacemaker, which handles
>>> DRBD. It works with what are called "resources"; by a resource we mean a
>>> service. In my case the services are: VirtualIP address, PostgreSQL,
>>> FileSystem (XFS), DRBD, LVM. Pacemaker controls the order of stopping
>>> and starting services on each node. The order of stopping services in
>>> my environment is (on each node): turn off the VirtualIP address
>>> -> stop PostgreSQL -> unmount the file system -> deactivate the LVM volume
>>> groups. The DRBD device was in the primary role on node 1. As the last
>>> step (after everything is stopped) DRBD is promoted to the primary
>>> role on node 2 and demoted to the secondary role on node 1. Then each
>>> service is started in reverse order on node 2.
>>>
>>> This worked until I attached a streaming-replication slave.
>>>
>>> I read that pgpool can also run as a resource, so that it is started as
>>> the last service after PostgreSQL.
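
(For readers following along, a minimal Pacemaker crm-shell sketch of the stack
described above might look like the following; the resource names, volume
group, DRBD resource and address are invented for illustration.)

    primitive p_drbd ocf:linbit:drbd \
            params drbd_resource="pgdata" op monitor interval="30s"
    ms ms_drbd p_drbd meta master-max="1" clone-max="2" notify="true"
    primitive p_lvm ocf:heartbeat:LVM params volgrpname="vg_pg"
    primitive p_fs  ocf:heartbeat:Filesystem \
            params device="/dev/vg_pg/lv_pg" directory="/var/lib/pgsql/data" fstype="xfs"
    primitive p_pgsql ocf:heartbeat:pgsql
    primitive p_vip ocf:heartbeat:IPaddr2 params ip="192.0.2.10" cidr_netmask="24"
    # Start order: LVM -> FileSystem -> PostgreSQL -> VirtualIP; stopping reverses it
    group g_pg p_lvm p_fs p_pgsql p_vip
    # The group runs only where DRBD is Master, and only after promotion
    colocation col_pg_on_drbd inf: g_pg ms_drbd:Master
    order ord_pg_after_drbd inf: ms_drbd:promote g_pg:start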
> 
>> So the PostgreSQL database cluster directory is on the file system managed
>> by Pacemaker? That means the file system is initially not mounted on the
>> secondary node, which means PostgreSQL on the secondary node cannot start
>> because there's no file system. Am I missing something?
> 
> Yes, you are correct. The PostgreSQL data directory resides on the file system
> managed by Pacemaker. At any given moment there is only one PostgreSQL
> running, on one node, and the file system is mounted on only one node. File
> system corruption can only occur under split-brain conditions, when both nodes
> are automatically promoted to the DRBD primary role without being aware that
> the other node is also primary. If Pacemaker fails to stop the PostgreSQL
> server on the first node (as happened here), it does not continue by starting
> it on the secondary node, where the file system is not mounted.
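
One common guard against exactly this kind of DRBD split brain (a sketch only;
the resource name and recovery policies are illustrative assumptions, not taken
from the existing configuration) is resource-level fencing through Pacemaker
plus explicit after-split-brain policies in the DRBD resource:

    resource pgdata {
        disk {
            fencing resource-only;
        }
        handlers {
            fence-peer          "/usr/lib/drbd/crm-fence-peer.sh";
            after-resync-target "/usr/lib/drbd/crm-unfence-peer.sh";
        }
        net {
            after-sb-0pri discard-zero-changes;
            after-sb-1pri discard-secondary;
            after-sb-2pri disconnect;
        }
    }

crm-fence-peer.sh adds a location constraint so the disconnected peer cannot be
promoted, which is what prevents the dual-primary corruption described above.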
> 
>>> We needed some level of "dumb resistance", meaning that if somebody
>>> accidentally turns off or reboots one node, the services automatically
>>> start on node 2. If somebody then reboots node 2 while node 1 is up, the
>>> services are automatically started on node 1.
>>>
>>> Do you think 4 nodes can be handled by pgpool with some level of that
>>> "dumb resistance"? I saw pgpool has the option to set a weight on
>>> nodes. Is it a good option for marking which servers are in the primary
>>> technical center?
>>> TC1: node 1, node 2
>>> TC2: node 3, node 4
> 
>>The "weight" parameter is nothing to do with making particular server
>>as
> "the primary technical center". Do you have any well defined policy to
> promote which node when the primary node goes down? I mean, you have 3
> candidates (node2, 3, 4) to be promoted and you need to decide which one
> should be promoted in the case. If you have a well defined policy, which a
> computer program can implement, you could set up "dumb Resistance" system.
> 
> Do you mean within Pacemaker? The Pacemaker in TC1 does not know about the
> Pacemaker in TC2, therefore TC1 does not influence operation in TC2.
> Within TC1, stopping the resources with Pacemaker failed at the step of
> stopping the PostgreSQL resource. Therefore it did not continue to stop the
> other resources (unmounting the file system, ...) and did not start the
> resources on node 2.
> 
> If you mean project policies, they are written like this:
> 1. Automatic fail-over should first take place within TC1.
> 2. If the whole of TC1 goes down (e.g. it is flooded), TC2 should take over.
> If it is possible to do this automatically, we should use that; if not, a
> manual fail-over can do it.
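
A policy like the one above is exactly the kind of thing a fail-over script can
encode. A rough sketch (host names, paths and the SSH setup are invented; such
a script could be hooked into pgpool's failover_command or run manually):

    #!/bin/sh
    # promote_by_policy.sh -- promote the first reachable standby,
    # preferring the surviving node in TC1 and falling back to TC2.
    CANDIDATES="node2 node3 node4"     # node2 is in TC1; node3, node4 in TC2
    PGDATA=/var/lib/pgsql/data
    TRIGGER=$PGDATA/trigger            # must match trigger_file in recovery.conf

    for host in $CANDIDATES; do
        # pg_ctl status exits 0 if a PostgreSQL server is running on that node
        if ssh postgres@"$host" "pg_ctl -D $PGDATA status" >/dev/null 2>&1; then
            ssh postgres@"$host" "touch $TRIGGER"
            echo "promoted $host"
            exit 0
        fi
    done
    echo "no standby could be promoted" >&2
    exit 1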
> 
> I was searching for technologies to achieve the aforementioned conditions and
> would like to know whether pgpool itself can somehow do that. But it seems
> manual fail-over can be done only with streaming replication, and automatic
> fail-over only with pgpool on top of streaming replication. So everything
> works only with streaming replication.
> 
> 
> Best regards,
> Michal Mistina
> 
> 

