[pgpool-general: 2002] R: Re: pgpool-II online recovery via rsync: postgresql won't start after the recovery

Tue Aug 6 16:46:34 JST 2013

Dear all.

Thank you Tatsuo for your reply.

Please find attached pgpool.conf and the pgpool_recovery script as per your 
request. I have also attached the pgpool_remote_start script, which is 
essentially the same as the one in the sample folder of pgpool-II, with some 
additional logging to central log storage.

Thank you.

Kind Regards,

Luigi

>----Original message----
>From: ishii at postgresql.org
>Date: 06/08/2013 2.36
>To: <armaeth at libero.it>
>Cc: <pgpool-general at pgpool.net>
>Subject: Re: [pgpool-general: 1979] pgpool-II online recovery via rsync: postgresql won't start after the recovery
>
>How exactly did you set up pgpool-II? Can you please show me
>pgpool.conf and the first stage script and the second stage script? I
>assume you are trying to set up native replication mode (not streaming
>replication) because the script you are showing is for native
>replication mode.
>--
>Tatsuo Ishii
>SRA OSS, Inc. Japan
>English: http://www.sraoss.co.jp/index_en.php
>Japanese: http://www.sraoss.co.jp
>
>> Hi all.
>> 
>> I am running postgresql 9.1.9 and pgpool 3.1.6 under CentOS 6.4.
>> 
>> I tried to set up online recovery as per the pgpool-II user manual, section
>> "Online recovery with rsync".
>> 
>> For this purpose I am using the pgpool_recovery script that's in the
>> sample folder and available on GitHub at:
>> 
>> https://github.com/iakio/pgpool-II/blob/master/sample/pgpool_recovery
>> 
>> The problem I have is that on the server being recovered postgresql won't
>> start and logs the following under pg_log:
>> 
>> LOG: database system was shut down at 2013-07-31 18:02:12 BST
>> LOG: WAL file is from different database system
>> DETAIL: WAL file database system identifier is 5894553353380019103, pg_control database system identifier is 5894200307068334067.
>> LOG: invalid primary checkpoint record
>> LOG: invalid secondary checkpoint record
>> PANIC: could not locate a valid checkpoint record
>> LOG: startup process (PID 1549) was terminated by signal 6: Aborted
>> 
>> I guess that the set of folders being rsync'ed by the pgpool_recovery script
>> is incomplete?
>> 
>> I was tempted to rsync the whole PG_DATA folder, but then I thought that
>> there are a number of things that I don't want to overwrite, such as the logs
>> under pg_log on the failed node.
>> 
>> Would anybody be able to make a recommendation on this and make the
>> necessary changes to the official pgpool_recovery file in the code repository,
>> please?
>> 
>> Thank you.
>> 
>> Kind Regards,
>> 
>> Luigi
>
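
The idea raised in the quoted message, rsyncing the whole data directory while
leaving pg_log on the failed node alone, could look roughly like the sketch
below. It is only an illustration under stated assumptions, not the official
pgpool_recovery script: the function name build_rsync_command, the example
paths and the host name node2 are hypothetical, and the exclude list (pg_log/
and postmaster.pid) is a guess at what one would want to keep local. It only
builds and prints the rsync command line; it does not run it.

```python
import shlex


def build_rsync_command(local_datadir, remote_host, remote_datadir,
                        excludes=("pg_log/", "postmaster.pid")):
    """Build an rsync argv that copies a whole data directory to a remote node.

    All names and defaults here are illustrative assumptions, not part of
    pgpool-II. Paths matching `excludes` are neither sent nor, because
    --delete-excluded is not used, deleted on the receiving side.
    """
    # Trailing slash on the source: copy the directory contents, not the directory itself.
    src = local_datadir.rstrip("/") + "/"
    dest = "{}:{}/".format(remote_host, remote_datadir.rstrip("/"))

    cmd = ["rsync", "-a", "--delete", "-e", "ssh"]
    cmd += ["--exclude={}".format(pattern) for pattern in excludes]
    cmd += [src, dest]
    return cmd


if __name__ == "__main__":
    # Hypothetical paths and host name, for illustration only.
    argv = build_rsync_command("/var/lib/pgsql/9.1/data", "node2",
                               "/var/lib/pgsql/9.1/data")
    print(shlex.join(argv))
```

Copying the directory as a whole keeps pg_control and the WAL files from the
same source cluster, which is the mismatch the log above complains about.
Whether such a copy is consistent still depends on how the recovery stages
are run.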
-------------- next part --------------
A non-text attachment was scrubbed...
Name: pgpool_remote_start
Type: application/octet-stream
Size: 356 bytes
Desc: not available
URL: <http://www.pgpool.net/pipermail/pgpool-general/attachments/20130806/c1405349/attachment.obj>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: pgpool_recovery
Type: application/octet-stream
Size: 674 bytes
Desc: not available
URL: <http://www.pgpool.net/pipermail/pgpool-general/attachments/20130806/c1405349/attachment-0001.obj>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: pgpool.conf
Type: application/octet-stream
Size: 20117 bytes
Desc: not available
URL: <http://www.pgpool.net/pipermail/pgpool-general/attachments/20130806/c1405349/attachment-0002.obj>