<div dir="ltr">Thanks the advices, I have modified the config file generation!<div><br></div><div>Earlier I have asked several times about recommendations on the backend list, but somehow I must not have been clean enough, never got an answer.</div>

<div>Now it&#39;s OK, &quot;trusted servers&quot; are different than the pool members, and the backend list is the same for all.</div><div><br></div><div>will be back soon with the results!</div><div><br></div><div>Thanks again!</div>

<div><br></div><div>Regards,</div><div>Attila</div></div><div class="gmail_extra"><br><br><div class="gmail_quote">2014-04-23 5:10 GMT+02:00 Yugo Nagata <span dir="ltr">&lt;<a href="mailto:nagata@sraoss.co.jp" target="_blank">nagata@sraoss.co.jp</a>&gt;</span>:<br>

<blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex">Hi,<br>
<div class=""><br>
On Tue, 22 Apr 2014 15:24:42 +0200<br>
Attila Heidrich &lt;<a href="mailto:attila.heidrich@gmail.com">attila.heidrich@gmail.com</a>&gt; wrote:<br>
<br>
&gt; Hi!<br>
&gt;<br>
&gt; The summary is generated by a script called &quot;pool&quot; :<br>
&gt; <a href="https://gist.github.com/dansimau/1582492" target="_blank">https://gist.github.com/dansimau/1582492</a><br>
&gt; Unfortunately there are no logs at all, logging was temporary disabled.<br>
<br>
</div>Without logs, I cannot analyze how and why the problem occured,<br>
though, in general, it can occur when heartbeat connection is broken.<br>
<div class=""><br>
&gt; &gt; &gt; This isn&#39;t the first time, usually happens in a high load situation.<br>
<br>
</div>The high load situation means that pgpool is receiving a lot of queries<br>
from clients?<br>
<br>
BTW, I find trusted_servers as the each other server and this is not<br>
recommended. When one server goes down, the other server also goes down<br>
since there is no alive trusted_servers.<br>
<br>
In addition, configurations about backend_hostname are different between<br>
pgpool.confs. These should be same.<br>
<div class="HOEnZb"><div class="h5"><br>
&gt;<br>
&gt; The current (normal) status is:<br>
&gt;<br>
&gt; root@control-1:~# salt postg\* cmd.run &quot;pool status&quot;<br>
&gt; postgres-1:<br>
&gt;     Node: 0<br>
&gt;     Host: postgres-1<br>
&gt;     Port: 5433<br>
&gt;     Weight: 0.500000<br>
&gt;     Status: Up, in pool (1)<br>
&gt;     Role: Master<br>
&gt;<br>
&gt;     Node: 1<br>
&gt;     Host: postgres-2<br>
&gt;     Port: 5433<br>
&gt;     Weight: 0.500000<br>
&gt;     Status: Up, in pool (1)<br>
&gt;     Role: Master<br>
&gt; postgres-2:<br>
&gt;     Node: 0<br>
&gt;     Host: postgres-2<br>
&gt;     Port: 5433<br>
&gt;     Weight: 0.500000<br>
&gt;     Status: Up, in pool and connected (2)<br>
&gt;     Role: Master<br>
&gt;<br>
&gt;     Node: 1<br>
&gt;     Host: postgres-1<br>
&gt;     Port: 5433<br>
&gt;     Weight: 0.500000<br>
&gt;     Status: Up, in pool and connected (2)<br>
&gt;     Role: Master<br>
&gt;<br>
&gt; IP address status<br>
&gt;<br>
&gt; postgres-2:<br>
&gt;     2: eth0: &lt;BROADCAST,MULTICAST,UP,LOWER_UP&gt; mtu 1500 qdisc mq state UP<br>
&gt; qlen 1000<br>
&gt;         link/ether 00:50:56:8f:7e:7b brd ff:ff:ff:ff:ff:ff<br>
&gt;         inet <a href="http://10.6.14.11/24" target="_blank">10.6.14.11/24</a> brd <a href="tel:10.6.14.255" value="+3610614255">10.6.14.255</a> scope global eth0<br>
&gt;            valid_lft forever preferred_lft forever<br>
&gt;         inet <a href="http://10.6.14.15/24" target="_blank">10.6.14.15/24</a> scope global secondary eth0<br>
&gt;            valid_lft forever preferred_lft forever<br>
&gt;         inet6 fe80::250:56ff:fe8f:7e7b/64 scope link<br>
&gt;            valid_lft forever preferred_lft forever<br>
&gt; postgres-1:<br>
&gt;     2: eth0: &lt;BROADCAST,MULTICAST,UP,LOWER_UP&gt; mtu 1500 qdisc mq state UP<br>
&gt; qlen 1000<br>
&gt;         link/ether 00:50:56:8f:58:ab brd ff:ff:ff:ff:ff:ff<br>
&gt;         inet <a href="http://10.6.14.10/24" target="_blank">10.6.14.10/24</a> brd <a href="tel:10.6.14.255" value="+3610614255">10.6.14.255</a> scope global eth0<br>
&gt;            valid_lft forever preferred_lft forever<br>
&gt;         inet6 fe80::250:56ff:fe8f:58ab/64 scope link<br>
&gt;            valid_lft forever preferred_lft forever<br>
&gt;<br>
&gt; Config files with modified passwords:<br>
&gt;<br>
&gt; postgres-2:<br>
&gt;     listen_addresses = &#39;*&#39;<br>
&gt;     port = 5432<br>
&gt;     socket_dir = &#39;/var/run/postgresql&#39;<br>
&gt;     pcp_port = 9898<br>
&gt;     pcp_socket_dir = &#39;/var/run/postgresql&#39;<br>
&gt;     backend_hostname0 = &#39;postgres-2&#39;<br>
&gt;     backend_port0 = 5433<br>
&gt;     backend_weight0 = 1<br>
&gt;     backend_data_directory0 = &#39;/var/lib/postgresql/9.3&#39;<br>
&gt;     backend_flag0 = &#39;ALLOW_TO_FAILOVER&#39;<br>
&gt;     backend_hostname1 = &#39;postgres-1&#39;<br>
&gt;     backend_port1 = 5433<br>
&gt;     backend_weight1 = 1<br>
&gt;     backend_data_directory1 = &#39;/var/lib/postgresql/9.3&#39;<br>
&gt;     backend_flag1 = &#39;ALLOW_TO_FAILOVER&#39;<br>
&gt;     enable_pool_hba = on<br>
&gt;     pool_passwd = &#39;&#39;<br>
&gt;     authentication_timeout = 60<br>
&gt;     ssl = off<br>
&gt;     num_init_children = 32<br>
&gt;     max_pool = 4<br>
&gt;     child_life_time = 300<br>
&gt;     child_max_connections = 0<br>
&gt;     connection_life_time = 0<br>
&gt;     client_idle_limit = 0<br>
&gt;     log_destination = &#39;syslog&#39;<br>
&gt;     print_timestamp = on<br>
&gt;     log_connections = off<br>
&gt;     log_hostname = on<br>
&gt;     log_statement = off<br>
&gt;     log_per_node_statement = on<br>
&gt;     log_standby_delay = &#39;none&#39;<br>
&gt;     syslog_facility = &#39;LOCAL0&#39;<br>
&gt;     syslog_ident = &#39;pgpool&#39;<br>
&gt;     debug_level = 0<br>
&gt;     pid_file_name = &#39;/var/run/postgresql/pgpool.pid&#39;<br>
&gt;     logdir = &#39;/var/log/postgresql&#39;<br>
&gt;     connection_cache = on<br>
&gt;     reset_query_list = &#39;ABORT; DISCARD ALL&#39;<br>
&gt;     replication_mode = on<br>
&gt;     replicate_select = off<br>
&gt;     insert_lock = on<br>
&gt;     lobj_lock_table = &#39;&#39;<br>
&gt;     replication_stop_on_mismatch = off<br>
&gt;     failover_if_affected_tuples_mismatch = off<br>
&gt;     health_check_period = 15<br>
&gt;     health_check_timeout = 5<br>
&gt;     health_check_user = &#39;pg_admin&#39;<br>
&gt;     health_check_password = &#39;P4ssw0rd&#39;<br>
&gt;     health_check_max_retries = 2<br>
&gt;     health_check_retry_delay = 1<br>
&gt;     failover_command = &#39;echo $(date): host:%h, new master id:%m, old master<br>
&gt; id:%M &gt;&gt; /var/lib/pgsql/failover.log&#39;<br>
&gt;     failback_command = &#39;echo $(date): host:%h, new master id:%m, old master<br>
&gt; id:%M &gt;&gt; /var/lib/pgsql/failback.log&#39;<br>
&gt;     fail_over_on_backend_error = on<br>
&gt;     recovery_user = &#39;pg_admin&#39;<br>
&gt;     recovery_password = &#39;P4ssw0rd&#39;<br>
&gt;     recovery_1st_stage_command = &#39;copy_base_backup&#39;<br>
&gt;     recovery_2nd_stage_command = &#39;switch_xlog&#39;<br>
&gt;     recovery_timeout = 90<br>
&gt;     client_idle_limit_in_recovery = 0<br>
&gt;     use_watchdog = on<br>
&gt;     trusted_servers = &#39;postgres-1&#39;<br>
&gt;     delegate_IP = &#39;10.6.14.15&#39;<br>
&gt;     wd_hostname = &#39;postgres-2&#39;<br>
&gt;     wd_port = 9000<br>
&gt;     wd_interval = 5<br>
&gt;     ping_path = &#39;/bin&#39;<br>
&gt;     ifconfig_path = &#39;/var/lib/postgresql&#39;<br>
&gt;     if_up_cmd = &#39;ip add add <a href="http://10.6.14.15/24" target="_blank">10.6.14.15/24</a> dev eth0&#39;<br>
&gt;     if_down_cmd = &#39;ip add del <a href="http://10.6.14.15/24" target="_blank">10.6.14.15/24</a> dev eth0&#39;<br>
&gt;     arping_path = &#39;/var/lib/postgresql&#39;<br>
&gt;     arping_cmd = &#39;arping -U 10.6.14.15 -w 1&#39;<br>
&gt;     wd_life_point = 3<br>
&gt;     wd_lifecheck_query = &#39;SELECT 1&#39;<br>
&gt;     wd_escalation_command = &#39;&#39;<br>
&gt;     wd_lifecheck_method = &#39;heartbeat&#39;<br>
&gt;     wd_interval = 15<br>
&gt;     wd_heartbeat_port = 9694<br>
&gt;     wd_heartbeat_keepalive = 5<br>
&gt;     wd_heartbeat_deadtime = 30<br>
&gt;     heartbeat_destination0 = &#39;postgres-1&#39;<br>
&gt;     heartbeat_destination_port0 = 9694<br>
&gt;     heartbeat_device0 = &#39;&#39;<br>
&gt;     other_pgpool_hostname0 = &#39;postgres-1&#39;<br>
&gt;     other_pgpool_port0 = 5432<br>
&gt;     other_wd_port0 = 9000<br>
&gt;     relcache_expire = 0<br>
&gt;     relcache_size = 256<br>
&gt;     check_temp_table = on<br>
&gt;     memory_cache_enabled = off<br>
&gt;     memqcache_method = &#39;shmem&#39;<br>
&gt;     memqcache_memcached_host = &#39;localhost&#39;<br>
&gt;     memqcache_memcached_port = 11211<br>
&gt;     memqcache_total_size = 67108864<br>
&gt;     memqcache_max_num_cache = 1000000<br>
&gt;     memqcache_expire = 0<br>
&gt;     memqcache_auto_cache_invalidation = on<br>
&gt;     memqcache_maxcache = 409600<br>
&gt;     memqcache_cache_block_size = 1048576<br>
&gt;     memqcache_oiddir = &#39;/var/log/pgpool/oiddir&#39;<br>
&gt;     white_memqcache_table_list = &#39;&#39;<br>
&gt;     black_memqcache_table_list = &#39;&#39;<br>
&gt; postgres-1:<br>
&gt;     listen_addresses = &#39;*&#39;<br>
&gt;     port = 5432<br>
&gt;     socket_dir = &#39;/var/run/postgresql&#39;<br>
&gt;     pcp_port = 9898<br>
&gt;     pcp_socket_dir = &#39;/var/run/postgresql&#39;<br>
&gt;     backend_hostname0 = &#39;postgres-1&#39;<br>
&gt;     backend_port0 = 5433<br>
&gt;     backend_weight0 = 1<br>
&gt;     backend_data_directory0 = &#39;/var/lib/postgresql/9.3&#39;<br>
&gt;     backend_flag0 = &#39;ALLOW_TO_FAILOVER&#39;<br>
&gt;     backend_hostname1 = &#39;postgres-2&#39;<br>
&gt;     backend_port1 = 5433<br>
&gt;     backend_weight1 = 1<br>
&gt;     backend_data_directory1 = &#39;/var/lib/postgresql/9.3&#39;<br>
&gt;     backend_flag1 = &#39;ALLOW_TO_FAILOVER&#39;<br>
&gt;     enable_pool_hba = on<br>
&gt;     pool_passwd = &#39;&#39;<br>
&gt;     authentication_timeout = 60<br>
&gt;     ssl = off<br>
&gt;     num_init_children = 32<br>
&gt;     max_pool = 4<br>
&gt;     child_life_time = 300<br>
&gt;     child_max_connections = 0<br>
&gt;     connection_life_time = 0<br>
&gt;     client_idle_limit = 0<br>
&gt;     log_destination = &#39;syslog&#39;<br>
&gt;     print_timestamp = on<br>
&gt;     log_connections = off<br>
&gt;     log_hostname = on<br>
&gt;     log_statement = off<br>
&gt;     log_per_node_statement = on<br>
&gt;     log_standby_delay = &#39;none&#39;<br>
&gt;     syslog_facility = &#39;LOCAL0&#39;<br>
&gt;     syslog_ident = &#39;pgpool&#39;<br>
&gt;     debug_level = 0<br>
&gt;     pid_file_name = &#39;/var/run/postgresql/pgpool.pid&#39;<br>
&gt;     logdir = &#39;/var/log/postgresql&#39;<br>
&gt;     connection_cache = on<br>
&gt;     reset_query_list = &#39;ABORT; DISCARD ALL&#39;<br>
&gt;     replication_mode = on<br>
&gt;     replicate_select = off<br>
&gt;     insert_lock = on<br>
&gt;     lobj_lock_table = &#39;&#39;<br>
&gt;     replication_stop_on_mismatch = off<br>
&gt;     failover_if_affected_tuples_mismatch = off<br>
&gt;     health_check_period = 15<br>
&gt;     health_check_timeout = 5<br>
&gt;     health_check_user = &#39;pg_admin&#39;<br>
&gt;     health_check_password = &#39;P4ssw0rd&#39;<br>
&gt;     health_check_max_retries = 2<br>
&gt;     health_check_retry_delay = 1<br>
&gt;     failover_command = &#39;echo $(date): host:%h, new master id:%m, old master<br>
&gt; id:%M &gt;&gt; /var/lib/pgsql/failover.log&#39;<br>
&gt;     failback_command = &#39;echo $(date): host:%h, new master id:%m, old master<br>
&gt; id:%M &gt;&gt; /var/lib/pgsql/failback.log&#39;<br>
&gt;     fail_over_on_backend_error = on<br>
&gt;     recovery_user = &#39;pg_admin&#39;<br>
&gt;     recovery_password = &#39;P4ssw0rd&#39;<br>
&gt;     recovery_1st_stage_command = &#39;copy_base_backup&#39;<br>
&gt;     recovery_2nd_stage_command = &#39;switch_xlog&#39;<br>
&gt;     recovery_timeout = 90<br>
&gt;     client_idle_limit_in_recovery = 0<br>
&gt;     use_watchdog = on<br>
&gt;     trusted_servers = &#39;postgres-2&#39;<br>
&gt;     delegate_IP = &#39;10.6.14.15&#39;<br>
&gt;     wd_hostname = &#39;postgres-1&#39;<br>
&gt;     wd_port = 9000<br>
&gt;     wd_interval = 5<br>
&gt;     ping_path = &#39;/bin&#39;<br>
&gt;     ifconfig_path = &#39;/var/lib/postgresql&#39;<br>
&gt;     if_up_cmd = &#39;ip add add <a href="http://10.6.14.15/24" target="_blank">10.6.14.15/24</a> dev eth0&#39;<br>
&gt;     if_down_cmd = &#39;ip add del <a href="http://10.6.14.15/24" target="_blank">10.6.14.15/24</a> dev eth0&#39;<br>
&gt;     arping_path = &#39;/var/lib/postgresql&#39;<br>
&gt;     arping_cmd = &#39;arping -U 10.6.14.15 -w 1&#39;<br>
&gt;     wd_life_point = 3<br>
&gt;     wd_lifecheck_query = &#39;SELECT 1&#39;<br>
&gt;     wd_escalation_command = &#39;&#39;<br>
&gt;     wd_lifecheck_method = &#39;heartbeat&#39;<br>
&gt;     wd_interval = 15<br>
&gt;     wd_heartbeat_port = 9694<br>
&gt;     wd_heartbeat_keepalive = 5<br>
&gt;     wd_heartbeat_deadtime = 30<br>
&gt;     heartbeat_destination0 = &#39;postgres-2&#39;<br>
&gt;     heartbeat_destination_port0 = 9694<br>
&gt;     heartbeat_device0 = &#39;&#39;<br>
&gt;     other_pgpool_hostname0 = &#39;postgres-2&#39;<br>
&gt;     other_pgpool_port0 = 5432<br>
&gt;     other_wd_port0 = 9000<br>
&gt;     relcache_expire = 0<br>
&gt;     relcache_size = 256<br>
&gt;     check_temp_table = on<br>
&gt;     memory_cache_enabled = off<br>
&gt;     memqcache_method = &#39;shmem&#39;<br>
&gt;     memqcache_memcached_host = &#39;localhost&#39;<br>
&gt;     memqcache_memcached_port = 11211<br>
&gt;     memqcache_total_size = 67108864<br>
&gt;     memqcache_max_num_cache = 1000000<br>
&gt;     memqcache_expire = 0<br>
&gt;     memqcache_auto_cache_invalidation = on<br>
&gt;     memqcache_maxcache = 409600<br>
&gt;     memqcache_cache_block_size = 1048576<br>
&gt;     memqcache_oiddir = &#39;/var/log/pgpool/oiddir&#39;<br>
&gt;     white_memqcache_table_list = &#39;&#39;<br>
&gt;     black_memqcache_table_list = &#39;&#39;<br>
&gt;<br>
&gt;<br>
&gt;<br>
&gt; 2014-04-18 10:36 GMT+02:00 Yugo Nagata &lt;<a href="mailto:nagata@sraoss.co.jp">nagata@sraoss.co.jp</a>&gt;:<br>
&gt;<br>
&gt; &gt; Hi,<br>
&gt; &gt;<br>
&gt; &gt; On Tue, 15 Apr 2014 12:24:51 +0200<br>
&gt; &gt; Attila Heidrich &lt;<a href="mailto:attila.heidrich@gmail.com">attila.heidrich@gmail.com</a>&gt; wrote:<br>
&gt; &gt;<br>
&gt; &gt; &gt; Dear guys!<br>
&gt; &gt; &gt;<br>
&gt; &gt; &gt; Where to find the problem in the situation above?<br>
&gt; &gt; &gt;<br>
&gt; &gt; &gt; No logs at all, for some reason pgpool stopped logging (usually it uses<br>
&gt; &gt; &gt; syslog).<br>
&gt; &gt;<br>
&gt; &gt; The followings seem a summarised results of some command (pcp command or<br>
&gt; &gt; show pool_nodes?). I want to konw the actual command results. Could you<br>
&gt; &gt; please send me these, log messages, and pgpool.conf?<br>
&gt; &gt;<br>
&gt; &gt; &gt;<br>
&gt; &gt; &gt; root@postgres-1:/etc/pgpool2# pool status<br>
&gt; &gt; &gt; Node: 0<br>
&gt; &gt; &gt; Host: postgres-1<br>
&gt; &gt; &gt; Port: 5433<br>
&gt; &gt; &gt; Weight: 0.500000<br>
&gt; &gt; &gt; Status: Up, in pool (1)<br>
&gt; &gt; &gt; Role: Master<br>
&gt; &gt; &gt;<br>
&gt; &gt; &gt; Node: 1<br>
&gt; &gt; &gt; Host: postgres-2<br>
&gt; &gt; &gt; Port: 5433<br>
&gt; &gt; &gt; Weight: 0.500000<br>
&gt; &gt; &gt; Status: Up, in pool (1)<br>
&gt; &gt; &gt; Role: Master<br>
&gt; &gt; &gt;<br>
&gt; &gt; &gt; root@postgres-2:/etc/pgpool2# pool status<br>
&gt; &gt; &gt; Node: 0<br>
&gt; &gt; &gt; Host: postgres-2<br>
&gt; &gt; &gt; Port: 5433<br>
&gt; &gt; &gt; Weight: 0.500000<br>
&gt; &gt; &gt; Status: Up, in pool and connected (2)<br>
&gt; &gt; &gt; Role: Master<br>
&gt; &gt; &gt;<br>
&gt; &gt; &gt; Node: 1<br>
&gt; &gt; &gt; Host: postgres-1<br>
&gt; &gt; &gt; Port: 5433<br>
&gt; &gt; &gt; Weight: 0.500000<br>
&gt; &gt; &gt; Status: Up, in pool and connected (2)<br>
&gt; &gt; &gt; Role: Master<br>
&gt; &gt; &gt;<br>
&gt; &gt; &gt; This isn&#39;t the first time, usually happens in a high load situation.<br>
&gt; &gt; &gt;<br>
&gt; &gt; &gt; Attila<br>
&gt; &gt;<br>
&gt; &gt;<br>
&gt; &gt; --<br>
&gt; &gt; Yugo Nagata &lt;<a href="mailto:nagata@sraoss.co.jp">nagata@sraoss.co.jp</a>&gt;<br>
&gt; &gt;<br>
<br>
<br>
</div></div><span class="HOEnZb"><font color="#888888">--<br>
Yugo Nagata &lt;<a href="mailto:nagata@sraoss.co.jp">nagata@sraoss.co.jp</a>&gt;<br>
</font></span></blockquote></div><br></div>