Sponsored Content
Operating Systems AIX strange route issue in powerha 5.4? Post 302328480 by rs6000er on Wednesday 24th of June 2009 11:28:49 AM
Old 06-24-2009
strange route issue in powerha 5.4?

hi all



we upgraded hacmp(powerha) from 5.2 to 5.4 recently .



during the failover testing, we found a strange network issue. after standby node took service ip address (172.15.100.8) online at standby NIC, we were able to log in the standby node by telnet 172.15.100.8 which stays at standby node standyb NIC.



when we tried to run ping or traceroute to a ip address outside of of our network, for example google.com or ibm.com, the NIC that holds service ip address were not able to send any packet out at all.



instead, the primay NIC that holds boot address of standby node (172.15.103.79) will take in place and send the ping packet out or traceroute to out side of world.



on the other hand, any testing within our intranet acted normal. we checked dns server and routing table of standby node and gateway setup, they were all good.



the follwoing info is the routing configuration from standby node.



Routing tables
Destination Gateway Flags Refs Use If Exp Groups

Route Tree for Protocol Family 2 (Internet):
default 172.15.100.150 UG 1 28215 en4 - -
127/8 127.0.0.1 U 9 921983 lo0 - -
172.15.100.0 172.15.103.79 UHSb 0 0 en4 - - =>
172.15.100/22 172.15.103.79 U 6 4212671 en4 - -
172.15.103.79 127.0.0.1 UGHS 0 671937 lo0 - -
172.15.103.255 172.15.103.79 UHSb 0 288 en4 - -
172.16.60.0 172.16.60.60 UHSb 0 0 en5 - - =>
172.16.60/22 172.16.60.60 U 2 2712283 en5 - -
172.16.60.60 127.0.0.1 UGHS 0 382645 lo0 - -
172.16.63.255 172.16.60.60 UHSb 0 1 en5 - -

Route Tree for Protocol Family 24 (Internet v6):
::1 ::1 UH 0 236 lo0 - -


as you may see, the default routing is set up for en4 (the primay NIC that holds boot ip address). is it possible that we manally set up a route entry to let en5 ( the standby NIC that will holds the service ip address when failover) go out side as well like this?



Routing tables
Destination Gateway Flags Refs Use If Exp Groups

Route Tree for Protocol Family 2 (Internet):
default 172.15.100.150 UG 1 28217 en4 - -
127/8 127.0.0.1 U 10 922176 lo0 - -
172.15.100.0 172.15.103.79 UHSb 0 0 en4 - - =>
172.15.100/22 172.15.103.79 U 6 4213902 en4 - - =>
172.15.100/22 172.15.100.8 UG 0 6 en5 - -
172.15.103.79 127.0.0.1 UGHS 0 672133 lo0 - -
172.15.103.255 172.15.103.79 UHSb 0 288 en4 - -
172.16.60.0 172.16.60.60 UHSb 0 0 en5 - - =>
172.16.60/22 172.16.60.60 U 2 2712830 en5 - -
172.16.60.60 127.0.0.1 UGHS 0 382726 lo0 - -
172.16.63.255 172.16.60.60 UHSb 0 1 en5 - -

Route Tree for Protocol Family 24 (Internet v6):
::1 ::1 UH 0 236 lo0 - -
 

10 More Discussions You Might Find Interesting

1. UNIX for Advanced & Expert Users

Strange Number comparison issue

Hi, I am comparing two numbers, but it gives strange results: My Code: if then echo "True" else echo "False" fi This code gives False for the follwoing comparison where as True for the following: Any reason for this? Both Should have given False... I am using... (9 Replies)
Discussion started by: shihabvk
9 Replies

2. UNIX for Dummies Questions & Answers

Strange issue

I have created a server monitor program in Java that connects to servers via SSH and constantly refreshes the new server load every 3 mins. The problem is that even though it is a Java app running shell commands to get the load info, the commands still appear in 'history'. The question is.. is... (1 Reply)
Discussion started by: AndrewSH
1 Replies

3. UNIX for Advanced & Expert Users

strange aunthentication issue with email

hi again i'm running a centos 4.x box with sendmail and openwebmail. I had a weird situation where users couldn't not get authenticated using a pop mail client but but were able to aunthenticate using the openwebmail webmail client. I looked through the logs and didn't see anything strange. I... (0 Replies)
Discussion started by: mcraul
0 Replies

4. Solaris

strange issue with ALOM in t51...

Hi , running solaris 10 on t5120 and i see ILOM and ALOM (SC) interfaces working fine earlier and suddenly they are responding strangely, First thing , I can connect fine to ILOM or ALOM but then console -f just hangs.. resetsc error : I do not really want to poweroff and poweron... (13 Replies)
Discussion started by: upengan78
13 Replies

5. Web Development

Strange Mysql issue

Hi all, I recently changed the name of my hostname from 'abc123' to 'abc456' (as an example). I then added a user in my mysql database with the new host and removed references to the old users and hostname. The strange thing is though, a process using the database still uses the old... (1 Reply)
Discussion started by: muay_tb
1 Replies

6. Solaris

multicast arp/route issue

Hi forum members, I've run into a problem that I'm hoping you can sort out. Whenever I run an arp -a or netstat -p the command hangs. It seems that the system is trying to reverse lookup a multicast address and this is what causes the issues. The commands are actually generated via... (2 Replies)
Discussion started by: melias
2 Replies

7. Solaris

Route Addition Issue

Hi, I wanted to add route on the solaris 10 box as shown below. route add -net 172.16.10.0 172.16.10.11 where 172.16.10.11 is the ip address of the solaris machine. If I execute the above command I am getting the below error gateway:172.16.10.11 Network is unreachable. I am not... (1 Reply)
Discussion started by: ravi_rn
1 Replies

8. UNIX for Dummies Questions & Answers

Strange Keyboard and Mouse Issue

Hello All, PC: CuBox-i (*i.MX6) Mini-PC OS: openSUSE 13.1 (Bottle) (armv7hl) Kernel: 3.14.14-cubox-i # uname -a Linux CuBox-HQ 3.14.14-cubox-i #1 SMP Sat Sep 13 03:48:24 UTC 2014 armv7l armv7l armv7l GNU/LinuxSo I've been having this random issue happen on this PC where a few strange... (12 Replies)
Discussion started by: mrm5102
12 Replies

9. Shell Programming and Scripting

Strange Issue with sendmail

more works.sh #!/bin/ksh { print "From: reportgenerator@myserver.com" print "To: randomguy@myfirm.com" print "MIME-Version: 1.0" print "Content-Type: text/html" print "Subject: Disk Report" print "<body>" print "<table border=1>" ... (3 Replies)
Discussion started by: mohtashims
3 Replies

10. Solaris

Default route issue in share local zone in Solaris 11

Hi We took the reboot of server/global zone after that Solaris 11 share local zone not able to ping the default gateway but global zone able to ping same default gateway. I check/compare the before activity logs of global zone and local zones which looks fine to me, further IP and netmask of... (6 Replies)
Discussion started by: sb200
6 Replies
PG_STANDBY(1)						  PostgreSQL 9.2.7 Documentation					     PG_STANDBY(1)

NAME
pg_standby - supports the creation of a PostgreSQL warm standby server SYNOPSIS
pg_standby [option...] archivelocation nextwalfile xlogfilepath [restartwalfile] DESCRIPTION
pg_standby supports creation of a "warm standby" database server. It is designed to be a production-ready program, as well as a customizable template should you require specific modifications. pg_standby is designed to be a waiting restore_command, which is needed to turn a standard archive recovery into a warm standby operation. Other configuration is required as well, all of which is described in the main server manual (see Section 25.2, "Log-Shipping Standby Servers", in the documentation). To configure a standby server to use pg_standby, put this into its recovery.conf configuration file: restore_command = 'pg_standby archiveDir %f %p %r' where archiveDir is the directory from which WAL segment files should be restored. If restartwalfile is specified, normally by using the %r macro, then all WAL files logically preceding this file will be removed from archivelocation. This minimizes the number of files that need to be retained, while preserving crash-restart capability. Use of this parameter is appropriate if the archivelocation is a transient staging area for this particular standby server, but not when the archivelocation is intended as a long-term WAL archive area. pg_standby assumes that archivelocation is a directory readable by the server-owning user. If restartwalfile (or -k) is specified, the archivelocation directory must be writable too. There are two ways to fail over to a "warm standby" database server when the master server fails: Smart Failover In smart failover, the server is brought up after applying all WAL files available in the archive. This results in zero data loss, even if the standby server has fallen behind, but if there is a lot of unapplied WAL it can be a long time before the standby server becomes ready. To trigger a smart failover, create a trigger file containing the word smart, or just create it and leave it empty. Fast Failover In fast failover, the server is brought up immediately. Any WAL files in the archive that have not yet been applied will be ignored, and all transactions in those files are lost. To trigger a fast failover, create a trigger file and write the word fast into it. pg_standby can also be configured to execute a fast failover automatically if no new WAL file appears within a defined interval. OPTIONS
pg_standby accepts the following command-line arguments: -c Use cp or copy command to restore WAL files from archive. This is the only supported behavior so this option is useless. -d Print lots of debug logging output on stderr. -k Remove files from archivelocation so that no more than this many WAL files before the current one are kept in the archive. Zero (the default) means not to remove any files from archivelocation. This parameter will be silently ignored if restartwalfile is specified, since that specification method is more accurate in determining the correct archive cut-off point. Use of this parameter is deprecated as of PostgreSQL 8.3; it is safer and more efficient to specify a restartwalfile parameter. A too small setting could result in removal of files that are still needed for a restart of the standby server, while a too large setting wastes archive space. -r maxretries Set the maximum number of times to retry the copy command if it fails (default 3). After each failure, we wait for sleeptime * num_retries so that the wait time increases progressively. So by default, we will wait 5 secs, 10 secs, then 15 secs before reporting the failure back to the standby server. This will be interpreted as end of recovery and the standby will come up fully as a result. -s sleeptime Set the number of seconds (up to 60, default 5) to sleep between tests to see if the WAL file to be restored is available in the archive yet. The default setting is not necessarily recommended; consult Section 25.2, "Log-Shipping Standby Servers", in the documentation for discussion. -t triggerfile Specify a trigger file whose presence should cause failover. It is recommended that you use a structured file name to avoid confusion as to which server is being triggered when multiple servers exist on the same system; for example /tmp/pgsql.trigger.5432. -V, --version Print the pg_standby version and exit. -w maxwaittime Set the maximum number of seconds to wait for the next WAL file, after which a fast failover will be performed. A setting of zero (the default) means wait forever. The default setting is not necessarily recommended; consult Section 25.2, "Log-Shipping Standby Servers", in the documentation for discussion. -?, --help Show help about pg_standby command line arguments, and exit. NOTES
pg_standby is designed to work with PostgreSQL 8.2 and later. PostgreSQL 8.3 provides the %r macro, which is designed to let pg_standby know the last file it needs to keep. With PostgreSQL 8.2, the -k option must be used if archive cleanup is required. This option remains available in 8.3, but its use is deprecated. PostgreSQL 8.4 provides the recovery_end_command option. Without this option a leftover trigger file can be hazardous. pg_standby is written in C and has an easy-to-modify source code, with specifically designated sections to modify for your own needs EXAMPLES
On Linux or Unix systems, you might use: archive_command = 'cp %p .../archive/%f' restore_command = 'pg_standby -d -s 2 -t /tmp/pgsql.trigger.5442 .../archive %f %p %r 2>>standby.log' recovery_end_command = 'rm -f /tmp/pgsql.trigger.5442' where the archive directory is physically located on the standby server, so that the archive_command is accessing it across NFS, but the files are local to the standby (enabling use of ln). This will: o produce debugging output in standby.log o sleep for 2 seconds between checks for next WAL file availability o stop waiting only when a trigger file called /tmp/pgsql.trigger.5442 appears, and perform failover according to its content o remove the trigger file when recovery ends o remove no-longer-needed files from the archive directory On Windows, you might use: archive_command = 'copy %p ...\archive\%f' restore_command = 'pg_standby -d -s 5 -t C:pgsql.trigger.5442 ...archive %f %p %r 2>>standby.log' recovery_end_command = 'del C:pgsql.trigger.5442' Note that backslashes need to be doubled in the archive_command, but not in the restore_command or recovery_end_command. This will: o use the copy command to restore WAL files from archive o produce debugging output in standby.log o sleep for 5 seconds between checks for next WAL file availability o stop waiting only when a trigger file called C:pgsql.trigger.5442 appears, and perform failover according to its content o remove the trigger file when recovery ends o remove no-longer-needed files from the archive directory The copy command on Windows sets the final file size before the file is completely copied, which would ordinarily confuse pg_standby. Therefore pg_standby waits sleeptime seconds once it sees the proper file size. GNUWin32's cp sets the file size only after the file copy is complete. Since the Windows example uses copy at both ends, either or both servers might be accessing the archive directory across the network. AUTHOR
Simon Riggs <simon@2ndquadrant.com> SEE ALSO
pg_archivecleanup(1) PostgreSQL 9.2.7 2014-02-17 PG_STANDBY(1)
All times are GMT -4. The time now is 12:36 AM.
Unix & Linux Forums Content Copyright 1993-2022. All Rights Reserved.
Privacy Policy