in.mpathd Cannot meet requested failure detection time


 
Thread Tools Search this Thread
Operating Systems Solaris in.mpathd Cannot meet requested failure detection time
# 1  
Old 06-28-2011
in.mpathd Cannot meet requested failure detection time

Hello World

I am facing following issue on machine


HW:
Sun Fire X4200 M2
OS:
Solaris 10/08 s10x_u6wos_07b X86
Code:
Errors:
Jun 28 08:11:46 backupsrv in.mpathd[197]: [ID 302819 daemon.error] Improved failure detection time 24528 ms on (inet nge1) for group "prd"
Jun 28 08:11:46 backupsrv in.mpathd[197]: [ID 302819 daemon.error] Improved failure detection time 12264 ms on (inet e1000g1) for group "prd"
Jun 28 08:11:47 backupsrv in.mpathd[197]: [ID 302819 daemon.error] Improved failure detection time 10000 ms on (inet nge1) for group "prd"
Jun 28 10:45:33 backupsrv in.mpathd[197]: [ID 585766 daemon.error] Cannot meet requested failure detection time of 10000 ms on (inet nge1) new failure detection time for group "prd" is 41230 ms
Jun 28 10:46:33 backupsrv in.mpathd[197]: [ID 302819 daemon.error] Improved failure detection time 20615 ms on (inet nge1) for group "prd"
Jun 28 10:46:33 backupsrv in.mpathd[197]: [ID 302819 daemon.error] Improved failure detection time 10307 ms on (inet e1000g1) for group "prd"
Jun 28 10:46:35 backupsrv in.mpathd[197]: [ID 302819 daemon.error] Improved failure detection time 10000 ms on (inet e1000g1) for group "prd"
Jun 28 15:01:27 backupsrv in.mpathd[197]: [ID 594170 daemon.error] NIC failure detected on nge1 of group prd
Jun 28 15:01:27 backupsrv in.mpathd[197]: [ID 832587 daemon.error] Successfully failed over from NIC nge1 to NIC e1000g1
Jun 28 15:01:29 backupsrv in.mpathd[197]: [ID 299542 daemon.error] NIC repair detected on nge1 of group prd
Jun 28 15:01:29 backupsrv in.mpathd[197]: [ID 620804 daemon.error] Successfully failed back to NIC nge1
Jun 28 15:02:27 backupsrv in.mpathd[197]: [ID 585766 daemon.error] Cannot meet requested failure detection time of 10000 ms on (inet e1000g1) new failure detection time for group "prd" is 153664 ms
Jun 28 15:03:27 backupsrv in.mpathd[197]: [ID 302819 daemon.error] Improved failure detection time 76832 ms on (inet e1000g1) for group "prd"
Jun 28 15:03:27 backupsrv in.mpathd[197]: [ID 302819 daemon.error] Improved failure detection time 38416 ms on (inet nge1) for group "prd"
Jun 28 15:03:28 backupsrv in.mpathd[197]: [ID 302819 daemon.error] Improved failure detection time 19208 ms on (inet nge1) for group "prd"
Jun 28 15:03:29 backupsrv in.mpathd[197]: [ID 302819 daemon.error] Improved failure detection time 10000 ms on (inet e1000g1) for group "prd"

I have checked there is no issue on switch on which these interfaces are connected.
No crc errors.
These interfaces are full duplex and 1000Mbps autoneg on on both machine and switch
I don't know why such errors pop up everyday

Smilie
Below is network config:
Code:
root@backupsrv# uname -a
SunOS backupsrv 5.10 Generic_138889-03 i86pc i386 i86pc
root@backupsrv# ifconfig -a
lo0: flags=2001000849<UP,LOOPBACK,RUNNING,MULTICAST,IPv4,VIRTUAL> mtu 8232 index 1
        inet 127.0.0.1 netmask ff000000 
e1000g0: flags=201000843<UP,BROADCAST,RUNNING,MULTICAST,IPv4,CoS> mtu 1500 index 2
        inet 192.168.255.2 netmask ffffff00 broadcast 192.168.255.255
        ether 0:21:28:10:63:6c 
e1000g1: flags=269040843<UP,BROADCAST,RUNNING,MULTICAST,DEPRECATED,IPv4,NOFAILOVER,STANDBY,INACTIVE,CoS> mtu 1500 index 3
        inet 172.18.190.26 netmask ffffffe0 broadcast 172.18.190.31
        groupname prd
        ether 0:21:28:10:63:6d 
nge0: flags=201000843<UP,BROADCAST,RUNNING,MULTICAST,IPv4,CoS> mtu 1500 index 4
        inet 10.20.30.90 netmask ffffff00 broadcast 10.20.30.255
        ether 0:21:28:10:63:6a 
nge1: flags=201000843<UP,BROADCAST,RUNNING,MULTICAST,IPv4,CoS> mtu 1500 index 5
        inet 172.18.190.27 netmask ffffffe0 broadcast 172.18.190.31
        groupname prd
        ether 0:21:28:10:63:6b 
nge1:1: flags=209040843<UP,BROADCAST,RUNNING,MULTICAST,DEPRECATED,IPv4,NOFAILOVER,CoS> mtu 1500 index 5
        inet 172.18.190.25 netmask ffffffe0 broadcast 172.18.190.31
nxge0: flags=1201000843<UP,BROADCAST,RUNNING,MULTICAST,IPv4,CoS,FIXEDMTU> mtu 9000 index 6
        inet 192.168.254.2 netmask ffffff00 broadcast 192.168.254.255
        ether 0:21:28:1e:90:50 
root@backupsrv# netstat -in
Name  Mtu  Net/Dest      Address        Ipkts  Ierrs Opkts  Oerrs Collis Queue 
lo0   8232 127.0.0.0     127.0.0.1      29572877 0     29572877 0     0      0     
e1000g0 1500 192.168.255.0 192.168.255.2  823122402 0     331691531 0     0      0     
e1000g1 1500 172.18.190.0  172.18.190.26  312615 0     301785 0     0      0     
nge0  1500 10.20.30.0    10.20.30.90    501750 0     130462 0     0      0     
nge1  1500 172.18.190.0  172.18.190.27  38355391 0     47160563 0     0      0     
nxge0 9000 192.168.254.0 192.168.254.2  126187049 0     64678723 0     0      0     
 
root@backupsrv# netsat -nr
bash: netsat: command not found
root@backupsrv# netstat -nr
 
Routing Table: IPv4
  Destination           Gateway           Flags  Ref     Use     Interface 
-------------------- -------------------- ----- ----- ---------- --------- 
default              172.18.190.14        UG        1        783           
10.20.30.0           10.20.30.90          U         1        360 nge0      
172.18.190.0         172.18.190.27        U         1        323 nge1      
172.18.190.0         172.18.190.25        U         1          0 nge1:1    
172.18.190.0         172.18.190.26        U         1        197 e1000g1   
192.168.254.0        192.168.254.2        U         1        205 nxge0     
192.168.255.0        192.168.255.2        U         1       1135 e1000g0   
224.0.0.0            172.18.190.27        U         1          0 nge1      
127.0.0.1            127.0.0.1            UH      863    4619931 lo0

# 2  
Old 06-28-2011
Man Page for in.mpathd (All Section 1m) - The UNIX and Linux Forums

Cannot meet requested failure detection time of time ms on (inet[6]
interface_name) new failure detection time for group group_name is time
ms
Description:

The round trip time for ICMP probes is higher than necessary to
maintain the current failure detection time. The network is proba-
bly congested or the probe targets are loaded. in.mpathd automati-
cally increases the failure detection time to whatever it can
achieve under these conditions.

Improved failure detection time time ms on (inet[6] interface_name) for
group group_name
Description:

The round trip time for ICMP probes has now decreased and in.mpathd
has lowered the failure detection time correspondingly.

Congestion and too aggressive failover configuration? If an ethernet fails over, the old will be in Windows arp cache 5 minutes as I recall, soooooooo . . . .
# 3  
Old 06-30-2011
Network /firewall problem

I have faced the same sometime back.
If no hardware problem on your side its better check with network admins to see if theres any firewall or network/traffic problem..
# 4  
Old 09-14-2011
This is fairly normal for ipmp ..note:
Improved failure detection time 245*** .. the detection time is getting shorter..

There should not be anything wrong there ..
Login or Register to Ask a Question

Previous Thread | Next Thread

9 More Discussions You Might Find Interesting

1. Shell Programming and Scripting

SQL Script HELP Requested.

Hello ALL , i am requesting help on for this script i am preparing to get the result of a query in a excel sheet : current Error: Script : NO Excel file created. requesting to know where i am going wrong. #!/bin/ksh... (2 Replies)
Discussion started by: anirudhkashikar
2 Replies

2. UNIX for Dummies Questions & Answers

boot up failure unix sco after power failure

hi power went out. next day unix sco wont boot up error code 303. any help appreciated as we are clueless. (11 Replies)
Discussion started by: fredthayer
11 Replies

3. Programming

Parallel Processing Detection and Program Return Value Detection

Hey, for the purpose of a research project I need to know if a specific type of parallel processing is being utilized by any user-run programs. Is there a way to detect whether a program either returns a value to another program at the end of execution, or just utilizes any form of parallel... (4 Replies)
Discussion started by: azar.zorn
4 Replies

4. What is on Your Mind?

Where did you meet UNIX for a first time?

Simple question , where did you meet UNIX OS-es. I started with linux, and then I have meet Solaris and I am all in Solaris right now , almost a year that I am in UNIX, still reading manuals. (35 Replies)
Discussion started by: solaris_user
35 Replies

5. Shell Programming and Scripting

Help on sed requested

Hi I have a problem to resolve, I think sed is the best option, and I am not successful yet. Have a UNIX file which has records as of the 2 character state codes like NY NJ PA DE From the file I need to create this as a variable in the same script or another file -... (7 Replies)
Discussion started by: snair2010
7 Replies

6. Shell Programming and Scripting

Help requested for a script with sed

Hello Folks, I would very much appreciate if I could get help/suggestions on a particular sed usage. I have to write a script to take version info from a version file, compute the image name, print error if the image does not exist. The version file looks like below: " # # version.cfg #... (3 Replies)
Discussion started by: fatimap
3 Replies

7. Solaris

Why in.mpathd errors - performance possibly?

Hello all, Run a search but see no previous queries. Trying to get to the bottom of why a server running Solaris 9 reports the following every other day: lonpcbcfp1:Jun 20 16:33:20 lonpcbcfp1 in.mpathd: missed sending 17 probes cur_time 1478014085 snxt_time 1478015026 snxt_basetime 1478014018... (0 Replies)
Discussion started by: bookiebarton
0 Replies

8. Shell Programming and Scripting

AWK issue--> Help requested

Fairly new scripter so please bare with me if what I have done below is not according to standards. Okay...heres what I am trying to do. I have a pattern that I need to search for in a directory. This gives me a list of files that includes a control file that contains totals of the line nos for... (3 Replies)
Discussion started by: alfredo123
3 Replies

9. Solaris

Your Opinion requested

Ladies/Gentlemen, I am looking for a web-based tool to keep track of my Sun inventory. The following list of fields are fields I would like to store: Root Passwd (needs to be secure) / Hostid / Console Port / IP Address / Platform / Application / Hostname . . . you get the point. Do any of... (4 Replies)
Discussion started by: pc9456
4 Replies
Login or Register to Ask a Question