packet loss problem


 
# 1  
Old 01-07-2009

I have 4 network ports on our Sun T5240 server.

All but one (nxge1) show packet loss:

nxge0 averages 50% packet loss, which is very bad.

nxge2 averages 1-2% packet loss.

nxge3 averages 20% packet loss.

Is there a tool or something to help me find the problem?
# 2  
Old 01-07-2009
Yes there is, it's called brain 1.0 and it's a really useful tool :)

The nxge interfaces are sensitive and often have problems in many environments. Please be sure to apply the latest Solaris patch cluster so that you have the latest nxge drivers on your system. Also check for auto-negotiation problems (are you on a Cisco switch?). Have a look with "dladm show-dev" to see your current settings, and maybe try switching to fixed 1000fdx (ndd get/set is your friend for trying this out).
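To make the "dladm show-dev" check repeatable across all four ports, here is a minimal Python sketch that scans the command's output for links that are down, slow, or not at full duplex. The line layout in the regex is an assumption based on typical Solaris 10 output, the SAMPLE text is made up for illustration (it is not the poster's real output), and the function name suspicious_links is hypothetical.

```python
import re

# Assumed Solaris 10 style line layout, for example:
#   nxge0   link: up   speed: 1000  Mbps   duplex: full
LINE_RE = re.compile(
    r"^(?P<dev>\S+)\s+link:\s*(?P<link>\S+)\s+"
    r"speed:\s*(?P<speed>\d+)\s*Mbps\s+duplex:\s*(?P<duplex>\S+)"
)


def suspicious_links(show_dev_output, want_speed=1000, want_duplex="full"):
    """Return (device, reason) pairs for links that look wrong."""
    findings = []
    for line in show_dev_output.splitlines():
        m = LINE_RE.match(line.strip())
        if not m:
            continue  # skip lines that do not match the assumed layout
        dev = m.group("dev")
        if m.group("link") != "up":
            findings.append((dev, "link is " + m.group("link")))
            continue
        if int(m.group("speed")) != want_speed:
            findings.append((dev, "speed is %s Mbps" % m.group("speed")))
        if m.group("duplex") != want_duplex:
            findings.append((dev, "duplex is " + m.group("duplex")))
    return findings


# Made-up sample text, only to exercise the parser.
SAMPLE = """\
nxge0           link: up        speed: 100   Mbps       duplex: half
nxge1           link: up        speed: 1000  Mbps       duplex: full
nxge2           link: down      speed: 0     Mbps       duplex: unknown
"""

if __name__ == "__main__":
    for dev, reason in suspicious_links(SAMPLE):
        print(dev + ": " + reason)
```

On a real system you would feed it the captured output of "dladm show-dev" instead of SAMPLE. If your release prints a different layout, the regex needs adjusting.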

hth,
DN2
# 3  
Old 01-16-2009
I don't have that version of brain, I only have the beta version. :)

The beta version tells me there may be a configuration issue with
the data center connection or a downright misconfiguration.

The switch interface to the data center shows a lot of collisions. My
switch interface was set to auto and it showed the data center at
half duplex, but when I contacted them they said their side was
set to full duplex. He set it to auto and my side went to full
duplex automatically.

They also mentioned they saw errors and problems on their end.
# 4  
Old 01-16-2009
When one partner is forced to full duplex and the other partner is at auto-negotiate, the auto-negotiating partner gets no negotiation response, misreads its partner, and establishes the wrong duplex. This setup is practically guaranteed to fail in this manner.
# 5  
Old 01-16-2009
Before they made the change to AUTO duplex, I made the change to FULL on
my side and it dropped the connection and went to the redundant connection,
which was also showing as HALF duplex on their side, with my side set to AUTO
for the redundant interface.

So if they were indeed at FULL DUPLEX it would NOT have dropped the
connection and should have been FULL to FULL, but that is not the case.

When I changed back to AUTO it said HALF duplex again on their side. Only
when they changed to AUTO did my side show them as FULL duplex.

Why did my switch say HALF for the data center when my switch was set to
AUTO duplex and the data center side was set to FULL?

Why would the connection drop if supposedly both sides were set to FULL?

This is all very frustrating. The data center is seeing problems on their end
but not disclosing that information to me and suggesting it is my problem.
# 6  
Old 01-16-2009
Don't worry Photon, you are not the first by a long way.
Auto-negotiation of LAN speed and duplex settings does not work at all well
with servers. It can appear to work and then go very wrong after a power failure.

Perderabo's advice is good.

It takes two to tango.

BOTH sides need to be hard set to NOT auto-negotiate.

1) Network
2) Servers

After the network guys have hard-set their LAN ports,
please positively configure your end to match.
Avoid auto-negotiation. Avoid half-duplex.

Then you need a one-time totally cold start,
i.e. shut down the server, wait 5 minutes, then cold start the server.

If this does not clear the fault then it's time to call the network engineer.


Hope this helps.

Last edited by methyl; 01-16-2009 at 09:33 PM.. Reason: minor typo
# 7  
Old 01-16-2009
Quote:
Originally Posted by photon
So if they were indeed at FULL DUPLEX it would NOT have dropped the connection and should have been FULL to FULL, but that is not the case.
The connection would only have dropped if the speeds were mismatched. Duplex should not have caused this.

Quote:
When I changed back to AUTO it said HALF duplex again on their side. Only when they changed to AUTO did my side show them as FULL duplex.

Why did my switch say HALF for data center when my switch was set to
AUTO duplex and data center side was set to FULL?

Why would connection drop if supposedly both sides were set to FULL?
Duplex is simple to understand. If one side is set to AUTO it expects the other side to be set to AUTO also.

Think of it as a 3-way handshake for a connection. The first side asks the second what it can run at. The second side says "I can do full", and the first side says "OK, we will do full". If both are set to AUTO, they agree on the duplex to connect at, which is normally FULL.

If one side is hard coded, it ignores requests for duplex settings, so the side set to AUTO defaults to HALF.
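The rules described in this post can be written down as a small truth table. The Python sketch below is a simplified model, not a description of any particular switch or NIC firmware: it assumes both ends advertise full duplex when negotiating, and it only models the 10/100 style fallback to half duplex. Gigabit copper links handle negotiation differently and are not modelled. The function names are hypothetical.

```python
VALID = ("auto", "full", "half")


def resolve_duplex(side_a, side_b):
    """Return the duplex each side ends up using, as (duplex_a, duplex_b).

    Each argument is the configured setting: 'auto', 'full' or 'half'.
    """
    if side_a not in VALID or side_b not in VALID:
        raise ValueError("settings must be one of %r" % (VALID,))
    if side_a == "auto" and side_b == "auto":
        # Assumption: both ends advertise full duplex and agree on it.
        return ("full", "full")
    # A hard-coded side simply uses its setting. An AUTO side facing a
    # hard-coded partner gets no answer to its request and falls back to half.
    duplex_a = "half" if side_a == "auto" else side_a
    duplex_b = "half" if side_b == "auto" else side_b
    return (duplex_a, duplex_b)


def is_mismatch(side_a, side_b):
    """True when the two ends end up running at different duplex."""
    duplex_a, duplex_b = resolve_duplex(side_a, side_b)
    return duplex_a != duplex_b


if __name__ == "__main__":
    for a in VALID:
        for b in VALID:
            print(a, b, resolve_duplex(a, b), "MISMATCH" if is_mismatch(a, b) else "ok")
```

In this model, AUTO against a hard-coded FULL port yields half on one end and full on the other, which is the classic duplex mismatch that shows up as collisions and packet loss.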

It is not normal to set servers to AUTO. AUTO should only be used when you don't know what might be plugged into the port.

Quote:
This is all very frustrating, the data center is seeing problems on their end but not disclosing that information to me and suggesting it is my problem.
If they are really seeing issues and they believe it is you, then they should be able to provide details as to why the problem is on your side.

They should be able to provide you with the port settings from the running config on the switch. I would ask for this.