I have identical M5000 machines that are needing to transfer very large amounts of data between them. These are fully loaded machines, and I've already checked IO, memory usage, etc... I get poor network performance even when the machines are idle or copying via loopback. The 10 GB NICs are setup in aggregates, so we should be seeing some serious performance out of these. However, I'm only seeing about 30 MB/sec. These are brand new Solaris 11.1 installs, and fully patched.
I'm getting 30 MB/sec no matter which connection I use, whether it's 1 GB and 10 GB connections, physical, VLAN, etc...
For example, when doing an SCP of a 200 MB file it takes 8 seconds and runs at 30 MB/sec:
I figured maybe it was something to do with the network, but I see this slow speed no matter which interface I use -- even localhost. I tried doing a local SCP back to the same machine, and it's taking 8 seconds there too even though it isn't actually hitting the network cards. If I do it as a regular CP it copies the file in less than a second.
Does anyone have any ideas on how to improve the performance? I figure I can't be the only one who has run into this, but my Google searches haven't turned up anything.
these kind of issues i used to see when the network device was set to auto-negotiate the speed/duplex ... see if editing the config files of the necessary interfaces (i.e., /kernel/drv/prim1.conf) to eliminate auto-negotiation helps ...
you may be right about the loopback but i cannot say for sure as i do not have access to those kind of machines -- maybe somebody else here can explain further ...
you may also want to see the configurations of the appropriate network devices on your other solaris servers and their appropriate ports on the switch if they are able to send the big files much faster ...
btw, i had my network group actually hard set the affected ports on the switch to run 100/full so my servers' network ports were not continually auto-negotiating ... the standards at that company was also to set every network port to auto but they had to make an exception for my production servers ...
if your network group balks at the request to hard set the ports, call up oracle support to see if they have better ideas if nobody else here has one ...
You're right, I may have to turn to support. There are just so many possible factors that they often seem to point the finger at "the other guy..."
I did notice one interesting thing. When I open multiple SCP sessions the speed of the throughput jumps to 60 MB/sec with two transfers, 90 MB/sec with three, etc.... CPU and memory never cap out.
The problem turned out to be LACP. The aggregates weren't communicating properly as LACP pairs due to both a configuration problem on the switch and the servers didn't have have passive mode enabled. After I fixed the server end, and the network guys fixed the switch end the speeds went up exponentially.
Hi,
I have 2 machines in production environment:
1. redhat machine for application
2. DB machine (oracle)
The application doing a lot of small read&writes from and to the DB machine.
The problem is that after some few hours the network from the application to the DB becomes very slow and... (4 Replies)
My code
Hi All,
I am having redhat linux 5.3 (Tikanga) with GFS file system and its very very slow for executing ls -ls command also.Please see the below for 2minits 12 second takes.
Please help me to fix the issue.
$ sudo time ls -la BadFiles |wc -l
0.01user 0.26system... (3 Replies)
There is a big problem with the server (VPS based on OpenVZ, CentOS 5, 3GB RAM). The problem is the following. The first 15-20 minutes after starting the server is operating normally, the load average is less than or about 1.0, but then begins to increase sharply% wa, then hovers around 95-99%.... (2 Replies)
Please, I need help tuning my script. It works but it's too slow.
The code reads an acivity log file with 50.000 - 100.000 lines and filters error messages from it. The data in the actlog file look similar to this:
02/08/2011 00:25:01,ANR2034E QUERY MOUNT: No match found using this criteria.... (5 Replies)
hi guys
We are seeing weird issues on my Linux Suse 10, it has lotus 8.5
and 1 filesystem for OS and another for Lotus Database.
the issue is when the Lotus service starts wait on top is very high about 25% percent and in general CPU usage is very high
we found that when this happens if we... (0 Replies)
Hi all
We have got issues with copying a 2.6 GB file from one folder to another folder.
Well, this is not the first issue we are having on the box currently, i will try to explain everything we have done from the past 2 days.
We got a message 2 days back saying that our Production is 98%... (3 Replies)
We have an egrep search in a while loop.
egrep -w "$key" ${PICKUP_DIR}/new_update >> ${PICKUP_DIR}/update_record_new
${PICKUP_DIR}/new_update is 210 MB file
In each iteration, the egrep on an average takes around 50-60 seconds to search. Ther'es nothing significant in the loop other... (7 Replies)
Discussion started by: hidnana
7 Replies
8. Post Here to Contact Site Administrators and Moderators