SocketException: Connection reset


 
Thread Tools Search this Thread
Top Forums Programming SocketException: Connection reset
# 1  
Old 12-07-2010
SocketException: Connection reset

Hi,

I am currently working on a Server/Client application on a cluster

I am running about 100 clients in parallel and one server (32 clients per node with 8 processor => 4 clients per processor) . The server is linked to all the clients via Sockets.

After about 55 iteration (about 20 mins), i have a client dying and i receive the following error:

Client 126 java.net.SocketException: Connection reset
at java.net.SocketInputStream.read(SocketInputStream. java:16Image
at java.net.SocketInputStream.read(SocketInputStream. java:182)
at java.io.ObjectInputStream$PeekInputStream.peek(Obj ectInputStream.java:2249)
at java.io.ObjectInputStream$BlockDataInputStream.pee k(ObjectInputStream.java:2542)
at java.io.ObjectInputStream$BlockDataInputStream.pee kByte(ObjectInputStream.java:2552)
at java.io.ObjectInputStream.readObject0(ObjectInputS tream.java:1297)
at java.io.ObjectInputStream.readObject(ObjectInputSt ream.java:351)
at ServerClient.Server$TcpThread.run(Server.java:789)
Connection reset


the Server.java.789 is : ServerThread.sockIn.readObject();


Do you have any idea what could create the connection reset ? The other clients are still running. It seems that the client socket is creating an exception when we try to read a file which actually does not exist.

My main question is what are the method to identify the problem related to this specific socket (note that I am working on a cluster, hence accessing the node via ssh connection and without having admin access).

I dont understand why suddenly one client would die. It was working well during 20 minutes and suddenly one die ? Do you have any idea and any advice on how to identify the reset of this Socket disconnection ?

I though about using wireshark but the fact that the application is running on a cluster make it difficult to analyze,sniff the packets remotely. I dont know if i can do it remotely.

Do you have any idea how I could debug this problem?

Thanks
masy1800
# 2  
Old 12-08-2010
Perhaps an exception got thrown in read(), on the server side, that you didn't catch?
Login or Register to Ask a Question

Previous Thread | Next Thread

10 More Discussions You Might Find Interesting

1. Red Hat

Can't SSH - Connection reset by Peer

Hi Guys, I can't SSH to a remote system - connection reset by peer..any ideas ? -The Source is Linux, the Remote is Windows -The remote has OpenSSH running on Port 22 - Telnet confirms port is open -User1 has a RSA2 Key (2048) key, which is capture in the 1010101-pub.key specified by the... (1 Reply)
Discussion started by: stevie_velvet
1 Replies

2. UNIX for Dummies Questions & Answers

cat: write error: Connection reset by peer

I have created a script to cat the contents of a log file and pipe it to head and tail so I can get specific lines. When I do this I sometimes randomly get the error "cat: write error: Connection reset by peer". It is completely sporatic and sometimes it doesnt happen and sometimes it does... (4 Replies)
Discussion started by: atelford
4 Replies

3. IP Networking

SocketException : Connection Reset

Hi, I am currently working on a Server/Client application on a cluster I am running about 100 clients in parallel and one server (32 clients per node with 8 processor => 4 clients per processor) . The server is linked to all the clients via Sockets. After about 55 iteration (about 20... (0 Replies)
Discussion started by: masy1800
0 Replies

4. UNIX for Dummies Questions & Answers

command to reset connection

Hi everyone, Here's question I posted on ubuntu's forum with no success :( command to reset connection - Ubuntu Forums I apologize in advance if i'm not supposed to post a link to another forum's thread but well, rather than copy/pasting... I'm hoping to find some answers here :) ... (3 Replies)
Discussion started by: anthalamus
3 Replies

5. Emergency UNIX and Linux Support

The connection was reset

Hi Folks, I have recently started Apache on my Fedora 11. I placed a working html file in the ServerRoot. I can wget and telnet from within the localhost and access the index.html. My telnet to port 80 from another machine works fine but when I try to access the web page through Firefox I... (8 Replies)
Discussion started by: jacki
8 Replies

6. UNIX for Advanced & Expert Users

Connection reset by peer..closing connection

Hello I'm facing the above problem while doing a performance run. I've a script which I'm launching from my windows desktop using mozilla. The script will invoke backend action on a Solaris host which in turn feeds the records to a driver located on a linux box(Cent OS). What's happening is... (1 Reply)
Discussion started by: subramanyab
1 Replies

7. IP Networking

connection reset by peer on freebsd

hi all. am running postgresql in a vm on debian server. i have some client programs connecting to the db. when i check the logs of postgresql i see "connection reset by peer" can someone help me with this issue... thanks. (1 Reply)
Discussion started by: coolatt
1 Replies

8. Shell Programming and Scripting

Connection reset by peer

hi i am connecting to a remote server using sftp protocol. i am using the command like "sftp USER01@122.10.12.45" then i got the error "Couldn't read packet: Connection reset by peer" please help its very urgent (5 Replies)
Discussion started by: Satyak
5 Replies

9. Shell Programming and Scripting

ssh - connection reset by peer

I use ssh to work on the server (OS X 10.4.x) remotely from home (Debian / OS X 10.5.3). If I leave my machine idle, I get disconnected after a couple of minutes. I get the following error message: Read from remote host "server name": Connection reset by peer Connection to "server name" closed.... (2 Replies)
Discussion started by: osxhawk
2 Replies

10. IP Networking

connection reset by peer

:confused: HI, We have 2 servers ( win 2003 + sun 5.8 ) and we run a rsh from the NT to the UNIX. The program runs a shell script on the UNIX and waits for the answer in the EOF. How ever, after period of appx 2 min there is a reply "Recv failed:Connection reset by peer". while the... (5 Replies)
Discussion started by: eyalush
5 Replies
Login or Register to Ask a Question