SocketException : Connection Reset


 
Posted 12-08-2010

Hi,

I am currently working on a server/client application on a cluster.

I am running about 100 clients in parallel and one server (32 clients per node, with 8 processors per node => 4 clients per processor). The server is connected to all the clients via sockets.

After about 55 iterations (about 20 minutes), one client dies and I receive the following error:

Client 126 java.net.SocketException: Connection reset
at java.net.SocketInputStream.read(SocketInputStream.java:16
at java.net.SocketInputStream.read(SocketInputStream.java:182)
at java.io.ObjectInputStream$PeekInputStream.peek(ObjectInputStream.java:2249)
at java.io.ObjectInputStream$BlockDataInputStream.peek(ObjectInputStream.java:2542)
at java.io.ObjectInputStream$BlockDataInputStream.peekByte(ObjectInputStream.java:2552)
at java.io.ObjectInputStream.readObject0(ObjectInputStream.java:1297)
at java.io.ObjectInputStream.readObject(ObjectInputStream.java:351)
at ServerClient.Server$TcpThread.run(Server.java:789)
Connection reset


Line 789 of Server.java is: ServerThread.sockIn.readObject();


Do you have any idea what could cause the connection reset? The other clients are still running. It seems that the client socket throws an exception when we try to read a file that does not actually exist.
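To narrow this down on the server side, one option would be to wrap the readObject() call so that the log records which client failed and what kind of failure it was. The following is only a sketch: `ReadFailure`, `describe` and `readLogged` are hypothetical names that are not part of the real Server.java, and the labels are my own choice. It relies on the standard behaviour that a cleanly closed stream surfaces as EOFException, while "Connection reset" arrives as a SocketException.

```java
import java.io.EOFException;
import java.io.IOException;
import java.io.ObjectInputStream;
import java.net.SocketException;
import java.net.SocketTimeoutException;

// Hypothetical helper, not part of the original Server.java.
class ReadFailure {

    /** Maps an exception thrown by readObject() to a short label for the log. */
    static String describe(IOException e) {
        if (e instanceof EOFException) {
            // The stream ended: the peer closed its side in an orderly way.
            return "EOF";
        }
        if (e instanceof SocketTimeoutException) {
            // Only possible if a read timeout was set with Socket.setSoTimeout().
            return "TIMEOUT";
        }
        if (e instanceof SocketException) {
            // Covers "Connection reset": the connection was torn down abruptly,
            // for example because the peer process died with the socket open.
            return "SOCKET_ERROR: " + e.getMessage();
        }
        // Anything else, e.g. a corrupted object stream.
        return "OTHER_IO: " + e;
    }

    /** Reads one object; on failure logs time, client id and failure kind, then rethrows. */
    static Object readLogged(ObjectInputStream in, int clientId)
            throws IOException, ClassNotFoundException {
        try {
            return in.readObject();
        } catch (IOException e) {
            System.err.println(System.currentTimeMillis()
                    + " client " + clientId + " read failed: " + describe(e));
            throw e;
        }
    }
}
```

With a timestamp and client id in the server log, the failure can be matched against the output of the corresponding client process on its node, which may show whether that client crashed just before the reset.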

My main question is: what methods are there to identify the problem with this specific socket? (Note that I am working on a cluster, so I access the nodes over SSH and do not have admin access.)
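Since no admin access is needed to call the standard java.net.Socket getters, one thing that could be tried is to dump the socket's state into the log whenever a read fails. This is a minimal sketch under that assumption; `SocketDiagnostics` and `snapshot` are hypothetical names, and the choice of fields is only a suggestion.

```java
import java.io.IOException;
import java.net.InetAddress;
import java.net.ServerSocket;
import java.net.Socket;
import java.net.SocketException;

// Hypothetical helper for logging; not taken from the original application.
class SocketDiagnostics {

    /** Builds a one-line snapshot of a socket's local state for the log. */
    static String snapshot(Socket s) {
        StringBuilder sb = new StringBuilder();
        sb.append("remote=").append(s.getRemoteSocketAddress());
        sb.append(" local=").append(s.getLocalSocketAddress());
        // isConnected()/isClosed() describe the local object only; they do not
        // tell us whether the peer is still alive.
        sb.append(" connected=").append(s.isConnected());
        sb.append(" closed=").append(s.isClosed());
        sb.append(" inputShutdown=").append(s.isInputShutdown());
        try {
            sb.append(" soTimeout=").append(s.getSoTimeout());
            sb.append(" keepAlive=").append(s.getKeepAlive());
        } catch (SocketException e) {
            // Socket options cannot be queried, e.g. once the socket is closed.
            sb.append(" options=unavailable(").append(e.getMessage()).append(")");
        }
        return sb.toString();
    }

    // Small loopback demo: connect to ourselves and print the accepted socket's state.
    public static void main(String[] args) throws IOException {
        try (ServerSocket server = new ServerSocket(0, 1, InetAddress.getLoopbackAddress());
             Socket client = new Socket(server.getInetAddress(), server.getLocalPort());
             Socket accepted = server.accept()) {
            System.out.println(snapshot(accepted));
        }
    }
}
```

The remote address and port in the snapshot identify the node and the client connection involved, which at least narrows the search to one process on one node.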

I don't understand why one client would suddenly die. Everything worked well for 20 minutes and then one client died. Do you have any idea or advice on how to identify the cause of this socket reset?

I thought about using Wireshark, but because the application runs on a cluster it is difficult to analyze or sniff the packets remotely. I don't know whether I can do it remotely.

Do you have any idea how I could debug this problem?

Thanks
masy1800