NFS mount hangs


 
# 1  
Old 07-15-2008

Last week I installed Windows Services for UNIX on a Windows 2003 Server test machine (s2003) and exported a folder through NFS. The share is named "storage" and is used as a temporary place for very large export/dump files. Unfortunately, this is the only machine that both has a big, inexpensive disk I can use and is accessible from the production and test network environments.

I was able to mount the NFS share in read/write mode on two Solaris servers, using the command:
Code:
mount -F nfs -o hard,rsize=32768,wsize=32768 s2003:/storage /storage

The two machines are called, respectively:
- eprod, SunOS 5.8
- etest, SunOS 5.10

I used the mounted file system intensively from both machines without any problems. But today, while issuing an "ls" command on eprod, I noticed the message:

Code:
NFS server s2003 not responding still trying

On etest, I didn't have any issue and the remote share is still mounted and working.

So I thought that maybe there had been a network problem over the weekend, and tried to umount and remount "/storage" on eprod:

Code:
eprod/root> umount s2003:/storage
nfs umount: /storage: is busy

eprod/root> fuser /storage        # The command hangs indefinitely (interrupted with ^C)
/storage:

eprod/root> fuser -c /storage     # The command works and it doesn't report anything
/storage:

So I tried the hard way, and it worked:
Code:
umount -f /storage

The problem now is that I am not able to mount the filesystem anymore:
Code:
eprod/root> mount -F nfs -o hard,rsize=32768,wsize=32768 s2003:/storage /storage
NFS server s2003 not responding still trying

The command "hangs" indefinitely... I've already performed basic connection tests from eprod to s2003:
- ping works
- telnet s2003 on port 2049 works

I've also tried to monitor network traffic with netstat and snoop while issuing the mount command:

Code:
eprod/root> netstat -a | grep s2003
eprod.login         s2003.nfsd          0      0 24820      0 SYN_SENT
eprod.53759         s2003.nfsd      65415      0 24820      0 TIME_WAIT
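
In this output the local endpoint of the stuck connection is `eprod.login`, that is, local port 513 (`login` in /etc/services), and its state is SYN_SENT: the client has sent a SYN to the server's nfsd port and has not received an answer. A small sketch for pulling just the endpoint and state out of `netstat -a`, assuming the Solaris column layout shown above (remote address in the second column, state in the last). The name `nfs_conn_states` is hypothetical.

```shell
# Hypothetical helper: read `netstat -a` TCP lines on stdin and print
# "local-endpoint state" for each connection to <server>.nfsd.
# On Solaris use nawk or /usr/xpg4/bin/awk; the legacy /usr/bin/awk lacks -v.
nfs_conn_states() {
    server=$1
    awk -v remote="$server.nfsd" '$2 == remote { print $1, $NF }'
}
```

Used as `netstat -a | nfs_conn_states s2003`, it reduces the two lines above to `eprod.login SYN_SENT` and `eprod.53759 TIME_WAIT`.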

Code:
eprod/root> snoop s2003
Using device /dev/ce (promiscuous mode)
       eprod ->  s2003   PORTMAP C GETPORT prog=100005 (MOUNT) vers=3 proto=UDP
       s2003 ->  eprod   PORTMAP R GETPORT port=1048
       eprod ->  s2003   MOUNT3 C Null
       s2003 ->  eprod   MOUNT3 R Null 
       eprod ->  s2003   MOUNT3 C Mount /storage
       s2003 ->  eprod   MOUNT3 R Mount OK FH=7593 Auth=none,unix
       eprod ->  s2003   PORTMAP C GETPORT prog=100003 (NFS) vers=3 proto=TCP
       s2003 ->  eprod   PORTMAP R GETPORT port=2049
       eprod ->  s2003   TCP D=2049 S=51997 Syn Seq=1199030867 Len=0 Win=24820 Options=<nop,nop,sackOK,mss 1460>
       s2003 ->  eprod   TCP D=51997 S=2049 Syn Ack=1199030868 Seq=2849360585 Len=0 Win=16384 Options=<mss 1460,nop,nop,sackOK>
       eprod ->  s2003   TCP D=2049 S=51997     Ack=2849360586 Seq=1199030868 Len=0 Win=24820
       eprod ->  s2003   NFS C NULL3
       s2003 ->  eprod   NFS R NULL3 
       eprod ->  s2003   TCP D=2049 S=51997     Ack=2849360614 Seq=1199030988 Len=0 Win=24820
       eprod ->  s2003   TCP D=2049 S=51997 Fin Ack=2849360614 Seq=1199030988 Len=0 Win=24820
       s2003 ->  eprod   TCP D=51997 S=2049     Ack=1199030989 Seq=2849360614 Len=0 Win=65415
       s2003 ->  eprod   TCP D=51997 S=2049 Fin Ack=1199030989 Seq=2849360614 Len=0 Win=65415
       eprod ->  s2003   TCP D=2049 S=51997     Ack=2849360615 Seq=1199030989 Len=0 Win=24820
       eprod ->  s2003   RLOGIN R port=2049 
       eprod ->  s2003   RLOGIN R port=2049 
       eprod ->  s2003   RLOGIN R port=2049 
       ...
       eprod ->  s2003   RLOGIN R port=2049 
       eprod ->  s2003   RLOGIN R port=2049 
       ...
       ...
       ... and so on
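
A note on reading this trace: the repeated "RLOGIN R port=2049" lines all go from eprod to s2003 and never get a reply. snoop appears to label them RLOGIN only because one of the ports is 513 (`login`), which matches the `eprod.login` endpoint in SYN_SENT in the netstat output, so they are most likely the unanswered connection attempts of the real NFS mount rather than rlogin traffic. This is an interpretation of the output above, not something confirmed on the server. A rough sketch for tallying a snoop summary by direction and protocol, so one-way traffic like this stands out (`snoop_tally` is a made-up name):

```shell
# Hypothetical helper: count snoop summary lines per "src -> dst PROTO".
# Lines that do not look like "src -> dst ..." (for example "...") are ignored.
snoop_tally() {
    awk '$2 == "->" { n[$1 " -> " $3 " " $4]++ }
         END { for (k in n) print n[k], k }' | sort -k2
}
```

For example, `snoop s2003 | snoop_tally` (after interrupting snoop) would print one count per direction and protocol; a protocol that only ever appears in the eprod -> s2003 direction is the one going unanswered.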

I rule out a problem on s2003 itself, because I can mount the "/storage" folder from other machines on the same network as eprod.

I've also found that using the UDP protocol instead of TCP (the default) works:
Code:
mount -F nfs -o hard,rsize=32768,wsize=32768,proto=udp s2003:/storage /storage

What could be the problem? How can I solve this issue?

Thanks in advance!

# 2  
Old 07-15-2008
Did you try to stop/start the NFS daemon service on Windows? Maybe it's "locking" the connection for just those 2 hosts.

You can also check on the Sun boxes whether there is a /var/lib/nfs/rmtab (or something similar); you might want to empty it or delete the entries for the 2 specific Sun boxes. Maybe there is a similar file on the Windows server which you can edit.

You can also check:
Linux NFS faq

If this doesn't help and you have already tested a lot, you might think about not using NFS (I don't like it, too many problems on some machines). Maybe try normal Windows shares (SMB) and install/use a Samba client on the Sun boxes?
# 3  
Old 07-15-2008
Yes, I've already stopped and restarted the NFS service on Windows, and on the Solaris boxes "/etc/rmtab" is empty, even on etest where the filesystem is mounted.

About Samba: as far as I know there is no Samba client for SunOS, except Sharity or Sharity Light, which I played with some time ago with only a little success. However, since eprod is a production system, I can't and don't want to install anything on it. I also don't need a persistent shared directory: it's only for temporarily staging a large amount of data, after which I will umount the shared filesystem from both Solaris boxes.
# 4  
Old 07-15-2008
Are you going to be using unix services for windows in production? Doesn't seem like a very wise decision if so, for reasons so obvious I won't go into them.

I've seen this behavior (udp based nfs working when tcp based nfs fails) on an extremely busy network segment on an overutilized client. Are you monitoring these hosts via snmp?
# 5  
Old 07-16-2008
Quote:
Originally Posted by ramen_noodle
Are you going to be using unix services for windows in production? Doesn't seem like a very wise decision if so, for reasons so obvious I won't go into them.
You are right. In fact, I want to use that shared storage only for a big and not-so-critical data transfer between the production and test environments, and then umount it. Unfortunately, I don't have enough disk space on either eprod or etest; otherwise I would have created the files I need on eprod and then transferred them via ftp/scp to etest. With shared storage, I also eliminate the need for a long data transfer.

I am not monitoring the network, but I can't exclude that, for a short period, there was a high load on s2003 or network congestion. Even so, I can't believe that I am now unable to recover from this situation; it seems so strange...
# 6  
Old 07-16-2008
Why don't you post the output of netstat on both hosts, restart mountd and portmap (or analogues) on the server, and then restart portmap on the client and attempt to remount? It could just be that the Windows nfsd and unix-like rpc are buggy.
# 7  
Old 07-16-2008
I think the only remaining step I haven't tried yet is restarting the NFS client and/or related processes on eprod. Unfortunately, I don't have the knowledge to do that.

Could you provide some commands for restarting the NFS client service on Solaris 8 (and 10)? I've googled around, but my greatest concern is disabling something vital or disrupting current connections between clients and eprod, because eprod itself may also be sharing something over NFS that other clients are accessing.
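
For reference, a minimal sketch of the usual restart commands per release, under the assumption of a stock install: Solaris 10 manages the NFS client through SMF, while Solaris 8 uses the `/etc/init.d/nfs.client` init script. The helper name `nfs_client_restart_cmd` is hypothetical, and it deliberately only prints the command instead of running it. On a stock system the `stop` action typically also unmounts the NFS filesystems the host has mounted, so it is worth reading the script (or the SMF method) before running anything on a production box. Directories that eprod itself exports are normally handled by the separate NFS server service, not by the client one.

```shell
# Hypothetical helper: print (do not run) the usual command for restarting
# the NFS client services, given a SunOS release string from `uname -r`.
nfs_client_restart_cmd() {
    case "$1" in
        5.10|5.11)
            # Solaris 10 and later: services are managed through SMF
            echo "svcadm restart svc:/network/nfs/client" ;;
        5.[0-9])
            # Solaris 9 and earlier: init scripts
            echo "/etc/init.d/nfs.client stop; /etc/init.d/nfs.client start" ;;
        *)
            echo "unknown release: $1" >&2
            return 1 ;;
    esac
}
```

On each host, `nfs_client_restart_cmd "$(uname -r)"` shows which form applies there (5.8 on eprod, 5.10 on etest).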