We are facing the problem of file system corruption on DS8300,we have done very much effort to find out the root cause of problem but we still not get any success, we have AIX 5.3 OS installed on system with latest patches, we have upgraded HBA firmwares, DS8300 firmware, System firmware, Upgraded the Fabric Switches firmwares, recently deployed brand new switches but sill the problem exists, when problem occurs we have to down our live services and unmount the affected file system & repair the file system by fsck utility & then we have to restart the services which results in down time of about 30-40 minutes, we have raised the problem to IBM whenever the problem arise but when they analyzed they haven't find any abnormality they analyzed the PE packages in DS but they didn't find any abnormality. Have anyone received these file system corruption error on DS or any suggestion Idea ?
What is the data or application writing or using the 'corrupt' data?
This could be an application problem.
Does the application use raw filesystems?
Get your application provider in the loop and get them talking to IBM.
HTH.
There are a few things that I know you need to have a look
If you are using the new 450 disks and you are using space efficient flash copy you must upgrade to the latest level.
There was problem on this also disk that are SAN boot root disk will have to be recreated in some cases to recover from this problem. This can be done by remirroring and moving the disk but then the old disk must be removed and recreated on the storage.
The corruption of file system was occurred some times on Application File system & some times on Database file system, We are using Bea Weblogic & Oracle 10g Database, previously the corruption error was occurred with a time span of two weeks, now form last two months we noted that the corruption error came on DS8300 after two weeks We do not have HACMP environment but we are using SDDPCM with MPIO for attaching host, we made the VG's & FS available on one host at a time.
The error which came on host was FILE SYSTEM CORRUPTION when we saw the error report by errpt -a it shows some file name j2imap.c and in the end the name of the effected file system was written.Yes fsck always fix it. When the fsck repairing the file system it display messages like super block mark dirty but Fixed.
We do not have HACMP environment but we are using SDDPCM with MPIO for attaching host, we made the VG's & FS available on one host at a time.
This sounds like asking for trouble: if somehow two machines concurrently access the volumes it will result in corrupted filesystems, probably even if not both systems actually write to the disk. I remember reading the HACMP scripts for taking over the shared volumes from one cluster node to another once (back in the days when disks were SCSI or SSA) and they were an absolute nightmare of low-level device manipulation to avoid such problems.
Verify you really really always access the LUNs only from one system at a time.
mpio sddpcm, two vio-server for non-hacmp, two for hacmp systems
p570 Power 6 (9117-MMA) here for this example
oracle 10g with ocr and asm, oracle 10g on jfs2, many db2 v9.1 on jfs2 with SAP, a lot of java applikations, which are always candidates for damaging filesystems, and no problems
I can't tell you whats wrong on your systems, but I can tell you our settings:
if you need more information, feel free to ask ^^
I would run
to trace read/write errors and files accessed
Hi,
While a tar file was created, the file system got full and there was no message on the tar failure. Then the system was shut down and the administrator says because the file system was full the shut down procedure corrupted the file system. I'm wondering, unix should have given some... (2 Replies)
Is IBM System Director good for collecting error and notifications from IBM servers such as x3250 x336 etc... or Please give me brief description for the purpose of IBM system Director
Thanks in advance (1 Reply)
Hi,
could any one tell is there any test-suite or any idea How to do data corruption validation testing, means there is no any data corruption ?
Regards
Manish (1 Reply)
Hi all,
we have iBM p series server on that 4. 3 operating system is runing.but i need ti install 5.2 or 5.3 then i ahve to install oracle 10g release 2 .but we have only 1 GB of RAM.can i install 5.2 or 5.3 with same RAM and please send me a document which discribe about how to install... (5 Replies)
Hi,
When i run the code in solaris unix machine, the file from remote server is getting downloaded. but when i use the same code in IBM AIX remote machine, it is not running. It is saying "Erro during scp transfer." Below is the code.
Please give some resolution.
SCPClient client = new... (1 Reply)
Hi,
All of a sudden I landed in a strange problem.
I was working with my C source code in vi editor. I did a wq! and
when reopened, the file is full of "data".. I mean the text contents are gone!!.
I believe this is a file corruption. I have tried the -r option with vi, but no success.
... (5 Replies)