Error is “sticky” on board4 J3400.


 
Thread Tools Search this Thread
Operating Systems Solaris Error is “sticky” on board4 J3400.
# 1  
Old 08-27-2008
Error is “sticky” on board4 J3400.

I get the following messages in logs :
var/adm/messages:Aug 3 04:35:33 mhs-apps33-d unix: [ID 220797 kern.warning] WARNING: [AFT0] Sticky Softerr encountered on Memory Module Board 4 J3400

/var/adm/messages:Aug 3 04:35:33 mhs-apps33-d SUNW,UltraSPARC-II: [ID 520797 kern.info] [AFT0] errID 0x0007cb98.604a53e5 Corrected Memory Error on Board 4 J3400 is Sticky

/var/adm/messages:Aug 3 04:35:33 mhs-apps33-d SUNW,UltraSPARC-II: [ID 248118 kern.info] [AFT0] errID 0x0007cb98.7ff04f25 Corrected Memory Error on Board 4 J3400 is Persistent

/var/adm/messages:Aug 3 04:35:33 mhs-apps33-d SUNW,UltraSPARC-II: [ID 532138 kern.info] [AFT0] errID 0x0007cb98.7ffa946e Corrected Memory Error on Board 4 J3400 is Persistent

/var/adm/messages:Aug 3 04:35:33 mhs-apps33-d SUNW,UltraSPARC-II: [ID 361962 kern.info] [AFT0] errID 0x0007cb98.80083481 Corrected Memory Error on Board 4 J3400 is Persistent

/var/adm/messages:Aug 3 04:35:34 mhs-apps33-d SUNW,UltraSPARC-II: [ID 651253 kern.info] [AFT0] errID 0x0007cb98.9400e061 Corrected Memory Error on Board 4 J3400 is Persistent

/var/adm/messages:Aug 3 04


Whereas prtdiag shows the belwo output:
System Configuration: Sun Microsystems sun4u 8-slot Sun Enterprise E4500/E5500
System clock frequency: 100 MHz
Memory size: 8192Mb

========================= CPUs =========================

Run Ecache CPU CPU
Brd CPU Module MHz MB Impl. Mask
--- --- ------- ----- ------ ------ ----
0 0 0 400 8.0 US-II 10.0
0 1 1 400 8.0 US-II 10.0
2 4 0 400 8.0 US-II 10.0
2 5 1 400 8.0 US-II 10.0
4 8 0 400 8.0 US-II 10.0
4 9 1 400 8.0 US-II 10.0
6 12 0 400 8.0 US-II 10.0
6 13 1 400 8.0 US-II 10.0


========================= Memory =========================

Intrlv. Intrlv.
Brd Bank MB Status Condition Speed Factor With
--- ----- ---- ------- ---------- ----- ------- -------
0 0 1024 Active OK 60ns 4-way A
0 1 1024 Active OK 60ns 4-way B
2 0 1024 Active OK 60ns 4-way A
2 1 1024 Active OK 60ns 4-way B
4 0 1024 Active OK 60ns 4-way B
4 1 1024 Active OK 60ns 4-way B
6 0 2048 Active OK 60ns 2-way A

========================= IO Cards =========================

Bus Freq
Brd Type MHz Slot Name Model
--- ---- ---- ---------- ---------------------------- --------------------
1 SBus 25 0 lpfs/sd (block) LP9002S
1 SBus 25 2 SUNW,qfe SUNW,sbus-qfe
1 SBus 25 2 SUNW,qfe SUNW,sbus-qfe
1 SBus 25 2 SUNW,qfe SUNW,sbus-qfe
1 SBus 25 2 SUNW,qfe SUNW,sbus-qfe
1 SBus 25 3 SUNW,hme
1 SBus 25 3 SUNW,fas/sd (block)
1 SBus 25 13 SUNW,socal/sf (scsi-3) 501-3060
3 SBus 25 0 lpfs/sd (block) LP9002S
3 SBus 25 2 SUNW,qfe SUNW,sbus-qfe
3 SBus 25 2 SUNW,qfe SUNW,sbus-qfe
3 SBus 25 2 SUNW,qfe SUNW,sbus-qfe
3 SBus 25 2 SUNW,qfe SUNW,sbus-qfe
3 SBus 25 3 SUNW,hme
3 SBus 25 3 SUNW,fas/sd (block)
3 SBus 25 13 SUNW,socal/sf (scsi-3) 501-3060

No failures found in System
===========================

No System Faults found
======================


is there a fault in the DIMM ie is replacement required
# 2  
Old 08-28-2008
I would suggest scheduling downtime and changing the memory stick that is coming up in the error message. This could worsen and the server could crash at a bad time (in the middle of the day, for instance).

If you can fix it over a weekend or something where it won't impact anything, do it.
# 3  
Old 08-28-2008
Error

The bad mem is in slot J3400. Get it replaced as soonSmilie
# 4  
Old 08-28-2008
I agree with the 2 above responses. "Sticky" means a memory error that the kernel can detect and fix but keeps repeating. That is why prtdiag looks ok - the memory stick hasn't totally failed yet so if the kernel keeps correcting the errors the hardware diags think it is ok.

If it keeps coming up so regularly like this eventually it will become uncorrectable and crash your box. May be in 5 minutes, may be in 5 years. But since you don't know when it will crash the prudent thing is to replace the failing memory ASAP, before it gets any more serious.
Login or Register to Ask a Question

Previous Thread | Next Thread

9 More Discussions You Might Find Interesting

1. UNIX for Dummies Questions & Answers

Sticky Folders

Hello everyone I've got a shell script that kicks off a number of django web sites. It allocates socket files in a sockets folder that the nginx uses to pass requests upstream. Problem is on my new ubuntu box, the script seems to run but the socket files that are created don't have the... (1 Reply)
Discussion started by: mjdavies
1 Replies

2. UNIX for Advanced & Expert Users

sticky bit

Hi, I understand the purpose of sticky bit on directories. But I am not very clear about what the sticky bit do on a file. Can any one explain me in detail and with example please. Thanks in advance. (1 Reply)
Discussion started by: praveen_b744
1 Replies

3. UNIX for Dummies Questions & Answers

Sticky Bit????

HI What is sticky bit? how can be see if the sticky bit for file is set? WHat is meaning of sticky bit set on Directory? What is the syntax to set the sticky bit? With example Thanks (10 Replies)
Discussion started by: skyineyes
10 Replies

4. UNIX for Dummies Questions & Answers

Sticky Bit

Hi, could anyone please send me a link to learn/ know more about sticky bits? I am still not clear on the application of using a sticky bits. Thanks for your help. Regards, UP (3 Replies)
Discussion started by: teenu18
3 Replies

5. Shell Programming and Scripting

sticky bit

Hi frns, What is command to list out all dir's for which sticky bit has been set. Regards, Manu (2 Replies)
Discussion started by: manu.vmr
2 Replies

6. UNIX for Dummies Questions & Answers

Sticky Bit

I have the sticky bit set on my /tmp directory, but users are still able to remove files that are not owned by them. Does the /etc/group file get invloved in securing these files ?? (1 Reply)
Discussion started by: rob11g
1 Replies

7. UNIX for Dummies Questions & Answers

sticky bit

What command string would you use to set the sticky bit on a directory that you own? (2 Replies)
Discussion started by: mma_buc_98
2 Replies

8. UNIX for Dummies Questions & Answers

sticky bit??

I have a script that I want to be able to let user 'wcs1234' execute it, but when it runs, it will do so under the higher authority of 'cdunix'. It is my understanding that I accomplish this with a sticky bit. I have tried every variation of this but am unable to get this to work. my script is... (2 Replies)
Discussion started by: hedrict
2 Replies

9. UNIX for Dummies Questions & Answers

Sticky bit

I have a questions, whose answer may be very obvious: Of what use is the sticky-bit permission on a Unix system? I have looked at the chmod(1) man page on our HP-UX playground system, and haven't been given much explanation: Add or delete the save-text-image-on-file- execution (sticky... (3 Replies)
Discussion started by: LivinFree
3 Replies
Login or Register to Ask a Question