The UNIX and Linux Forums  

Go Back   The UNIX and Linux Forums > Top Forums > UNIX for Dummies Questions & Answers
Google UNIX.COM


UNIX for Dummies Questions & Answers If you're not sure where to post a UNIX or Linux question, post it here. All UNIX and Linux newbies welcome !!

More UNIX and Linux Forum Topics You Might Find Helpful
Thread Thread Starter Forum Replies Last Post
How do I know which HBA cards' hardware I have (on Solaris 10) ? ronbarak SUN Solaris 4 03-19-2008 04:27 AM
How do I know which HBA cards' hardware I have (on Solaris 10) ? ronbarak UNIX for Advanced & Expert Users 3 03-05-2008 10:33 PM
Migrating Solaris 9 to different hardware snerta UNIX for Advanced & Expert Users 5 12-24-2006 01:58 AM
hardware support for solaris 9 or 10 rjay.com SUN Solaris 2 12-08-2006 12:24 AM
[need help] about ip hardware error bucci SUN Solaris 1 11-24-2006 08:25 AM

Reply
 
Submit Tools LinkBack Thread Tools Display Modes
  #1 (permalink)  
Old 06-20-2005
Registered User
 

Join Date: Feb 2005
Posts: 13
Digg this Post!Add Post to del.icio.usBookmark Post in TechnoratiFurl this Post!Reddit! Stumble this Post!Spurl this Post!
Solaris hardware error

hi guys,

need some help on this error message.
im running solaris 2.6 on a e3500 and lately i encountered this error:-

lp[28679]: Warning: Received SIGPIPE; continuing
last message repeated 1 time
[AFT0] Multiple Softerrors:
2 Intermittent, 4 Persistent, and 0 Sticky Softerrors accumulated
from Memory Module Board 7 J3300
[AFT0] Enabling verbose CE messages.
[AFT0] errID 0x000054d0.90e01b51 Corrected Memory Error on Board 7 J3300 is Intermittent
[AFT0] errID 0x000054d0.90e01b51 ECC Data Bit 3 was in error and corrected
lp[29236]: Warning: Received SIGPIPE; continuing
lp[28869]: Warning: Received SIGPIPE; continuing
lp[29257]: Warning: Received SIGPIPE; continuing
lp[29506]: Warning: Received SIGPIPE; continuing
lp[29647]: Warning: Received SIGPIPE; continuing
last message repeated 1 time
[AFT0] Corrected Memory Error on CPU18, errID 0x00005869.ace8892a
AFSR 0x00000000.00100000<CE> AFAR 0x00000000.88857730
AFSR.PSYND 0x0000(Score 05) AFSR.ETS 0x00 Fault_PC 0x1000c4e8
UDBH Syndrome 0xc8 Memory Module Board 7 J3300
[AFT0] errID 0x00005869.ace8892a Corrected Memory Error on Board 7 J3300 is Persistent
[AFT0] errID 0x00005869.ace8892a ECC Data Bit 3 was in error and corrected
[AFT0] Corrected Memory Error on CPU19, errID 0x000058c4.581f31c0
AFSR 0x00000000.00100000<CE> AFAR 0x00000000.88857730
AFSR.PSYND 0x0000(Score 05) AFSR.ETS 0x00 Fault_PC 0x1000c4e8
UDBH Syndrome 0xc8 Memory Module Board 7 J3300
[AFT0] errID 0x000058c4.581f31c0 Corrected Memory Error on Board 7 J3300 is Persistent
[AFT0] errID 0x000058c4.581f31c0 ECC Data Bit 3 was in error and corrected AFSR 0x00000000.00100000<CE> AFAR 0x00000000.88857730
AFSR.PSYND 0x0000(Score 05) AFSR.ETS 0x00 Fault_PC 0x1000c4e8
UDBH Syndrome 0xc8 Memory Module Board 7 J3300
[AFT0] errID 0x000058f8.deed0852 Corrected Memory Error on Board 7 J3300 is Persistent
[AFT0] errID 0x000058f8.deed0852 ECC Data Bit 3 was in error and corrected
[AFT0] Corrected Memory Error on CPU14, errID 0x0000597e.b5a4f601
AFSR 0x00000000.00100000<CE> AFAR 0x00000000.88857730
AFSR.PSYND 0x0000(Score 05) AFSR.ETS 0x00 Fault_PC 0x1000c4e8
UDBH Syndrome 0xc8 Memory Module Board 7 J3300
[AFT0] errID 0x0000597e.b5a4f601 Corrected Memory Error on Board 7 J3300 is Persistent
[AFT0] errID 0x0000597e.b5a4f601 ECC Data Bit 3 was in error and corrected
lp[3024]: Warning: Received SIGPIPE; continuing
lp[3185]: Warning: Received SIGPIPE; continuing
last message repeated 1 time
lp[5781]: Warning: Received SIGPIPE; continuing
lp[5885]: Warning: Received SIGPIPE; continuing
lp[5845]: Warning: Received SIGPIPE; continuing
lp[5872]: Warning: Received SIGPIPE; continuing
last message repeated 1 time
lp[7756]: Warning: Received SIGPIPE; continuing
lp[8184]: Warning: Received SIGPIPE; continuing




is there anyone out there who can tell me whats wrong with the machine,i cant go to sunsolve because i dont have sun contract account to solve this problem....it looks like a memory error....

thx in advance....

Last edited by giriplug; 06-20-2005 at 11:43 PM.
Reply With Quote
Forum Sponsor
  #2 (permalink)  
Old 06-20-2005
blowtorch's Avatar
Supporter
 
Join Date: Dec 2004
Location: Singapore
Posts: 2,313
Digg this Post!Add Post to del.icio.usBookmark Post in TechnoratiFurl this Post!Reddit! Stumble this Post!Spurl this Post!
Quote:
Quote from docs.sun.com - man signal.h
Signal No. Default Action Reason
SIGPIPE 13 Exit Broken Pipe
Now from your syslog output,
Quote:
Originally Posted by giriplug
lp[5781]: Warning: Received SIGPIPE; continuing
lp[5885]: Warning: Received SIGPIPE; continuing
lp[5845]: Warning: Received SIGPIPE; continuing
lp[5872]: Warning: Received SIGPIPE; continuing
The process is probably missing some pipefile that it is trying to read from. You could try shutting down lp and starting it up again.
This definitely does not look like a hardware problem.
Reply With Quote
  #3 (permalink)  
Old 06-21-2005
pressy's Avatar
solaris cultist
 

Join Date: Aug 2003
Location: Vienna / Austria (Europe) [EARTH]
Posts: 706
Digg this Post!Add Post to del.icio.usBookmark Post in TechnoratiFurl this Post!Reddit! Stumble this Post!Spurl this Post!
well, it says that your memory is becoming dead:

UDBH Syndrome 0xc8 Memory Module Board 7 J3300
[AFT0] errID 0x000054d0.90e01b51 Corrected Memory Error on Board 7 J3300 is Intermittent
[AFT0] errID 0x000054d0.90e01b51 ECC Data Bit 3 was in error and corrected

that's a memory bank on one of your systemboards.... :
http://www.sun.com/products-n-soluti...02-5032-15.pdf

gP
Reply With Quote
  #4 (permalink)  
Old 06-21-2005
blowtorch's Avatar
Supporter
 
Join Date: Dec 2004
Location: Singapore
Posts: 2,313
Digg this Post!Add Post to del.icio.usBookmark Post in TechnoratiFurl this Post!Reddit! Stumble this Post!Spurl this Post!
Oops, Pressy, I missed the forest for the trees there!
Reply With Quote
  #5 (permalink)  
Old 06-21-2005
Registered User
 

Join Date: Nov 2003
Location: Minnesota
Posts: 379
Digg this Post!Add Post to del.icio.usBookmark Post in TechnoratiFurl this Post!Reddit! Stumble this Post!Spurl this Post!
Unfortunately, due to one of my previous employers thinking that buying production servers from eBay was a good idea, I have TONS of experience with this kind of error.

This is definately a memory error, on dimm J3300 on system board 7. If you notice, it reports CPUs seeing errors several places, but it is a different CPU each time. If the CPU was failing it would always be the same one. But each time it says the memory module the error came from is the same one, which tells you that is the root cause of the error.

Also note, at the top of your output the error was intermittent, but by the bottom the error message said it is persistent. This isn't a good sign . . . Solaris can accomodate occasional memory errors, but if it is the same dimm constantly doing it like that you'll panic your box eventually. I would either replace that memory, or at least remove that bank and run with less memory. Better to run short than crash your box.
Reply With Quote
Google UNIX.COM
Reply

Thread Tools
Display Modes


The 50 most popular UNIX and Linux searches.
Google Search Cloud for The UNIX and Linux Forums
421 service not available, remote server has closed connection ^m automate ftp autosys awk trim bash eval bash for loop boot: cannot open kernel/sparcv9/unix command copy/move folder in unix couldn't set locale correctly curses.h cut command in unix find grep find mtime find null character in a unix file grep multiple lines grep or grep recursive hp-ux ifconfig inaddr_any inappropriate ioctl for device lynx javascript mailx attachment mget mtime ping port remove first character from string in k shell replace space by comma , perl script rsync ftp scp recursive segmentation fault(coredump) sftp script snoop unix solaris change ip address stale nfs file handle syn_sent tar exclude tar extract to folder test: argument expected unix unix .profile unix forum unix forums unix internals unix interview questions unix mtime unix simulator unix.com vi substitute while loop within while loop shell script


All times are GMT -7. The time now is 03:19 AM.


Powered by: vBulletin, Copyright ©2000 - 2006, Jelsoft Enterprises Limited.
The UNIX and Linux Forums Content Copyright ©1993-2008 The CEP Blog All Rights Reserved -Ad Management by RedTyger Visit The Global Fact Book

Content Relevant URLs by vBSEO 3.2.0

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 100 101