Sponsored Content
Full Discussion: Solaris hardware error
Top Forums UNIX for Dummies Questions & Answers Solaris hardware error Post 75656 by rhfrommn on Tuesday 21st of June 2005 10:30:50 AM
Old 06-21-2005
Unfortunately, due to one of my previous employers thinking that buying production servers from eBay was a good idea, I have TONS of experience with this kind of error. Smilie

This is definately a memory error, on dimm J3300 on system board 7. If you notice, it reports CPUs seeing errors several places, but it is a different CPU each time. If the CPU was failing it would always be the same one. But each time it says the memory module the error came from is the same one, which tells you that is the root cause of the error.

Also note, at the top of your output the error was intermittent, but by the bottom the error message said it is persistent. This isn't a good sign . . . Solaris can accomodate occasional memory errors, but if it is the same dimm constantly doing it like that you'll panic your box eventually. I would either replace that memory, or at least remove that bank and run with less memory. Better to run short than crash your box.
 

9 More Discussions You Might Find Interesting

1. Solaris

hardware support for solaris 9 or 10

Hi boss, i want to purches one second hand system intel P3 for practice.vender having mercury mother board.it will support sun 9 or 10. if any problem will create while configure like (ex:-apache) .tell me which configuration (intel) is best for practice.In intel system what are the... (2 Replies)
Discussion started by: rjay.com
2 Replies

2. UNIX for Advanced & Expert Users

Migrating Solaris 9 to different hardware

Hi, I am new to this forum and hope someone can help. Does anyone know how to restore a Solaris 9 backup tape from server A to a completely different server B hardware. Both boxes are sun 64bit. Your help is much appreciated. Thank you Jan. (5 Replies)
Discussion started by: snerta
5 Replies

3. Solaris

Solaris Hardware and ROHS

I'm trying to determine the End of Service Life for some of our Solaris servers, and I'm not sure if our servers are ROHS compliant. Is there a command to determine if the server is ROHS? If not, is there somewhere on the chasis where I can find this information? (2 Replies)
Discussion started by: dangral
2 Replies

4. UNIX for Advanced & Expert Users

How do I know which HBA cards' hardware I have (on Solaris 10) ?

Hi, I'm trying to determine which HBA cards are installed on my Solaris 10 (Sun-Fire-V240) machine. The relevant data I have is below, but from it I cannot ascertain for sure if I have Emulex 10000 or 11000 HBA cards. Can anyone suggest how to determine my HBA hardware ? Thanks, Ron. ... (3 Replies)
Discussion started by: ronbarak
3 Replies

5. Solaris

How do I know which HBA cards' hardware I have (on Solaris 10) ?

Hi, I'm trying to determine which HBA cards are installed on my Solaris 10 (Sun-Fire-V240) machine. The relevant data I have is below, but from it I cannot ascertain for sure if I have Emulex 10000 or 11000 HBA cards. Can anyone suggest how to determine my HBA hardware ? Thanks, Ron. ... (4 Replies)
Discussion started by: ronbarak
4 Replies

6. Solaris

regarding sun solaris hardware

hi :), i am new to this forum and i am in need of some help. one of my friend i having a Sun Ultra 2 UPA/Sbus (UltraSPARC-II 296Mhz) 640 MB ram. i am not sure if that configuration will be helpful to try out some sysadmin commands. this machine has a 21-inch monster monitor, which i think... (5 Replies)
Discussion started by: sudhiroracle
5 Replies

7. UNIX for Advanced & Expert Users

solaris 9 hardware mulfunction

Hello, Im working on solaris 9 and I need to write script which monitoring several hardware componenets for any failures such as memory , Disks , power supply etc. I using prtdiag to extract this info. What should I check in the output for : Memory (Is the block of "Memory Module Groups"... (1 Reply)
Discussion started by: Alalush
1 Replies

8. Solaris

solaris X86 Hardware Checkup (HP HW)

Hi ... i am new about HP -Solaris 10 x86 I have fresh hardware, I need to check the RAM , CPU, BUS , Hard drives mounted Can you please help me with commnds. Thanks in advance (3 Replies)
Discussion started by: anand87
3 Replies

9. Solaris

Solaris 11 no sound despite finding hardware

Hello Everyone, I'm new to Solaris, less than a week to give an idea how green I am.:eek: Although new to UNIX, I've been running Linux (i.e. OpenSUSE, Ultimate Edition, Arch, and obviously Ubuntu) for many years, so, I decided to put Solaris 11 on my 12 core opteron. I had a bit of difficulty... (23 Replies)
Discussion started by: Nostradamus1973
23 Replies
MCE-INJECT(8)						  Linux's Administrator's Manual					     MCE-INJECT(8)

NAME
mce-inject - Inject machine checks into x86 kernel for testing SYNOPSIS
mce-inject mce-file1 ... mce-inject < mce-file DESCRIPTION
mce-inject allows to inject machine checks in a running x86 Linux kernel. machine checks are internal CPU errors. This tool is intended to test the machine check handling code. The injection only happens on the software level and does not simulate full machine check handling on the platform level. The machine checks to be injected are described in a input language similar to the format the kernel outputs on a panic message (with a few extension). In general you should be able to pipe in a panic message to inject that mce. See the IA32 SDM Vol3a chapter 14 for the details on fields. They correspond to the machine check MSRs of the standard IA32 Machine check architecture described there. INPUT LANGUAGE
These generally represent the similarly CPU MSRs, with some metadata on where to inject the exception. Please see the IA software develop- ers Manual for more details. The machine check always starts with MCE or CPU number|broadcast [bank-number] Machine check is injected on CPU xx or broadcasted to all CPUs BANK bank-number Machine check bank to inject on. STATUS {number|fatal|corrected|uncorrected|pcc|uc|val|en|over|ar|ucna|srao|srar|s} The symbols generally represent hardware register bits, as documented in the Intel software developer's manual. When multiple are listed they are ored together. MCGSTATUS {number|mcip|ripv|eipv} The symbols generally represent hardware register bits, as documented in the Intel software developer's manual. When multiple are listed they are ored together. ADDR number RIP number RIP number:<number>{string} RIP number:number MISC number CPU number : machine check exception : number BANK number : number parsed for compatibility with kernel output. Use explicit statements TSC number TSC state injected in machine check NOBROADCAST Don't broadcast exceptions to all CPUs. Default is to broadcast UC IRQBROADCAST Broadcast IRQ to all CPUs that not getting error injection. NMIBROADCAST Broadcast NMI to all CPUs that not getting error injection. IN_IRQ MCE should be injected in interrupt context IN_PROC MCE injected in process context. HOLD Don't inject MCE immediately but hold it for later injection. This is useful for testing injection on multiple CPUs in parallel SOCKETID number Socketid. Ignored by kernel on injection. MCGCAP number Machine check capabilities register value. Currently ignored by kernel on injection. POLL Inject in machine check poller code. EXCP Inject as machine check exception. APICID number APIC ID of CPU. Ignored by kernel on injection. A list of symbols ors the symbols together. Multiple fields can be on a line. number can be hex/octal/decimal in the usual C format. Multiple machine checks can be in a single file, each new one starts with "CPU" or "MCE". For all missing fields reasonable default values are filled in (hopefully) Comments start with # until the end of the line The keywords are case-insensitive. REQUIREMENTS
mce-inject to run as root. /dev/mcelog has to exist. The kernels needs to have machine check injection ( CONFIG_X86_MCE_INJECT ) enabled and if modular the mce-inject module loaded. This option is available in Linux 2.6.31 or later. NOTES
The mce-test package has a lot of input file examples. SEE ALSO
Intel 64 and IA32 Architectures Software Developer's manual, Volume 3, System programming guide Parts 1 and 2. Machine checks are described in Chapter 14 in Part1 and in Append ix E in Part2. Datasheet of your CPU. Intel OTC May 2009 MCE-INJECT(8)
All times are GMT -4. The time now is 08:06 AM.
Unix & Linux Forums Content Copyright 1993-2022. All Rights Reserved.
Privacy Policy