Any hope for this bootlooping Sun V210?


 
Thread Tools Search this Thread
Operating Systems Solaris Any hope for this bootlooping Sun V210?
# 1  
Old 11-27-2018
Any hope for this bootlooping Sun V210?

Hi,


First post here!


I have a Sun V210 that I use occasionally for build testing things big-endian. I switched it on the other day, at it aint comin' up. I was wondering if anyone on this fine forum knows if it can be brought back from the dead.


With the SCC card in, and conencted to the serial management line, if I switch on I get:


Code:
ALOM - POST run incomplete previously, no POST this time

ALOM BOOTMON v1.6.10
ALOM Build Release: 001
Reset register: e8000000 EHRS ESRS LLRS CSRS


Check for Handshake


Returned from Boot Monitor and Handshake



Clearing Memory Cells
Memory Clean Complete


Loading the r�
ALOM - POST run incomplete previously, no POST this time

ALOM BOOTMON v1.6.10
ALOM Build Release: 001
Reset register: e8000000 EHRS ESRS LLRS CSRS


Check for Handshake


Returned from Boot Monitor and Handshake



Clearing Memory Cells
Memory Clean Complete


Loading the r�
ALOM - POST run incomplete previously, no POST this time

The fans are screaming away while these messages are looping.


If I remove the SCC card, there is more info (but not fans):


Code:
ALOM - POST run incomplete previ�
ALOM - Could not get all data from I2C - min post, no power on
ALOM - Could not get diag-switch from I2C

ALOM BOOTMON v1.6.10
ALOM Build Release: 001
Reset register: e0000000 EHRS ESRS LLRS


ALOM POST 1.0


Dual Port Memory Test, PASSED.

TTY External - Internal Loopback Test
TTY External - Internal Loopback Test, PASSED.

TTYC - Internal Loopback Test
TTYC - Internal Loopback Test, PASSED.

TTYD - Internal Loopback Test
TTYD - Internal Loopback Test, PASSED.


Memory Data Lines Test
Memory Data Lines Test, PASSED.

Memory Address Lines Test
  Slide address bits to test open address lines


ERROR: ALOM POST TEST
H/W under test    = Memory System Address Lines
    Test name     = Memory Address Lines Test
    Subtest name  = Memory Address Test 2

    Failure: Writing 0xFF to offset Address of 00000020

              Testing Address Line - SSP_ADDR<26> 

    Most LIKELY cause(s) of this failure include:

      The interconnection may be bad

          SSP_ADDR lines from U0301 to U0503
          SSP_DAT lines from U0301 to U0503

      U0503 the DRAM may be bad

END_ERROR



ERROR: ALOM POST TEST
H/W under test    = Memory System Address Lines
    Test name     = Memory Address Lines Test
    Subtest name  = Memory Address Test 4

    Failure: Base write to 0x0 affected offset of 00000020

              Testing Address Line - SSP_ADDR<26> 

    Most LIKELY cause(s) of this failure include:

      The interconnection may be bad

          SSP_ADDR lines from U0301 to U0503
          SSP_DAT lines from U0301 to U0503

      U0503 the DRAM may be bad

END_ERROR
ERROR: ALOM POST TEST
H/W under test    = Memory System Address Lines
    Test name     = Memory Address Lines Test
    Subtest name  = Memory Address Test 2

    Failure: Writing 0xFF to offset Address of 00200000

              Testing Address Line - SSP_ADDR<22> 

    Most LIKELY cause(s) of this failure include:

      The interconnection may be bad

          SSP_ADDR lines from U0301 to U0503
          SSP_DAT lines from U0301 to U0503

      U0503 the DRAM may be bad

END_ERROR
ERROR: ALOM POST TEST
H/W under test    = Memory System Address Lines
    Test name     = Memory Address Lines Test
    Subtest name  = Memory Address Test 4

    Failure: Base write to 0x0 affected offset of 00200000

              Testing Address Line - SSP_ADDR<22> 

    Most LIKELY cause(s) of this failure include:

      The interconnection may be bad

          SSP_ADDR lines from U0301 to U0503
          SSP_DAT lines from U0301 to U0503

      U0503 the DRAM may be bad

END_ERROR

  Test for shorted address lines
ERROR: ALOM POST TEST
H/W under test    = Memory System Address Lines
    Test name     = Memory Address Lines Test
    Subtest name  = Memory Address Test 5

    Failure: Writing data.
    Error at memory address: 00000020
    Good data was:           00000006
    Bad data was:            00000008
    XOR data was:            0000000e

    Most LIKELY cause(s) of this failure include:

I'm able to invoke the escape menu, and I've tried resetting the ALOM from there. No cigar.


So looks like bad RAM or bad RAM controller? I tried removing all RAM and booting. No change.


I notice a lot of jumpers on the main board, but searching the internet, I can't find their functions. I wonder if any of those could help?



Any hope for this poor machine? Thanks
# 2  
Old 11-28-2018
Hi,

Having had a quick look through the logs that you've posted - I suspect that the ALOM has the issue!

Although it's a long shot here, the system should disable any faulty FRU's and allow you to login over the network. This presumes that the system is set to do that and that the faulty FRU has some redundancy.

You can then use the eeprom command to change the console output to TTYB (10101 on the rear panel) using eeprom setenv input-device ttyb where you should see the ok prompt you may also have to set the console as well.

Regards

Gull04
# 3  
Old 11-28-2018
Hi gull04,


Thanks for the reply.


I'll give that a shot!


I was hoping that the serial ALOM could be revived somehow, but I guess not.


I do actually have a spare mainboard that I could put in, but I've been hesitant because it needs an ALOM password reset and I don't have a Solaris install to hand to run scadm :\
# 4  
Old 11-28-2018
Hi Vext01,

I'm maybe a bit rusty on the vSeries now, but the default ALOM user and password was a "joey" account "admin" - you may be can return to that status by removing the button battery for a while. Also on the v210 remember to transfer the CCS card (Server won't boot without it as it contains both Mac Address and Hostid details).

Regards

Gull04
# 5  
Old 11-28-2018
Sadly the ALOM password has been reset by whoever owned the motherboard before.


Even more sadly, the ALOM password is stored in flash memory, so removing the battery doesn't kill the password.


I found this article (which I'm unable to link to because I'm a new member -- the title is "unbricking a sun fire v210" if you wanted to search for it).



He says:





Quote:
In contrast to x86 systems where you can most often short some jumpers or remove CMOS batteries most values in OpenBoot are stored in persistent storage areas, only the system clock is buffered by a CR2032 battery.

But he does say:
Quote:
I tried powering up the system without any disk drives and the SCC pulled. OBP greeted me with errors on missing IDprom but went straight to a standard ‘ok’ prompt, now I had something to work with…

So I think I can install Solaris via that way, and then reset the ALOM password with scadm.
# 6  
Old 11-28-2018
Hi,

You could try that, if you have a Solaris DVD you could boot that and reset the ALOM password - on the basis that you can get to the ok prompt.

Regards

Gull04
# 7  
Old 11-28-2018
Well, I was unable to install solaris on the other machine, as it was a "managed system", meaning that you can't boot it with the SCC card ejected. It would just shutdown if you tried to turn it on.



I used another SCC card with a known password to boot it. This confirms the password is stored on the SCC card.



The problem I'm faced with now is that I can't get a 'OK>' prompt with 'console -f', even after resetting all ALOM settings to defaults with 'setdefaults -a'.


Any ideas why this would be?
Login or Register to Ask a Question

Previous Thread | Next Thread

7 More Discussions You Might Find Interesting

1. Solaris

Sun Fire V210 CPU Fan Temp too high?

Hey, I have a V210 with a failed CPU fan. The temperature is currently at 84C and I've been asked to wait a few weeks before replacing as its a production system and it cant be shut down yet. Is it too hot? Do I risk killing the CPU at this temp? Its been like this for a few weeks now... (5 Replies)
Discussion started by: magarvo
5 Replies

2. UNIX for Dummies Questions & Answers

New to Forum & Sun Surefire V210 Access

Purchased a Sun Surefire V210 Server off eBay. Unable to Access the Terminal Mode via the Terminal MGT. Using Windows 7 home, and downloaded the ConEmu. The ConEmu brings up a Command line on the PC, and that's it. Being new to all this, I was expecting a Login prompt to pop up. Read the... (22 Replies)
Discussion started by: screenprintr
22 Replies

3. Solaris

Connect using ALOM to Sun Fire V210

I have bought from eBay a second hand Sun Fire V210 server and I'm really stumped at the lack of complete instructions on how to connect to it. I don't have a Windows machine, I've only got Ubuntu and OS X computers. None of them have an old RS-232 port on them either. In saying that, I have... (12 Replies)
Discussion started by: danijeljames
12 Replies

4. Solaris

Booting error in Sun V210

Sun Fire V210, No Keyboard Copyright 1998-2003 Sun Microsystems, Inc. All rights reserved. OpenBoot 4.13.2, 4096 MB memory installed, Serial #61203679. Ethernet address 0:3:ba:a5:e4:df, Host ID: 83a5e4df. Boot device: net File and args: 100 Mbps FDX Link up Timeout waiting for... (5 Replies)
Discussion started by: Mrudhul
5 Replies

5. Solaris

V210 to V440

I'm currently trying to move a perfectly find harddrive from a V210 to a V440. From what I can tell, the disk labeling a bit different, (V210 is c1t0d0 and V440 is c0t0d0). My question is, what all do I have to change to get the V440 to boot off of this with very little complications. Right now, it... (21 Replies)
Discussion started by: adelsin
21 Replies

6. Solaris

V210 to V440

I recently acquired a server for home use. Currently, I'm running a V210. I was wondering of a way to basically swap hard drives into the V440. I would like to avoid reinstall on the V440 for many reasons. Currently on the V440, when I try to boot up it forces itself into System Maintenance... (1 Reply)
Discussion started by: adelsin
1 Replies

7. Solaris

Sun Fire v210 display card

hi all, how can install a display card on a sun fire v210. regards. marcel (2 Replies)
Discussion started by: marcelious
2 Replies
Login or Register to Ask a Question