Sun Fire V245 patching issue


 
Thread Tools Search this Thread
Operating Systems Solaris Sun Fire V245 patching issue
# 1  
Old 09-12-2014
Sun Fire V245 patching issue

Hi All,

I have an server Sun Fire V245 for which we had been trying to patch.
We get a message that the patch has been applied successfully but while trying to reboot after activating the new lu, the server is not booting up. Had reinstalled lustarter pack, required patches during the patching but still we were unable to boot the server with new patch.

I am following the lu uprade method. Have tried 4 times since last year but still it is not coming up.

Any help would be appreciated. Pasted teh error messages below.
Also a person who i do not know has suggested to uninstall CJD module due to which the server was not booting but not sure of CJD module as i couldnt find details regarding it anywhere.

Below is the log i had captured from Alom and the server keeps on looping but does not boot unless i send a break and boot it up.

Regarding CJD, though we were unable to boot with activateed lu, recently when i had to apply a particular single patch, i had applied it directly to the current environment and rebooted and the server got booted though CJD alerts were there so i am not sure whether CJD is a speedbreaker for the server to reboot with activateed lu.

Code:
 
Rebooting with command: boot
Boot device: /pci@1e,600000/pci@0/pci@a/pci@0/pci@8/scsi@1/disk@2,0:a  File and args:
SunOS Release 5.10 Version Generic_150400-11 64-bit
Copyright (c) 1983, 2014, Oracle and/or its affiliates. All rights reserved.
Hardware watchdog enabled
WARNING: emlxs0: Firmware update required.
        (A manual HBA or link reset using luxadm, fcadm, or emlxadm is required.)
Hostname:********** 
WARNING: /pci@1e,600000/pci@0/pci@8/SUNW,emlxs@0/fp@0,0/ssd@w203500a0b839c01c,1f (ssd3):
        Corrupt label; wrong magic number
WARNING: /pci@1e,600000/pci@0/pci@2/SUNW,emlxs@0/fp@0,0/ssd@w202400a0b839c01c,1f (ssd4):
        Corrupt label; wrong magic number
/usr/kernel/drv/sparcv9/cjd symbol log_init multiply defined
NOTICE: mod_install :0
Running cjboot script
Running cjboot start
cjd_start : Starting CJ on root device in cache mode.
NOTICE: attach:came here
NOTICE: attach returns success 314
NOTICE: trying to open cjd min:0
NOTICE: in ioctl...cmd:1074044936
NOTICE: CJ:FLR:ALERT:Allocating deventry 32 32
NOTICE: CJ:FLR:ALERT:Allocating disk 32
NOTICE: CJ:FLR:CRITICAL:DS : 10486144
NOTICE: CJ:FLR:CRITICAL:DS : 1310768
NOTICE: CJ:FLR:ALERT:DS : 81924
NOTICE: CJ:CCH:ALERT:Going for a cache alloc of 81924
NOTICE: CJ:FLR:ALERT:Using bpb value: 16
NOTICE: CJ:FLR:ALERT:Disk allocated/fnd ref : 0
NOTICE: CJ:FLR:ALERT:Switch queue succeeded for 32:32
NOTICE: CJ:CHR:ALERT:after jic
NOTICE: trying to open cjd min:0
NOTICE: in ioctl...cmd:1074044935
NOTICE: CJ:CHR:ALERT:Status 3
Starting Other Devices in Cache :
Exiting cjboot start
WARNING: emlxs1: Firmware update required.
        (A manual HBA or link reset using luxadm, fcadm, or emlxadm is required.)
Configuring devices.
panic[cpu1]/thread=2a1015fbc80: assertion failed: ldi_strategy(vd_lh, bp) == 0 (0x1 == 0x0), file: ../../common/fs/zfs/vdev_d isk.c, line: 411
000002a1015fb540 genunix:assfail3+94 (7aebcfa0, 1, 7aebcfc0, 0, 7aebcfc8, 19b)
  %l0-3: 0000000000000001 000000000000019b 0000000000000000 0000000001878400
  %l4-7: 0000000000000000 0000000001844000 0000000001296c00 0000000000000000
000002a1015fb600 zfs:vdev_disk_io_start+230 (2000, 42000, 60016935708, 60016f6b780, 60016a88000, 2000)
  %l0-3: 000000000000a000 0000000000080001 0000000000080000 000000000000c441
  %l4-7: 0000000000000001 000000007ae75378 000000007ae75000 0000000000000210
000002a1015fb6b0 zfs:zio_execute+b4 (60016bcc128, 0, 10, 6001442c028, 10000, 100000)
  %l0-3: 0000000000015000 00000000705a98e8 00000000001f8000 0000060014a20000
  %l4-7: 0000000000000002 000000007ae6deac 0000000000000080 0000000000000010
000002a1015fb760 zfs:vdev_probe+18c (60014a20000, 0, 0, 705a9800, 42000, 6001442
  %l0-3: 0000060016bcf8d0 0000060014a20618 0000000000000001 000000007ae4cae8
  %l4-7: 000000007ae4c800 000000000000c441 000000000000c441 0000000000000000
000002a1015fb840 zfs:vdev_open+430 (60014a20000, 3ffc40000, 0, 1, 1, 705aa8c8)
  %l0-3: 00000003f8000000 0000000000000009 0000060014a20000 0000000000000007
  %l4-7: 0000000000000000 0000000000000000 000002a1015fbc80 0000000000000000
000002a1015fb900 zfs:vdev_open_child+c (60014a20000, 60016fe98f8, 7ae4d088, 0, 2
  %l0-3: 00000000018684c0 0000000001856f10 00000000018684e0 0000060016fede18
  %l4-7: 0000000000000002 0000000000000002 0000000002000000 0000000000000000
000002a1015fb9b0 genunix:taskq_thread+3cc (60016fede50, 60016fedde8, 600124fc020
  %l0-3: 0000060016fe98f8 0000060016fede18 0000000000000001 0000000000080000
  %l4-7: 0000060016fede08 0000000000010000 00000000fffeffff 0000060016fede10
syncing file systems... 1 1 done
dumping to /dev/dsk/c1t0d0s1, offset 3436183552, content: kernel
 0:08 100% done
100% done: 49856 pages dumped, dump succeeded
rebooting...
SC Alert: Host System has Reset
Probing system devices
Probing memory
Probing I/O buses
screen not found.
keyboard not found.
Keyboard not present.  Using ttya for input and output.
Probing system devices
Probing memory
Probing I/O buses
 
Sun Fire V245, No Keyboard
Copyright 2007 Sun Microsystems, Inc.  All rights reserved.
OpenBoot 4.25.10, 8192 MB memory installed, Serial #78699744.
Ethernet address 0:14:4f:b0:dc:e0, Host ID: 84b0dce0.
 
Rebooting with command: boot
Boot device: /pci@1e,600000/pci@0/pci@a/pci@0/pci@8/scsi@1/disk@2,0:a  File and args:
SunOS Release 5.10 Version Generic_150400-11 64-bit
Copyright (c) 1983, 2014, Oracle and/or its affiliates. All rights reserved.
Hardware watchdog enabled
WARNING: emlxs0: Firmware update required.
        (A manual HBA or link reset using luxadm, fcadm, or emlxadm is required.)
Hostname: ********
WARNING: /pci@1e,600000/pci@0/pci@8/SUNW,emlxs@0/fp@0,0/ssd@w203500a0b839c01c,1f (ssd3):
        Corrupt label; wrong magic number
WARNING: /pci@1e,600000/pci@0/pci@2/SUNW,emlxs@0/fp@0,0/ssd@w202400a0b839c01c,1f (ssd4):
        Corrupt label; wrong magic number
/usr/kernel/drv/sparcv9/cjd symbol log_init multiply defined
NOTICE: mod_install :0
Running cjboot script
Running cjboot start
cjd_start : Starting CJ on root device in cache mode.
NOTICE: attach:came here
NOTICE: attach returns success 314
NOTICE: trying to open cjd min:0


Moderator's Comments:
Mod Comment Please use code tags next time for your code and data. Thanks

Last edited by Rockyc3400; 09-12-2014 at 09:46 AM.. Reason: Modified teh host name
# 2  
Old 09-12-2014
Hi,

You can try the following;

From the ok prompt "boot -F" or "boot -F failsafe" - if you are able to get the M/C up and running it looks like the FCA's need a firmware upgrade and they may not work on the upgraded version without it.

However if the box boots in failsafe mode you should have some more information on the diagnostic output. In which case you'll have to follow the advice.

Regards

Dave
# 3  
Old 09-15-2014
Hi Dave,

Generally what happens with this server is that we patch it using lu and once the server is not comming up, we login through the ALOM and enter into ok prompt and revert the server as we have very less downtime agreed by client.
However this time i will first try for and FCA upgrade and then will try to patch the server and see if it works. Also will try to capture the logs and also try to boot it into failsafe mode and see.

Can you please help me regarding FCA firmware from where i can get them as it is mentioned as emlxs, can it be downloaded from Oracle or do we need to approach for EMULEX. Is there any link from where i can get the firmware.

Regards,
Rocky
# 4  
Old 09-15-2014
What is the make and modell of the emulex adapter? a prtdiag -v output might give a hint (if you are able to provide that). or you have to check manually by open up the server... the FW can be downloaded at support.oracle.com for your HBA...
# 5  
Old 09-15-2014
Hi,

You may be able to get the driver here, but make sure you ger the correct one - older versions of the Emulex cards were very tricky to patch.

In some cases it requited you to actualy modify some of the Hex code in the card - not for the feint hearted.

Dave
# 6  
Old 10-24-2014
Hi Duke & Gull,

Thanks for your help and suggestions throughout.
The issue has been resolved and the server was patched successfully.
The issue was not with the drivers as i had updated the drivers and tried but still failed.
The CJD module was a utility from Backup tool (Backup express), which had modified the kernel paremeters and upon uninstalling the CJD module and patching, the server was patched successfully and booted without any issues.

Thanks again.
Login or Register to Ask a Question

Previous Thread | Next Thread

9 More Discussions You Might Find Interesting

1. Solaris

Sun fire x2270

Hello, I have purchaced an old SUn fire x2270 server . I wanted to make ILOM upgrade to the latest version of software : ILOM 3.0.9.18.a r126592 BIOS vers. 2.09 Server 2.2.3 (10-Aug-2018) Because my version is very outdated. But i can't download the updatebecause it's require... (4 Replies)
Discussion started by: LouisLakoute
4 Replies

2. Hardware

SUN V245 Connecting To router Through Alom

Hello. I have a sun v245 and I have no hard drive for it yet but i would like to connect it to my home router by ethernet cable so I can use Alom remotely. Could someone please tell me how to set it up. Thank You (1 Reply)
Discussion started by: SunV245
1 Replies

3. Solaris

Sun-Fire-V490 Printer Issue After Upgrade of Solaris

Hey Guys I am new here, dont know if any one can assist me with this issue. I have a Sun-Fire-V490 machine that was upgraded to version 9 and patched a few months back. Problem is a few network printers managed by the server is printing an extra page that comes out before and after every print... (0 Replies)
Discussion started by: mprogams
0 Replies

4. Solaris

Sun Fire 4800 is not powering-on

I switched on the power to the server. But, the server did not power on i.e., none of the 3 LEDs on the front panel is lighted. (Power supplies are showing only amber LEDs with "Ready to remove" sign). I tried to turn on the power supplies via System Controller menu (platform shell), but it... (6 Replies)
Discussion started by: solind
6 Replies

5. Solaris

Sun-Fire V440 boot disk issue

Hi, I have Sun Fire V440. Boot disks are mirrored. system crashed and it's not coming up. Error message is Insufficient metadevice database replicas located. Use Metadb to delete databases which are broken. Boot disks are mirrored and other disks are ZFS configuration. Please... (2 Replies)
Discussion started by: samnyc
2 Replies

6. Solaris

Sun Fire 280R Sun Solaris CRT/Monitor requirements

I am new to Sun. I brought Sun Fire 280R to practice UNIX. What are the requirements for the monitor/CRT? Will it burn out old non-Sun CRTs? Does it need LCD monitor? Thanks. (3 Replies)
Discussion started by: bramptonmt
3 Replies

7. Solaris

Sun Fire v440 keeps shutting down

Hello, I hope you can help me. I am new to Sun servers and we have a Sun Fire v440 server in which one power supply failed, we are waiting for new one. But now our server is shutting down constantly. Is there any setting with which we can prevent this behaviour? (1 Reply)
Discussion started by: Tibor
1 Replies

8. UNIX for Dummies Questions & Answers

Sun Fire 280R

Hello all, I'm lost and can't figure this problem out. I have a Sun fire 280R running Solaris 8. Everything was working great. I have one drive in bay 1(not 0). But when I reboot the system it trys to open files in /dev/rdsk/c1t1d0s0. Should it have been opeing /dev/rdsk/c1t0d0s0, the... (4 Replies)
Discussion started by: larryase
4 Replies

9. UNIX for Advanced & Expert Users

Sun Fire v1280 is crashing

I still haven't got an answer to this question... Excerpt from My SysAd Blog A colleague of mine is having a problem with a Sun Fire v1280 server crashing. He tried Googling for the error message in red but hasn't found anything yet. Your insights would be greatly appreciated. "cannot... (2 Replies)
Discussion started by: esofthub
2 Replies
Login or Register to Ask a Question