12-28-2011
Sun Fire v440 Over heat Problem.
Dear Team,
I need some expert advice to my problem.
We have a Sun Fire v440 in our customer Place. Server is working fine and no hardware deviations are found except one problem that processors generating too much heat. I have verified and found that the room temperature was 26-27 degree.
The tempearatures of the processors are also flctuating over the day. If we put a Fan behind the server,the temp decreases to 2-3 degrees . I think due to excess room temp,this overheat issue arises. Is my assumption correct or is there any other factorsare there which related to the overheat issue. Please suggest.
NOTE: No errors found in messages files also
i am sending the logs of the server for better understanding. these are the readings taken after 2 hours duration.
chk_server.sh Wed Dec 28 08:00:00 IST 2011
Wed Dec 21 08:00:00 IST 2011
############# Server Temperature ##############
c0_p0_t_cor | 94
c1_p0_t_cor | 89
c2_p0_t_cor | 93
c3_p0_t_cor | 96
c0_t_amb | 31
c1_t_amb | 31
c2_t_amb | 31
c3_t_amb | 31
scsibp_t_am | 29
mb_t_amb | 35
chk_server.sh Wed Dec 28 10:00:00 IST 2011
############# Server Temperature ##############
c0_p0_t_cor | 91
c1_p0_t_cor | 87
c2_p0_t_cor | 90
c3_p0_t_cor | 93
c0_t_amb | 28
c1_t_amb | 28
c2_t_amb | 28
c3_t_amb | 29
scsibp_t_am | 27
mb_t_amb | 33
Thanks
Sudhansu
10 More Discussions You Might Find Interesting
1. Solaris
Got an curious issue.
I applied 109147-39 to, oh 15 or so various systems all running Jumpstarted Solaris 8. When I hit the first two V440s, they both failed with Return code 139. All non shell commands segfaulted from then on.
The patch modified mainly the linker libraries and commands.
... (2 Replies)
Discussion started by: BOFH
2 Replies
2. Solaris
Hello,
I hope you can help me. I am new to Sun servers and we have a Sun Fire v440 server in which one power supply failed, we are waiting for new one. But now our server is shutting down constantly. Is there any setting with which we can prevent this behaviour? (1 Reply)
Discussion started by: Tibor
1 Replies
3. Solaris
Dear All,
I am facing a specfic problem with my New SunFire T2000.Recently we bought new sunfire T2000 sparc server.When i am trying to install solaris 10 through cdrom , I get error messgae
Error:Last Trap: Instaruction Access Exception
{0} ok boot cdrom
Boot device:... (6 Replies)
Discussion started by: solaris8in
6 Replies
4. Solaris
First of all it's shut down 60 second after power on and write on console :
SC Alert: Correct SCC not replaced - shutting managed system down!
This is cured by moving out battery from ALOM card.
Now server start to loop during the testing.
That's on the console:
>@(#) Sun Fire V440,Netra... (14 Replies)
Discussion started by: Alisher
14 Replies
5. Solaris
Hello,
I am seeing error messages in V440 (OS = solaris 8). I have copied here :
The system does not reboot constantly and it is up for last 67 days. One more interesting thing I found, I see errors start appearing at 4:52AM last until 6am and again start at 16:52am on same day..
I... (5 Replies)
Discussion started by: upengan78
5 Replies
6. Solaris
Hi,
I was asked to connect a KVM screen to a Sun Fire V440 last night so I connected it up but no joy and nothing on the KVM screen. I was told that a reboot may fix the problem so connected to the ALOM and rebooted. On the plus side, the KVM screen now works but I lost the ALOM connection.
... (0 Replies)
Discussion started by: jimmy54321
0 Replies
7. Solaris
Hi:
I bougth an used Sun Fire v440, and It have a firmware password. When I turn on the server, it ask for firmware password. (I don 't know what is the correct password). I can access to SC, but when I want to access to OBP, Firmware Password appears again. I remove the battery for two hours,... (1 Reply)
Discussion started by: mguazzardo
1 Replies
8. Solaris
Hi,
I have Sun Fire V440. Boot disks are mirrored. system crashed and it's not coming up. Error message is
Insufficient metadevice database replicas located. Use Metadb to delete databases which are broken.
Boot disks are mirrored and other disks are ZFS configuration. Please... (2 Replies)
Discussion started by: samnyc
2 Replies
9. Solaris
Hi,
I have a SUN Fire V440 server running Solaris 8. One of the 4 disks do not appear when issued the format command. The "ready to remove" LED is not on either.
Metastat command warns that this disk "Needs maintenace". Can I just shutdown and power off the machine and then insert an... (5 Replies)
Discussion started by: Echo68
5 Replies
10. Solaris
Hi,
I have a Sun Fire V440 server that fails to boot up correctly. A lot of services are not started and the sytems acts really slow to commands. During boot I can see the following Error:
WARNING: /pci@1f,700000/scsi@2/sd@0,0 (sd1):
SCSI transport failed: reason 'reset': retrying... (15 Replies)
Discussion started by: oliwei
15 Replies
wrsm(7D) Devices wrsm(7D)
NAME
wrsm - WCI Remote Shared Memory (WRSM) device driver
SYNOPSIS
wci@<slot>,0:wrsm
wrsm@<instance>:ctrl
wrsm@ffff,0:admin
DESCRIPTION
The wrsm driver is a nexus driver that manages Sun Fire Link devices and wrsm controllers.
A WCI device on a Sun Fire Link board is attached directly to the host system bus and provides clustering communication between Solaris
instances that are memory transaction-based. The WCI acts as a memory controller on the system backplane. The wrsm driver programs regis-
ters on the WCI to accept network read/write requests on certain exported cluster addresses from incoming links. The registers translate
the requests into local read/write bus transactions that use local physical memory ranges that you specify. The driver programs additional
WCI registers to forward local system backplane read/write transactions within a particular physical address range to a remote WCI. A WCI
device in the format wci@slot,0:wrsm appears in the device tree.
A wrsm controller is a pseudo device that manages a set of WCIs. A device entry in the format wrsm@<instance>:ctrl appears in the device
tree. A wrsm controller presents a Sun proprietary protocol to clients, enabling them to set up the network and to communicate through the
WCIs. To configure a wrsm controller, you download a configuration into the driver using the wrsmconf(1M) command or through other external
WCI network management software. Status information on each WCI and wrsm controller is available by using the wrsmstat(1M) command.
The wrsm admin device is used internally by the driver to manage the I/O addresses associated with remote memory. A device entry in the
format wrsm@ffff,0:admin appears in the device tree
FILES
/platform/sun4u/kernel/drv/sparcv9/wrsm
ELF kernel module
SEE ALSO
wrsmconf(1M), wrsmstat(1M)
Writing Device Drivers
DIAGNOSTICS
The messages described below may appear on the system console as well as being logged. These messages generally include the string wrsm%d,
where %d is the instance number of the wrsm device. The message context indicates whether the device is a WCI or a wrsm controller. Some
messages include the string wci %a, where %a is the bus slot of the WCI device.
wrsm%d: unable to map register set %d
Driver was unable to map device registers; check for bad hardware. Driver did not attach device, device will be inaccessible.
wrsm_detach:cf_remove_controller failed for wrsm%d
Driver did not detach device; device is inaccessible.
wrsm_detach:cf_remove_wci failed for wrsm%d
Driver did not detach device. This WCI is the last WCI in wrsm controller.
register_controller of wrsm%d failed with error %d
The wrsm controller could not register with the Sun proprietary protocol framework. Communication is not possible through this con-
troller.
wrsm%d, wci %a, SRAM CE ERROR, at address: 0x%x, syndrome:0x%x
There was a correctable error in the WCI's SRAM. This indicates that the memory on this WCI module should be replaced.
wrsm%d, wci %a, SRAM UE ERROR, at address: 0x%x, syndrome:0x%x
There was an uncorrectable error in the WCI's SRAM. This indicates that the memory on this WCI module should be replaced. In addition,
attempts to access local memory from remote nodes may fail.
SunOS 5.10 17 Nov 2002 wrsm(7D)