Bizarre Sun T5240 behavior


 
Thread Tools Search this Thread
Operating Systems Solaris Bizarre Sun T5240 behavior
# 1  
Old 08-03-2011
Bizarre Sun T5240 behavior

Hi -

I have a T5240 with 7 LDOMS configured. One night, network comm was broken somehow. Nobody was doing anything on the machine at the time. Here is what I saw in messages:

Code:
WARNING: nxge3 : nxge_dma_mem_alloc: ddi_dma_mem_alloc kmem alloc failed
WARNING: nxge3 : nxge_alloc_rx_buf_dma: Alloc Failed: dma 12 size_index 10 size requested 4194304
WARNING: nxge3 : ==> nxge_alloc_rx_buf_dma: not enough for channel 12 allocated 0x200000 requested
WARNING: nxge3 : <== nxge_init_rxdma: status 0x40000000
WARNING: nxge3: nxge_grp_dc_add (12): channel init failed
NOTICE: nxge3: xcvr addr: 0x1a - link is down

So, it took down ALL of the interfaces (the entire quad card). It appears that it cannot allocate memory..... The funny thing is, I am not even using nxge3 - nothing is plugged into it, it has never been plumbed. I AM using nxge0,1,2 as aggr1 for vsw0. I shutdown the machine, pulled the power plugs and then booted.

After it came up, the network looked fine. The vsw0 was working properly. But, "svcs -a" reported the following problems:

Code:
maintenance  12:19:58 svc:/ldoms/vntsd:default
maintenance  12:21:36 svc:/ldoms/ldmd: default

So, I could not start my ldoms. Doing the following, resulted in such:

Code:
cat /var/svc/log/ldoms-ldmd:default.log

......
warning: unable to reconfigure CPUs in guest primary
Executing start method ("/opt/SUNWldm/bin/ldmd_start")
Method or service timed out. Killing contract 43

Also, about the same time, in dmesg output:

Code:
vdc: [ID 995498 kern.notice] NOTICE: [2} disk access failed.

After a while poking around and such and running svcadm enable/restart, etc. We got the ldm and vntsd running again - independant of the service mgmt facility. It still shows them in maintenance mode.

I brought up all of the LDOMS successfully and it all seems to be running fine.

I am just not sure what started this whole thing. Now I am getting (where I was not getting them before):

Code:
nxge: [ID 339653 kern.notice] NOTICE: nxge3: xcvr addr:0x1a - link is down.

Any ideas at all?????

Last edited by pludi; 08-03-2011 at 05:13 AM..
# 2  
Old 08-03-2011
Maybe someone filled /tmp=swap, so malloc()/brk() was failing.

Maybe that link has some configuration parameter too high, tries to malloc() for 4GB in a 32 bit app.

Got all patches? http://wesunsolve.net/bugid/id/6768523

Maybe the card is broken? Static discharge?

Last edited by DGPickett; 08-03-2011 at 01:45 PM..
# 3  
Old 08-03-2011
Bizarre Sun T5240 behavior

Thanks DGPickett - I'll check it out.
Login or Register to Ask a Question

Previous Thread | Next Thread

9 More Discussions You Might Find Interesting

1. Solaris

Using a Modem on a T5-2 or a T5240

Hi Folks, Just a quick question - hopefully! I have an application currently running on a V890 with Solaris 9, I'd like to move this to either one of our T-5's or one of the T5240's in a Legacy container on an LDOM - but the fly in the ointment is the application still uses a standard Hayes... (3 Replies)
Discussion started by: gull04
3 Replies

2. Solaris

Cannot configure RAID on T5240

Hi ! I've been given a T5240 with 4 disks and 2 HBA cards (but no array connected). I did a factory reset on SP and NVRAM clean on OBP because the server had been used before. I boot cdrom in single mode and try to create a hw mirror with disks from c1... but only c2 is seen by raidctl. ... (2 Replies)
Discussion started by: delavega
2 Replies

3. Solaris

ufsrestore T5240

Hi Friends we have sun T5240 server, we have taken ufsdump of this server remotely with scsi tapedrive, If we need to do ufsrestore means what we have to do, since T5240 has not having scsi port, any procedure is there? Regards Rajasekar (5 Replies)
Discussion started by: rajasekg
5 Replies

4. Solaris

SUN T5240 vs M3000

Hi, We are planning to buy new server for our data center. Sun T5240 or M3000 which one have better performance, we are going to create many dt sessions in this server. So, i need your suggestions. RJS (4 Replies)
Discussion started by: rajasekg
4 Replies

5. Solaris

Raidctl - Sun T5240 Solaris 10 Problem

I tried using raidctl earlier today to use my 2 disks in a RAID1 setup and I totally destroyed my OS install. I'm sure I did something funky and it freaked out. No big deal...right? This is what I was seeing after a reboot. I decided to just reinstall the OS. It let me go through all of... (3 Replies)
Discussion started by: kingdbag
3 Replies

6. Solaris

cannot plumb nxge on T5240

its a fresh installation. during the OS setup, it did not prompt for IP, netmask and gateway. using Solaris 10 08/07 update 4. I tried to plumb manually but encountered no such interface error. but nxge interfaces can be greped from the /etc/path_to_inst file. getting similar error on... (3 Replies)
Discussion started by: incredible
3 Replies

7. Shell Programming and Scripting

bash-function with array acting bizarre, bug?

Hello, basically what this script is supposed to do is showing a list of hosts that is given a number, that you will be able to choose from a list. A check is made to verify that the chosen number is within the array and this is where things go bad and I don't know why, bizarre. I've spent... (5 Replies)
Discussion started by: gand
5 Replies

8. Solaris

raidctl on SUN T5240

Setting up a T5240 with two disks c1t0d0 and c1t1d0. I am trying to use raidctl but when I issue. raidctl -l I get Controller 1 Disk: 0.0.0 Disk: 0.1.0 So I try raidctl -c '0.0.0 0.1.0' -r 1 1 and I get "Array in use." I try (4 Replies)
Discussion started by: photon
4 Replies

9. Solaris

Weird behavior on a Sun Fire V120 running solaris 10.

All, After a power loss I went to power on our sun fire v120 that is running solaris 10 and now it will not boot. I tried power cycling it from the lom and pulling the cord but nothing works. All it does is after a power cycle it will start to boot and then start to spit out a bunch of hex... (2 Replies)
Discussion started by: jsandova
2 Replies
Login or Register to Ask a Question