LDoms disappeared


 
# 1  
Old 11-21-2017

System: SPARC S7-2 Server; 2x 8-core CPUs; 128 GB RAM; 2x 600 GB HDD.

I have been experimenting on the above system, using ldmp2v to create "clones" of my physical systems as LDoms on the server, when there was an unscheduled power outage. After the system came back up, my LDoms were gone, although the ZFS volume backends still exist on the system.

Now, I am ready to accept the blame: I should have saved a configuration so that the system knew how to bring them back up. However, I haven't come across anything in the documentation that says I had to do this, or what was required to do so.

Now it is possible that I should have used
Code:
ldm list-constraints -x domain

to save the LDom configuration, but where do I save it to?

Also, can I build new LDoms manually and attach the existing backends to them, or should I just delete the backends and start again?

As an aside, is it possible to mount and read these on the server?

Andrew
# 2  
Old 11-21-2017
It looks like you only lost the configuration, not the actual LDoms and the data inside them.
This is strange; even after a power loss I have never lost the whole configuration.

ldm list-constraints -x domain > /my/path/domain.xml will save the domain's constraints in XML format, which you can copy or back up anywhere you like.
Check out this link :
Saving Domain Configurations for Future Rebuilding - Oracle VM Server for SPARC 3.0 Administration Guide
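
The per-domain save described above can be wrapped in a small function so that every domain gets its own XML file. This is only a sketch: the function name and the default backup directory are made up for illustration, and it assumes the parseable output of ldm list -p has lines of the form DOMAIN|name=...|state=...

```shell
#!/bin/sh
# Sketch: dump the constraints of every domain to <backup_dir>/<domain>.xml.
# save_domain_constraints and /var/tmp/ldom-backup are hypothetical names.
save_domain_constraints() (
    backup_dir=${1:-/var/tmp/ldom-backup}
    mkdir -p "$backup_dir" || exit 1

    # Pull the domain names out of the parseable listing
    # (assumed format: DOMAIN|name=ldg1|state=active|...).
    ldm list -p |
    sed -n 's/^DOMAIN|name=\([^|]*\)|.*/\1/p' |
    while read -r dom; do
        ldm list-constraints -x "$dom" > "$backup_dir/$dom.xml" || exit 1
        echo "saved $backup_dir/$dom.xml"
    done
)
```

Calling save_domain_constraints /export/backup/ldoms writes one file per domain. Copy the directory off the server as well, since a local copy does not help if the disks are lost.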

Does ldm list-spconfig output anything besides factory-default?

Also, if you know which backend devices you used for which LDom, you can create a new one using that disk backend and boot the LDom.
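
Rebuilding a domain around an existing backend could look roughly like the sketch below. The domain name, CPU and memory sizes, volume name, and the service names primary-vds0 and primary-vsw0 are all assumptions for illustration; substitute whatever your control domain actually provides.

```shell
#!/bin/sh
# Sketch: recreate a guest domain and attach an existing ZFS volume backend.
# rebuild_ldom and every name or size below are illustrative assumptions.
rebuild_ldom() (
    dom=$1                      # e.g. ldg1
    backend=$2                  # e.g. the /dev/zvol/dsk/... path of the volume
    vds=${3:-primary-vds0}      # virtual disk service (assumed name)
    vsw=${4:-primary-vsw0}      # virtual switch (assumed name)

    if [ -z "$dom" ] || [ -z "$backend" ]; then
        echo "usage: rebuild_ldom domain backend [vds] [vsw]" >&2
        exit 2
    fi

    ldm add-domain "$dom" &&
    ldm add-vcpu 8 "$dom" &&
    ldm add-memory 8G "$dom" &&
    # Export the existing backend through the disk service, then
    # present it to the guest as its first virtual disk.
    ldm add-vdsdev "$backend" "${dom}-disk0@${vds}" &&
    ldm add-vdisk vdisk0 "${dom}-disk0@${vds}" "$dom" &&
    ldm add-vnet vnet0 "$vsw" "$dom" &&
    ldm bind-domain "$dom" &&
    ldm start-domain "$dom"
)
```

Once the domain boots from the old backend, save the new layout to the SP so that it survives the next power cycle.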

Regards
Peasant.
# 3  
Old 11-22-2017
Hi,

Just as an aside, you can always import the zpools into the service domain and check them if you have to.

As there is likely to be a clash in the names of the rpools, you will probably have to use the numeric ID to import, or you can import under a different name, as in:

Code:
#> zpool import "oldname" "newname"

Or

Code:
#> zpool import 000001234567
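
The two forms can be combined, and adding an alternate root keeps the guest's file systems from mounting over those of the service domain. A minimal sketch, assuming the pool is visible to a bare zpool import on the service domain and that the guest using it is not running; the function names and /mnt/guest are made up for illustration:

```shell
#!/bin/sh
# Sketch: import a guest's pool by numeric id, under a new name and an
# alternate root, then export it again when finished.
# import_guest_pool, release_guest_pool and /mnt/guest are hypothetical.
import_guest_pool() (
    pool_id=$1                  # numeric id shown by a bare "zpool import"
    new_name=$2                 # name that does not clash with the local rpool
    altroot=${3:-/mnt/guest}    # guest mountpoints land under this directory

    zpool import -R "$altroot" "$pool_id" "$new_name"
)

release_guest_pool() (
    # Export before the guest domain is started again.
    zpool export "$1"
)
```

For example, import_guest_pool 000001234567 guest_rpool mounts the guest's file systems under /mnt/guest, and release_guest_pool guest_rpool hands the pool back.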

Regards

Gull04
# 4  
Old 11-22-2017
Quote:
Originally Posted by Peasant
It looks like you only lost the configuration, not the actual LDoms and the data inside them.
This is strange; even after a power loss I have never lost the whole configuration.

ldm list-constraints -x domain > /my/path/domain.xml will save the domain's constraints in XML format, which you can copy or back up anywhere you like.
Check out this link :
Saving Domain Configurations for Future Rebuilding - Oracle VM Server for SPARC 3.0 Administration Guide
Thanks. I'll certainly check that out.
Quote:
Does ldm list-spconfig output anything besides factory-default?
Funny story. While trying to figure out what happened, I noticed that ldm list showed the factory default of 128 VCPUs and 129280M memory, rather than the values I used when following the instructions here (Configuring the Control Domain). As I had pasted what I had done into a personal wiki, I checked it and noticed this:
Code:
sog01(31)$ sudo ldm add-config initial
Error: Operation failed because a configuration
named "initial" already exists on the system controller.
Before being able to save a new configuration with
this name the existing one must be removed
sog01(32)$ ldm list-config
factory-default
initial [next poweron]

I missed that error at the time. So yesterday I followed the instructions again, this time changing the add-config command to
Code:
sudo ldm add-config main

and then checked that the configuration stuck by shutting down and literally pulling the plug out of the wall. Now:
Code:
sog01(13)$ ldm list-config
factory-default
initial
main [current]

Quote:
Also, if you know which backend devices you used for which LDom, you can create a new one using that disk backend and boot the LDom.

Regards
Peasant.
Okay, I'll give that a try.

Andrew
# 5  
Old 11-22-2017
This is strange behavior.
Such systems should never lose configuration in that manner (reset to factory).

This would indicate issues with the ILOM, since the configuration is saved there and retrieved by the hypervisor during boot.
Did you ever issue the save configuration command, ldm add-spconfig <friendly_name_date>, on that system?

Check the firmware version and update it if you can (since you have downtime now).

As for the hypervisor configuration, you will need to reconfigure primary with a couple of cores and a few GB of RAM, rather than leaving the machine's entire capacity assigned to primary (which, in your case, is the control domain as well).

The remaining resources (CPU, RAM) are left for the LDoms.
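
Shrinking primary is done as a delayed reconfiguration. The sketch below shows the general shape; the function name and the default values of 2 cores and 16G are illustrative assumptions, not a sizing recommendation for this server.

```shell
#!/bin/sh
# Sketch: trim the primary domain so the remaining cores and memory
# are free for guest domains.
# configure_primary and its default sizes are illustrative assumptions.
configure_primary() (
    cores=${1:-2}
    mem=${2:-16G}

    # Changes to primary are queued and take effect at its next reboot.
    ldm start-reconf primary &&
    ldm set-core "$cores" primary &&
    ldm set-memory "$mem" primary
)
```

After running it, save the configuration to the SP under a name that does not exist yet and reboot primary, so that the smaller layout is the one loaded at the next power-on.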

Consider limiting the ZFS ARC on the primary domain by using set user_reserve_hint_pct=80 in /etc/system.

What are you using as disk devices for the LDoms?

Hope that helps
Regards
Peasant.
# 6  
Old 11-22-2017
Quote:
Originally Posted by Peasant
This is strange behavior.
Such systems should never lose configuration in that manner (reset to factory).

This would indicate issues with the ILOM, since the configuration is saved there and retrieved by the hypervisor during boot.
Did you ever issue the save configuration command, ldm add-spconfig <friendly_name_date>, on that system?
...

Hope that helps
Regards
Peasant.
Should I be running that every time I add/modify an LDom? The documentation is pretty poor and appears to be aimed at those who know what they are doing.

Andrew
# 7  
Old 11-23-2017
Yes, you should save the configuration to the SP.
You can do it from crontab via a script, or manually after each configuration change.

There are also some automatic options, which I have never explored.
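
A scripted save might look like the sketch below. It gives each configuration a timestamped name, which avoids the "already exists" error shown earlier in the thread. The function name and the default prefix are made up for illustration.

```shell
#!/bin/sh
# Sketch: save the current domain configuration to the SP under a
# unique, dated name.
# save_spconfig and the "auto" prefix are hypothetical names.
save_spconfig() (
    prefix=${1:-auto}
    name="${prefix}_$(date +%Y%m%d_%H%M%S)"

    # ldm refuses to overwrite an existing name, so a failure here
    # is reported to the caller instead of being ignored.
    ldm add-spconfig "$name" || exit 1
    echo "$name"
)
```

Run it after each change, or call it from a cron job. The SP holds only a limited number of configurations, so a real script would also need to remove old ones with ldm remove-spconfig.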

Regards
Peasant.