01-06-2013
Servers lacked maintenance, here's my to-do list
I'll be taking over administration of a rack of Solaris machines that haven't had an admin for the last 9 months. Prior to that they had limited maintenance. I understand there are a few tickets that will need to be addressed, but I won't have the details on them for a few days. Regardless, I'm trying to compile a to-do list. What would you add to this list?
- Check for hardware failures (disks, fans, PSUs, etc.) and repair as needed
- Ensure backups are being taken and are restorable
- Snapshot filesystems
- Check who has permissions to access these servers, internally and externally. Verify they all should have access.
- Reset the root passwords, and check who else may have root access via sudo, PowerBroker (if used), or UID 0 accounts.
- Check all installed packages for known vulnerabilities, update as needed
- Verify I have account access to the SC/SP/ALOM/ILOM over the serial console. If I don't have access, look into resetting the password.
- Set up monitoring that notifies me of issues immediately.
- Identify critical apps, machines, etc. and prioritize them for support
- Acquire the Oracle Support agreement details so that if/when I need them, I have ready access.
- Check the cron tables on each system as well, just to see what the prior admins have tried to automate (system admin related or application related).
- Check the messages file on each system as well to catch any other issues that may have been written via syslog.
- Review logs specifically with a view to what has happened before/after reboots to return the server to the expected state.
- Check /var/crash/<hostname> to see whether, and when, the server last panicked.
- Check whether application startup and shutdown are implemented well, and whether they are automatic or manual
- Check for possible dependencies on other systems. Track ingoing and outgoing traffic if needed to check dependencies.
- Check external hardware, for example NAS / SAN disk arrays, network and SAN switches, UPS, air conditioning, etc.
- Try to track down documentation and, if possible, reports of past changes and logs. If none are available, see if I can interview the old admin.
- Acquire a test system so I can try stuff out.
- Make a runbook.
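For the root-access item, a first pass could be a small script that lists every account with UID 0. This is only a sketch: it assumes the standard colon-separated passwd layout, the function name `uid_zero_accounts` is made up for illustration, and it says nothing about sudo or PowerBroker rules, which would need a separate review.

```python
def uid_zero_accounts(passwd_text):
    """Return login names whose UID field is 0 in passwd-format text."""
    names = []
    for line in passwd_text.splitlines():
        line = line.strip()
        # Skip blank lines and comments.
        if not line or line.startswith("#"):
            continue
        # Assumed layout: name:password:UID:GID:gecos:home:shell
        fields = line.split(":")
        if len(fields) < 3:
            continue
        try:
            uid = int(fields[2])
        except ValueError:
            # Malformed UID field; ignore the entry in this sketch.
            continue
        if uid == 0:
            names.append(fields[0])
    return names
```

Feeding it the contents of /etc/passwd from each host should show whether anything besides root carries UID 0.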
What would you add/change/remove on this list? Thanks in advance for your help.
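As a starting point for the messages-file check, one option is to filter the log for lines that mention common trouble keywords. A minimal sketch, assuming plain-text syslog lines; the keyword list and the function name `suspicious_lines` are my own guesses for illustration, not anything Solaris defines.

```python
# Assumed keyword list; adjust to whatever matters at your site.
KEYWORDS = ("error", "warning", "panic", "fail")


def suspicious_lines(log_text, keywords=KEYWORDS):
    """Return the log lines that contain any keyword, ignoring case."""
    hits = []
    for line in log_text.splitlines():
        lowered = line.lower()
        if any(word in lowered for word in keywords):
            hits.append(line)
    return hits
```

Passing it the text of the messages file from each system gives a short list to read through first. Plain substring matching will produce some false positives, so treat the output as a pointer, not a verdict.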
Last edited by DustinT; 01-07-2013 at 09:36 AM..
Reason: Added prioritize. Added Scrutinize and Bryan's tips. Attempted to prioritize.