Quote:
Originally Posted by
angshu
My biggest problem is , am totally blank when it comes to the idea of routine jobs that are performed by unix admins.
Off the top of my head....
1. machines have to start, did this go okay, was it scheduled, were any suprises found, did all the expected/required services/daemons start?
2. machines use resources, is their sufficient capacity for the anticipated tasks? What do you do if there isn't?
3. what is appearing in the logfiles, are the things you expect there, and the things you don't not there?
4. how are the machines performing?
5. are the backups scheduled? occuring? reliable? offsite?
6. are there any security alerts from the manufacturer or third parties?
7. ditto for software updates.
8. then do this for a load of machines.
9. think about proactively avoiding problems
10. and automating as much as possible, grow your own toolbox for this
11. make sure all changes are authorised through change management
12. be responsive to emergencies and failures