Intermittent System Reboots


 
# 1  
Old 10-24-2002

Hi all,

Just started holidays (ya!) and Murphy's Law has kicked in already (doh!).

I look after (when at work) two SCO 5.0.5 systems running on Netfinity 5500 servers (Model # 8662-3RY). Every once in a while the production server just reboots itself. There is no mention of a cause in the system, debug, message, or error logs, and I'm now looking for other sources of information that might point to a cause.

Does anyone have any ideas or suggestions as to where I might look, or has anyone seen a similar incident in their own travels?
# 2  
Old 10-24-2002
I once heard the story about the cleaner who pulled out the power cord because he needed the outlet for his vacuum cleaner.
# 3  
Old 10-24-2002
I've heard that story too. Another is that the cleaner plugged his equipment into a power strip and overloaded it, causing a row of servers to go down.

As far as the real problem here goes, there is not much you can do. If there is nothing in ANY of the error logs, you could always call in hardware support to run tests (and yes, I know this is production). Probably your best bet is to take a full backup of the server, buy/steal/borrow the exact same type of equipment, build a duplicate, and see if the problem shows up on it. If it does, the cause is software, or something running at the time of the crash. If it does not, the cause is hardware on the original, but good luck figuring it out until it either sends a message or fails completely (memory is usually the culprit in those situations).
# 4  
Old 10-24-2002
Guys,

Found this in /var/adm/syslog (I must have been looking on planet Mars the first time). The following is an extract from the syslog, generated by a script that checks the log every evening at 23:55.
Code:
Oct 24 12:37:37 www3 lmail[27608]: Cannot open /usr/spool/mail/nobody: Operation would block
Oct 24 14:46:50 www3 lpd[381]: unknown printer: lf=/var/spool/lp/logs/output_log
Oct 24 14:46:57 www3 ifor_sld: PMDCT: Error accepting server side connection. (PM_THREAD_IPC_TIMEOUT) 
Oct 24 14:46:57 www3 ifor_sld: PMDCT: Error accepting server side connection. (PM_THREAD_IPC_TIMEOUT) 
Oct 24 14:46:57 www3 ifor_pmd: cleanup; terminating
Oct 24 14:46:57 www3 ifor_pmd: cleanup; terminating
Oct 24 14:46:57 www3 sco_cpd: cpd: pmd died
Oct 24 14:46:57 www3 Xsco[406]: Xsco: ERROR- Failed to initialize policy manager. (IFOR_PM_FATAL)
Oct 24 14:46:57 www3 Xsco[406]: Xsco: ERROR- Failed to initialize policy manager. (IFOR_PM_FATAL)
Oct 24 14:46:57 www3 ifor_pmd: terminated with status 100
Oct 24 14:46:57 www3 ifor_pmd: terminated with status 100
Oct 24 14:47:00 www3 ifor_pmd: ^M
         The Licensing Policy Manager Daemon (ifor_pmd) has terminated^M
         and been restarted.  This is a normal occurrence only when a^M
         license is removed with the License Manager utility.  If this is^M
         not the case, your system may have a problem which could lead to^M
         undesirable behavior.  Contact your SCO service provider for^M
         help if you suspect that there is a problem.^M 
Oct 24 16:34:25 www3 ftpd[15956]: #2 open of pid file failed: No such file or directory
Oct 24 16:40:01 www3 ftpd[18453]: #2 open of pid file failed: No such file or directory

What is this Licensing Policy Manager Daemon (ifor_pmd)?
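For anyone sifting through an extract like the one above, here is a minimal sketch (not the nightly script mentioned in this post) that splits each syslog line into timestamp, host, process, and message, then counts messages per process. The regular expression and the names `parse_line` and `count_by_process` are assumptions made for illustration; the sample lines are copied from the extract.

```python
import re
from collections import Counter

# Assumed layout: "Mon DD HH:MM:SS host process[pid]: message" (pid optional).
LINE = re.compile(
    r"^(\w{3} +\d+ \d\d:\d\d:\d\d) (\S+) ([^:\[]+)(?:\[(\d+)\])?: (.*)$"
)

def parse_line(line):
    """Return (timestamp, host, process, pid, message), or None if no match."""
    match = LINE.match(line)
    return match.groups() if match else None

def count_by_process(lines):
    """Count how many parsed lines each process logged."""
    counts = Counter()
    for line in lines:
        parsed = parse_line(line)
        if parsed is not None:
            counts[parsed[2]] += 1
    return counts

# Lines taken from the syslog extract in this post.
sample = [
    "Oct 24 12:37:37 www3 lmail[27608]: Cannot open /usr/spool/mail/nobody: Operation would block",
    "Oct 24 14:46:57 www3 ifor_pmd: cleanup; terminating",
    "Oct 24 14:46:57 www3 Xsco[406]: Xsco: ERROR- Failed to initialize policy manager. (IFOR_PM_FATAL)",
    "Oct 24 14:46:57 www3 ifor_pmd: terminated with status 100",
]

counts = count_by_process(sample)
print(counts.most_common())
```

Continuation lines, such as the multi-line ifor_pmd notice, do not match the pattern and are skipped by this sketch.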
# 5  
Old 10-24-2002
This should not cause a reboot.

Do you have more information about the reboots?
For example, do they always happen on the same day or at the same time, or always when a specific job is running?

When this happens, it is mostly due to power or rare hardware problems, because otherwise the system logs the reason for the PANIC reboot.
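One way to check for a day/time pattern is to tally the known reboot times by weekday and by hour. This is only a sketch: the function name `reboot_patterns` and the timestamps are hypothetical, made up for illustration, and the real times would have to come from your own records.

```python
from collections import Counter
from datetime import datetime

def reboot_patterns(timestamps):
    """Count reboots per weekday (Monday=0) and per hour of day."""
    by_weekday = Counter()
    by_hour = Counter()
    for text in timestamps:
        when = datetime.strptime(text, "%Y-%m-%d %H:%M")
        by_weekday[when.weekday()] += 1
        by_hour[when.hour] += 1
    return by_weekday, by_hour

# Hypothetical reboot times, NOT taken from the real system.
sample = [
    "2002-10-03 14:47",
    "2002-10-10 14:52",
    "2002-10-17 02:10",
    "2002-10-24 14:46",
]

weekdays, hours = reboot_patterns(sample)
print("by weekday:", weekdays.most_common())
print("by hour:", hours.most_common())
```

If one weekday or hour dominates, compare it against the cron schedule and any batch jobs that run at that time.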
# 6  
Old 10-24-2002
One of your errors is covered in the PCUnix.com SCO FAQ (ifor_pmd failed to init).
# 7  
Old 10-24-2002
It is not out of the realm of possibility that you have a hardware problem.

I have systems that will panic and reboot when they have a hardware failure with a CPU or a card or even with memory. I would suggest that you have your hardware looked at.

If you have any crash dumps, I would look at those, assuming your system produces them.
