AIX System Loses communication


 
Thread Tools Search this Thread
Operating Systems AIX AIX System Loses communication
# 1  
Old 12-20-2005
AIX System Loses communication

We run an RS/6000 SP Frame. One of the nodes running (AIX 5), in the frame, seems to run fine for a few weeks and then will no longer communicate with the outside world. I can not telnet to the server or ping the server.
This is true for both the ethernet and serial connection.

I believe what is happening is that there is a process running out of control, eating all the system resources and then finally bringing the system to its knees.

We are running Oracle 9i on the system as well.
Does anyone have any ideas how to monitor the system before it crashes.
I have looked into the syslogs and dumps but it doesn't seem to contain any information about why the system dies.

Any help would be appreciated.

Kevin
# 2  
Old 12-21-2005
CPU & Memory

Kevin,

If it is not a production server, try to stop the Oracle service and see the result. May be it is caused by the data load or misconfiguration between AIX and the Oracle.

Have you check the errlog. May be this would help. Get the error code and check on IBM site.

Monitoring the log is tedious job. In addition, monitoring the log before it's crash is not a solution to your problem.
# 3  
Old 12-21-2005
Thanks for the reply alisetan.
Unfortunatley the server is a production server and I will not be able to bring down the databases and stop the services.

I have run an errpt -a but the last entry before the server died was from 10 days ago and the next entry is from the system starting up.

I think at this point I am going to run ps on a 10 minute schedule and grep all oracle instances, and pipe it out to a file.

Unless anyone can think of an easier way?

Thanks for your help

Kevin
# 4  
Old 12-22-2005
I that's a good idea.
Login or Register to Ask a Question

Previous Thread | Next Thread

10 More Discussions You Might Find Interesting

1. UNIX for Beginners Questions & Answers

New to AIX: How do I setup high availability on an AIX System

I am new to AIX but not new to unix. I have an interview for an AIX systems admin position and I know they want someone who has knowledge of High Availability, Failover and LPARs From my research so far, It appear powerha is used to setup high availability and failover on Power systems but is... (2 Replies)
Discussion started by: mathisecure
2 Replies

2. Shell Programming and Scripting

Variable loses value outside loop

Hi, I am looking for a global variable found_flag to be set a value that can be accessed outside the loop anywhere in the bash shell script. more test.sh found_flag=0 searchdir=/web/bea_apps/applications find . -type f \! -name tasty.tar | $AWK -F/ '{print $NF}' | while IFS=... (5 Replies)
Discussion started by: mohtashims
5 Replies

3. AIX

Accessing files on AIX system from Linux system

I have a following requirement in production system 1 : LINUX User: abcd system 2: AIX (it is hosting a production DB) Requirement user abcd from system 1 should have read access on archive log files created by DB on system 2. The log files are created with permissions 540 by user ora ,... (2 Replies)
Discussion started by: amitnm1106
2 Replies

4. UNIX for Dummies Questions & Answers

FTP loses write permissions

Hi Guys, i have learned today that when you ftp a file with full write permissions (777) to another destination it loses the w options. so a file that was once -rwxrwxrwx(before FTP) is now -rw-r--r-- (after FTP). why does this happen? and is it configurable? Regards, (8 Replies)
Discussion started by: brian112
8 Replies

5. UNIX Desktop Questions & Answers

Grep result loses formatting

I am searching for a string in a file and then redirecting the contents in another file... however the formatting is not preserved.. Can you please help me on this ... (5 Replies)
Discussion started by: blackeyed
5 Replies

6. AIX

AIX Printers moved to anothere AIX system

AIX Printers need to be moved to another system Guy's We have two servers old AIX 5.2 and new AIX 6.1 the old server has more than 300 printers installed with different configurations I'd like to move all the printers from the old server to the new server with fast steps it's... (1 Reply)
Discussion started by: ITHelper
1 Replies

7. AIX

How to apply aix 5.3 TL8 properly on ML5 aix system ?

Is it necessary to put system into single user mode for applying aix 5.3 TL8 on a aix 5.3.5.0 system ? Is the TL8 installation not totally safe ? thank you. (6 Replies)
Discussion started by: astjen
6 Replies

8. SCO

Unixware 7 curses/tcp loses characters ?

I have a program (binary) that ran well on UW 2.1 but on UW 7.1.4. characters are lost somewhere on the tcp/ip highway to my TUN emul. I mean, it is clear that what is on the screen is not what curses thinks there is. If I "tee" the program while running it (which probable slows down things ?),... (0 Replies)
Discussion started by: Geert.Anckaert
0 Replies

9. UNIX for Dummies Questions & Answers

copy loses the text format

Hi I try to copy part of text from one file to another file. My problem is the text in the new file loses all the format. My code is: #!/bin/sh while red line do if then echo "$line" >> ./new_file else break fi done < "./old_file" Is there a way to modify... (3 Replies)
Discussion started by: tiger99
3 Replies

10. Solaris

rsh commands not getting executed from Solaris 10 System to AIX System

Hi Friends, I am trying to execute rsh commands from Solaris 10 system to AIX system. When I give; Solaris10# rsh <hostname> ls -l , it gives me an error rshd : 0826-826 The host name for your address is not known At the same time, Solaris10# rsh <hostname> ---- gives me remote shell of... (25 Replies)
Discussion started by: jumadhiya
25 Replies
Login or Register to Ask a Question