Process lose its parent then consume high CPU usage ...
Hello.
In an informix context, on AIX 5.3 TL 12, we encounter this problem :
Sometimes in the day (probably when users exits from their session), a child process lose its parent (PPID is now "1") and this child is consumming lot of CPU "USER".
I tried, on different cases, "truss -p <mypid>" : sometimes (for some processus) it says "truss: 0915-023 Cannot control process #950712", sometimes (for other processus) it waits and print nothing.
I tried procstack <my pid> :
Do you have any clue of what happend ?
I think of the way users are stopping their sessions ... but I dont know how it could cause these symptoms.
The last operations were : upgrade AIX 5.3 to TL12 (before upgrading to 7) and move partition from Power 6 to Power 7.
Thank you !
Moderator's Comments:
Please use CODE tags as required by forum rules!
Last edited by RudiC; 11-15-2016 at 12:26 PM..
Reason: Changed [B] to CODE tags.
nobody has a magic crystall ball, but it seems that your user just pressed the close button of the window. because the terminal was lost, the system sent SIGHUP (hangup) signal to all applications, which were running in the session. your application can understand this signal and should be able to exit clean. Your stack shows, that it called standard C function exit(). After this call the process should die.
IS the stack dumpfrom the process running out of control? I do not think so - as agent_kgb said, the process should end on an exit() call. Process running amok just crank cpu until somebody kills it.
It definitely is a user training issue. Closing the desktop window will cause a SIGHUP to be sent to the UNIX processes. The process you are seeing is probably forked over to whatever the informix user has, not the original user account. Check the owning username to be sure. It affects where to look for code that is trapping the SIGHUP signal. Which your problem seems to be - a guess.
I agree with you, but I don't understand why the behavior changed.
We updated a 5.3 TL 9 AIX to 5.9 TL12 and we also move from a physical server to a VIO client. Before this migration, we didn't have the problem.
It happens with multiple users (more than 10 different users for the last period I checked) now in 5.3 TL12 but nothing similar happens with another server, with same application, in AIX 5.3TL9.
Location: on the road for work; home is private time
Posts: 456
Thanks Given: 10
Thanked 108 Times in 100 Posts
Not a happy thought, but maybe it behaved well before because of a bug that got fixed with the update, or worse - a bug got introduced.
As to running on Power7 - in native mode, or in a vWPAR. In either case I suspect more than just the standard upgrade to AIX 5.3 TL12 (from memory SP7). In extended support there were several more updates (I have never seen).
Maybe among the "extended support" updates there is something extra you need.
Hi Fellas,
Not sure how I can dig in even further but we notice that one of our DB servers is showing high Sys% CPU usage even though when I execute the following command :
I can see that postgres is the only one using the CPU. So if anyone can advise me what would be the best way to... (3 Replies)
HI
In my M5000 , one of domain is having SAp installed. from today onwards we are facing some stange issue.
when we start SAP application, that particular user is taking 95 % of system CP and renaming 5 % is taken by system . because of this reason application is slow.
i have 4 CPU(32... (4 Replies)
I have created one script and i have added it into cron to run after 10 mins. However I have noted that whenever that script runs, It causes CPU utilization of server to increase about 10-20 %
I have rechecked script and there is no way i can make changes, Script contain only 2-3 commands. So... (4 Replies)
Dear All,
One production Server is affected by high usage.
Application is slow now. Please guide me how to solve it?
NMON Report shows full cpu usage.
Here im posting some server details.
bash-3.2# lparstat -i
Node Name : *********
Partition Name ... (6 Replies)
Hi,
I can't seem to make sense of this. My wait time is showing really high but vmstat's and topas are showing normal usage.
ps aux
USER PID %CPU %MEM SZ RSS TTY STAT STIME TIME COMMAND
root 9961810 5680.7 0.0 448 384 - A Dec 16 6703072:12 wait
... (2 Replies)
Hi All,
Yesterday my Linux server went panic and even a small command took a lot of time to run.
When i monitored pl find the below output
Cpu(s): 0.1%us, 98.4%sy, 0.0%ni, 1.5%id, 0.0%wa, 0.0%hi, 0.0%si, 0.0%st
the time spent on kernel mode is 98 % and also idle time is around 1.5 %...... (4 Replies)
I am running a Dell PE R815 with 4 x AMD 12 core CPUs with 128GB of RAM and a RAID 5 array of 6 SAS disks. This is an HPC application and is definitely CPU bound, however once I run 16 of these processes (thus pinning 16 cores) the work performed slows down dramatically, to maybe 5 or 10% of what... (2 Replies)
Hi,
I have problem like everyday i have to check which process consuming more cpu time. I have done it manually using top command..
Is there any script which will tell the exact process name which will consuming more time. I am using hpux. (1 Reply)