AIX performance


 
Thread Tools Search this Thread
Top Forums UNIX for Dummies Questions & Answers AIX performance
# 1  
Old 04-26-2006
AIX performance

Hiya all,

I am a newbie sysadmin to AIX, i have worked on HPUX for 3 years.

I have started a new role with in an IBM house and because there is me and one other there are a couple of issues I cannot work out:

We havehad a production server slowing down processing batch jbs over the past few nights - I have checked many things such as nmon stats, vmstats top procs and general performance from the machine.

We are getting hit by hit wait times in vmstat throughout the evening and know of certain jobs that run (business critical jobs) These are runniing up to 4hours longer.

Can you tell me the best way to monitor jobs / process etc so I can tell the "BOSS" what is causing the issues.

The main problem is that developers run queries to DB's on the server which we are currently running through a process to stop this.

any info will be helpful

Thanks in advance
# 2  
Old 04-26-2006
Use filemon. You can see then if there is a bottleneck in your disks or I/O somewhere. IMO it's probably the best tool there is for checking that, but you have to use it while the slow down is occuring.
# 3  
Old 04-27-2006
My guess is the IO subsystem.
Did you check on the IO average service time and average wait time during peak hour?
# 4  
Old 04-27-2006
here...

it is difficult to help minus details but try that:
1. if use SSA see that volume group is in a good health and you have no stale physical volumes, run defragmentation !
2. see system defs for maximum number of open files for a process and buffer limits for a process.
3. see in top what are the processes occupy most of the time, then in lsof figure out what takes it and then in iostat or vmstat see how the picture changes as you go trough steps 1 and 2.

Last edited by amro1; 04-27-2006 at 12:15 PM..
# 5  
Old 04-27-2006
Smilie Hi Thanks for the responses:

yes we have checked various subsystems during the issues - we have nmon graphs that show high wait times and also have alerting that proved wait times to be above 60 from the vmstat command.

I ran svmon:

--> svmon -G -i 2

size inuse free pin virtual
memory 3145689 3087096 58593 182176 858018
pg space 2785280 428652

work pers clnt
pin 182158 0 0
in use 913993 2173103 0


also topas and noticed lots of page faults due to paging in and out.

hdisk1 and 0 are heavily utilised pretty much all day as well as other system disks but I tend not to believe everything in topas.

We run ps awux > /tmp/monitoring.date.

this file is updated every 15 minutes and I find the following which apperently is normal system calls: (these are the top procs in the file every 15 mins)

root 2064 8.5 0.0 12 9008 - A 24 Feb 60881:40 kproc
root 1806 8.5 0.0 12 9008 - A 24 Feb 60821:30 kproc
root 1548 8.5 0.0 12 9008 - A 24 Feb 60818:49 kproc
root 1290 8.5 0.0 12 9008 - A 24 Feb 60703:20 kproc
root 2322 8.5 0.0 12 9008 - A 24 Feb 60685:25 kproc
root 1032 8.5 0.0 12 9008 - A 24 Feb 60554:57 kproc
root 774 8.4 0.0 12 9008 - A 24 Feb 60152:51 kproc
root 516 8.1 0.0 12 9008 - A 24 Feb 57866:27 kproc
root 3096 0.0 0.0 64 9052 - A 24 Feb 198:45 kproc
root 2580 0.0 0.0 12 9004 - A 24 Feb 139:47 kproc
root 2838 0.0 0.0 16 9012 - A 24 Feb 1:41 kproc
root 3354 0.0 0.0 16 9012 - A 24 Feb 1:10 kproc
root 32510 0.0 0.0 16 9012 - A 24 Feb 0:03 kproc
root 30446 0.0 0.0 16 9012 - A 24 Feb 0:02 kproc
root 582168 0.0 0.0 16 9004 - A 28 Feb 0:00 kproc
root 25284 0.0 0.0 16 9004 - A 24 Feb 0:00 kproc
root 25542 0.0 0.0 16 9004 - A 24 Feb 0:00 kproc
root 25800 0.0 0.0 16 9004 - A 24 Feb 0:00 kproc
root 25026 0.0 0.0 16 9004 - A 24 Feb 0:00 kproc


when I grep out defunct:

retail ps auwx Monitor on Mon 24 Apr 18:15:00 2006
rt07mszw 1228822 Z 0:00 <defunct>
rt05hdzw 925824 Z 0:00 <defunct>
rt0v9rzm 1064108 Z 0:00 <defunct>
rt0a5jzm 1990444 Z 0:00 <defunct>
rt07mszw 1772756 Z 0:00 <defunct>
rt07mszw 1733018 Z 0:00 <defunct>
rt06ggxp 1731806 Z 0:00 <defunct>
informix 246550 Z 0:00 <defunct>
rt07mszw 781804 Z 0:00 <defunct>
rt08cazm 807862 Z 0:00 <defunct>
informix 732496 Z 0:00 <defunct>
informix 671516 Z 0:00 <defunct>
retail ps auwx Monitor on Mon 24 Apr 18:30:01 2006
rt050azb 1280306 Z 0:00 <defunct>
informix 1502640 Z 0:00 <defunct>
rt0d5rws 1481808 Z 0:00 <defunct>
rt0j2czb 1410630 Z 0:00 <defunct>
rt0o5ayb 1410304 Z 0:00 <defunct>
rt0r5mza 1030858 Z 0:00 <defunct>
rt0o5ayb 1014478 Z 0:00 <defunct>
root 1914084 Z 0:00 <defunct>
root 1966324 Z 0:00 <defunct>
rt095req 1948512 Z 0:00 <defunct>
rt01mszm 1944508 Z 0:00 <defunct>
rt0d5rws 1682574 Z 0:00 <defunct>
root 455384 Z 0:00 <defunct>
informix 232872 Z 0:00 <defunct>
informix 732496 Z 0:00 <defunct>
informix 734412 Z 0:00 <defunct>
rt05adyk 551914 Z 0:00 <defunct>
rt0a2gzt 654196 Z 0:00 <defunct>


now these do disapear and repear with different PIDS.

any ideas?

Thanks
# 6  
Old 04-27-2006
If you are getting a lot of paging during this time check to see if your paging space is setup correctly as well.

A few questions you might bring up -
When was the last time the box was rebooted? If you have any memory leaks this will clean that up.

Has the number of apps increased on the box since it was bought? Does it need an actual memory upgrade?

Check performance and tuning guide in relation to what the vendor recommends.

I still recommend running filemon to see if you have a disk bottleneck. Your paging can increase if there is a bottleneck and writes are taking longer and longer to compelte. If so, you would need to move around your LV's in order to increase performance.
# 7  
Old 04-27-2006
anyone give me some info from this output from top:

PID USER PRI NICE SIZE RES PFLTS STAT USER/SYSTIME CPU% COMMAND
0 root 0 -20 12k 8920k 0.0 non 0:00/ 6:43:22 99.7/ 0.4


it appears periodically in TOP and sometimes display's more than 1 process.

Thanks
 
Login or Register to Ask a Question

Previous Thread | Next Thread

10 More Discussions You Might Find Interesting

1. AIX

Performance stats from AIX 6

Hi, I'm supposed to capture many performance stats on AIX 6 and stuck up with below: Priority queue Disk cache hit% Page out rate Swap out rate Memory queue I see vmstatis helpful for "page out" but not sure how to get the "rate". Could anyone please let me know how to get these... (4 Replies)
Discussion started by: reddyr
4 Replies

2. AIX

AIX 6.1 Memory Performance issues

Good Day Everyone, Just wonder anyone has encounter AIX 6.1 Memory Performance issues ? What I have in my current scenario is we have 3 datastage servers (Segregate server and EE jobs - for those who know Datastage achitect) and 2 db servers(running HA to load balance 4 nodes partitions for... (3 Replies)
Discussion started by: ckwan
3 Replies

3. AIX

AIX 5.3 performance problems

Hello, I encounter some performance issues on my AIX 5.3 server running in a LPAR on a P520. How do I investigate performance issues in AIX. Is there any kind of procedure that takes me to the steps to investigate my server and find the sub systems that is causing the issues? The performance... (1 Reply)
Discussion started by: petervg
1 Replies

4. AIX

Asynchronous I/O on AIX and DB performance

Hi, I would like to hear your thoughts about this. We are running our Data warehouse on DB2 DPF (partition environment) and I have notice that sometimes we hit the Asynchronous-I/O-Processes peak. DB2 relies heavily on Asynchronous I/O so I would believe this has an negative impact.We are... (10 Replies)
Discussion started by: arizah
10 Replies

5. AIX

Need some help for AIX performance monitoring

Hello I am new user of AIX; I have only basic knowledge of the UNIX commands, and I want to create script that will monitor the performance and resources usage on AIX 6.1 machine. Basically I wan to start a loop that will grab, every 10 seconds, the CPU usage, the memory usage, the disk usage,... (1 Reply)
Discussion started by: adaher
1 Replies

6. AIX

AIX performance issue

Hi Guys, This is the situation I am in. Provide your views and input where should I start? I have one P7 test server and a p520 production server. the job is taking pretty long on the P7 test server when compared to the P5 production server. below is the full detail. Informix... (5 Replies)
Discussion started by: ithinksgreen
5 Replies

7. AIX

performance issue in AIX

Gurus, i have process that runs 5 times a day. it runs normally (takes about 1 hour) to complete in 3 runs but it is takes about ( 3 hrs to complete) two times So i need to figure out why it takes significanlty high time during those 2 runs. The process is a shell script that connect to... (2 Replies)
Discussion started by: mad_man12
2 Replies

8. AIX

Performance issue in AIX 5.3

Is there is any way to increase the CPU utilization of a Embedded SQL program in AIX 5.3 .. for performance purpose. (0 Replies)
Discussion started by: Gyanendra Awast
0 Replies

9. AIX

AIX 5.2 performance question

I am trying to analyze the performance of an AIX system. I think I may have a disk I/O issue, but I am asking for help to validate or invalidate this assumption. I ran the commands below during a period of peak load. Please help me to find any performance bottlenecks. Thanks in advance for your... (15 Replies)
Discussion started by: jhall
15 Replies

10. AIX

Performance testing on AIX

I'm doing performance testing for one application which works on AIX. But I don't know which performance parameters of memory need to be collected. Now, I just know very few: 1. page in 2. page out 3. fre They are all collected by "vmstat" command. I want to know, except for above... (2 Replies)
Discussion started by: adasong
2 Replies
Login or Register to Ask a Question