AIX - remote shell (sudo) - signal 11 core system 50


 
Thread Tools Search this Thread
Operating Systems AIX AIX - remote shell (sudo) - signal 11 core system 50
# 1  
Old 11-17-2014
AIX - remote shell (sudo) - signal 11 core system 50

Hi,
I am running a remote shell from site A to site B, where both are AIX. The remote shell starts other application, and when it finishes, it returns to the site A.
The problem is that I am receiving an error signal 11 and system core error 50 - segmentation fault.
Does anyone know if there are some configuration on remote shell channel, thread size, or some other place where I need to resize, in order to be able to run it? BTW, sometimes it works, sometimes not, and it seems that it depends on the application size that is running on site B (I am not sure for it yet).
The guy that is responsible for operating system said that he is not able to see anything wrong, and the problem is related to the application. The fact is, when the application runs at the same site, it works - any time.
It is a weird situation and I dont know what I need to do to trace or to be able to see the problem. I hope anyone could suggest me something such as parameters on the operating system site and this could be related to the size of thread, remote shell, etc..Oh, I was forgetting to say, the remote shell is executed under sudoers process, i.e, to run the remote shell on the other side, a sudo is necessary.
I would appreciate any help or hints for this issue.
tks.
# 2  
Old 11-17-2014
The most common reasons for an application to die with a segmentation fault (assuming no one explicitly sent it a SIGSEGV signal) are:
  1. using an uninitiaized pointer,
  2. buffer overflow (allocating a buffer of size x and writing into buffer[n] where n >= x or n < 0 [in C, valid array offsets are 0 to n-1]), or
  3. searching for the end of a string in a character array that does not include a terminating null byte.
# 3  
Old 11-17-2014
Check for environment variable differences when the app is run remotely. You could be loading the wrong shared object because of a different environment, for example. You might also be running into resource limits on the remote invocation, such as max memory usage.

Can you get a stack trace from a core file? If so, what is the app trying to do when it SEGVs? If you can get a stack trace, can you get a memory map? Where is the code being executed from? What shared library or executable?

Intermittent SEGVs can be tremendously hard to track down. Heap corruption from buffer overflows - the most common cause of intermittent SEGVs - tends to appear almost random at times because of the way heap memory tends to work. (Because of hardware alignment requirements, heap memory from "malloc()/calloc()/etc" and/or "new" tends to be parceled out in 8-byte blocks. So if you malloc() a 25-byte buffer, on most of today's hardware you really get 32 bytes...)

Are you responsible for developing this application? If so, have you ever tried something like Purify? Go look at the cost of that tool, then calculate how much time you've already spent trying to run down this ONE problem...
# 4  
Old 11-18-2014
Error

Hi,

As to analyse the trace map is a bit byte process, I would like to know something more about resize process or parameters, and answering the question if I can change the application, yes, I can.

I have heard some other hints, and one of them was: to identify the limits of the user that is running remotely. So, to do it, I used the command: ulimit, and the answer was: unlimited. the other hint was to know the limits of each machine, and on /etc/security/limits, I am able to see all of the parameters with -1, and it seems it is unlimited as well.

So, I am considering that I have no problem with limits, or machines, and the problem could be the size of the application that is handling some (or many) variables, contents, etc...(I am not sure for about it yet).

If someone knows other parameters, files, or any operating system issue that I need to check before, please, let me know, otherwise, I will start (and trying) changing the application.

tks.
# 5  
Old 11-18-2014
Are you saying your going to start making changes without knowing exactly what's failing?

That's known as Easter-egging.
# 6  
Old 11-19-2014
Hi anchenle,

yes, it is a easter-egging. the fact is that no one knows what is going on, even the operating system guy, it seems that there is no limit to be resized, and, if the amount of instructions were not so much, it works, so, this is we are considering that the size of the application could be one of the reason of the error. As it is possible to change the application and it does not depend on anyone, it would be easier to do it. However, as I said, if anyone can say something about the operating system parameters, or something like that, I would appreciate in receiving it. (my knowledge in Unix is zero)
one thing that I need to know it is if the process is executed inside any thread, because it could be that this thread would need to be bigger than it is defined.
tks.
# 7  
Old 11-19-2014
Easter-egging and intermittent SEGV is not easy.

If you don't know what's going on, you're not looking at the information you have.

Have you even tried to examine a core file from one of the SEGVs?

Have you even looked into trying any of the many memory corruption tools that are available?

https://www.google.com/search?q=aix+...rruption+tools
Login or Register to Ask a Question

Previous Thread | Next Thread

10 More Discussions You Might Find Interesting

1. Shell Programming and Scripting

Sudo connect to a remote server and execute scripts in remote server

Hello Every one!! I am trying to write a shell script which will connect to a remote server and execute scripts which are at a certain path in the remote server. Before this I am using a sudo command to change the user. The place where I am stuck is, I am able to connect to the... (6 Replies)
Discussion started by: masubram
6 Replies

2. Shell Programming and Scripting

Running remote system shell script and c binary file from windows machine using java

Hi, I have an shell script program in a remote linux machine which will do some specific monitoring functionality. Also, have some C executables in that machine. From a windows machine, I want to run the shell script program (If possible using java). I tried with SSH for this. but, in... (1 Reply)
Discussion started by: ram.sj
1 Replies

3. Shell Programming and Scripting

AIX system.... deleting files in remote directory after retrieving files

Hi Friends, I am new to this , I am working on AIX system and my scenario is to retrive the files from remote system and remove the files from the remote system after retreving files. I can able to retrieve the files but Can't remove files in remote system. Please check my code and help me out... (3 Replies)
Discussion started by: vinayparakala
3 Replies

4. Solaris

how to login with ssh to remote system with out applying the remote root/usr password

how to login with ssh to remote system with out applying the remote root/user password with rlogin we can ujse .rhosts file but with ssh howits possible plz guide (2 Replies)
Discussion started by: tv.praveenkumar
2 Replies

5. UNIX for Advanced & Expert Users

Using SCP command in IBM AIX to download file from remote to local system

Hi, When i run the code in solaris unix machine, the file from remote server is getting downloaded. but when i use the same code in IBM AIX remote machine, it is not running. It is saying "Erro during scp transfer." Below is the code. Please give some resolution. SCPClient client = new... (1 Reply)
Discussion started by: gravi2020
1 Replies

6. Shell Programming and Scripting

Run Shell Script on Remote System

I honestly tried searching for this in this forum and in google. Maybe I found the answer but didn't even realized it. I would like to run shell script thats on my machine that collects the hostname and IP address from the remote system and sends the output to my machine. I'm not sure if need... (2 Replies)
Discussion started by: elbombillo
2 Replies

7. HP-UX

Core dump with Signal 11.

Hi all, I am getting coredump in one of our programs. The last few lines of the tucs output are as below : read(8, "\adb\0\006\0\0\0\0\0060 0 9 2 6 ".., 2064) ...... = 2011 read(8, "\adb\0\006\0\0\0\0\07 4 1 4 5 03".., 2064) ...... = 2011 read(8, "\adb\0\006\0\0\0\0\00 2 6 030 0 ".., 2064)... (0 Replies)
Discussion started by: santosh.mane
0 Replies

8. AIX

How to remote backup with AIX system

I have some questions : 1) We can system backup manual by remotely? (Ex: we peform backup a AIX system in Japan from US)How we can do? 2) We can build a script to system backup automatically? How we can do? 3) We can use which third party to do that? Which tool? Thanks (1 Reply)
Discussion started by: guest
1 Replies

9. Programming

core dump - signal 9

Hi All, When I ran a multithreaded program in c++, it was core dumped when its core was analysed it indicated that the program was terminated with signal 9. Signal 9 cannot be caught or interrupted. When i analyzed the core further, it showed SING error what could be the reason for... (6 Replies)
Discussion started by: matrixmadhan
6 Replies

10. Programming

sigwait system call in UNIX signal

Hi Everybody, I have gone through man of sigwait and new to UNIX signals. Could anyone explain me about the following lines mentioned in sigwait man help ? "The selection of a signal in set is independent of the signal mask of the calling thread or LWP. This means a thread or LWP can ... (1 Reply)
Discussion started by: md7ahuja
1 Replies
Login or Register to Ask a Question