Msgget(2) returns 0 - a workaround fix


 
# 1  
Old 06-23-2014

Greetings:

I am posting this because my searches for this problem only came up with two posts and no helpful suggestions. I have a "solution" (read work-around hack) and have not tried yet to find a root cause, and may never because I am busy doing other things (read working to pay the bills).

However, I post this with two goals:
1. For the poor shmuck at 3am
2. document in case someone really has a wild hair (hare?) up their butt

Simply put, msgget(2) will return 0 for some reason, which the msgsnd(2) and msgrcv(2) do not like. My notes indicate msgsnd() was OK, and msgrcv() complained, but this was 12 hours into a debugging session....

There are two threads I have found in the interwebs:
forums.codeguru (dot) com/showthread.php?403036-strange-problem-in-using-msgget%28%29-in-Linux
and
unix (dot) com/programming/3755-about-msgget-troble.html

Both of these threads are "old" and closed, otherwise I would have responded to one of them.

NOTE: The codeguru.com thread has the best code example. The unix.com code has what may be a fatal flaw: it passes IPC_EXCL along with the permissions, so the second time it is run it should complain, unless he first removed the message queue. However, he should have gotten errno == EEXIST and it appears he did not - he does print errno.
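For anyone who wants to check the IPC_EXCL behaviour described above, here is a minimal sketch. The function name second_create_errno and whatever key you pass in are made up for illustration; it assumes the key is not already in use on the machine.

```c
#include <errno.h>
#include <stdio.h>
#include <sys/types.h>
#include <sys/ipc.h>
#include <sys/msg.h>

/*
 * Hypothetical helper: create a queue under `key` with IPC_EXCL, then
 * try to create it a second time with the same flags.
 * Returns the errno of the second attempt (EEXIST is what the man page
 * promises), 0 if the second attempt unexpectedly succeeded, or -1 if
 * the first create failed (for example, the key was already taken).
 */
int second_create_errno(key_t key)
{
    int first = msgget(key, 0666 | IPC_CREAT | IPC_EXCL);
    if (first == -1)
        return -1;

    errno = 0;
    int second = msgget(key, 0666 | IPC_CREAT | IPC_EXCL);
    int saved = (second == -1) ? errno : 0;   /* save errno before any other call */

    msgctl(first, IPC_RMID, NULL);            /* clean up so a rerun starts fresh */
    return saved;
}
```

Calling second_create_errno() with an unused key and comparing the result against EEXIST shows whether a given system behaves as documented.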

The Linux distro is Ubuntu 8, not patched. Because the other posts are from 2006 and 2005, the CPU does not seem to be an issue.

The interesting thing is:
Running ipcs gives (in addition to various semaphores and shared memory):

Code:
------ Message Queues --------
key        msqid      owner      perms      used-bytes   messages
0x000000f0 163840     gfi        666        0            0
0x0000007b 32769      gfi        666        0            0

The original key was 0xF0 which returned 0x8000 when it was working. The hex for the decimal 163840 = 0x28000. I arbitrarily tried a key of 0x7B (well, decimal 123) and got a msgqid = 0x8001 (which == 32769 decimal).
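A possible reading of these ids, offered as an assumption rather than something established in this thread: Linux kernels of that era appear to build a System V IPC id as seq * 32768 + index, where index is the slot in the kernel's table and seq is a counter that changes as slots are reused. The sketch below only does that arithmetic; the names SEQ_MULTIPLIER_GUESS, split_ipc_id and print_ipc_id are made up here.

```c
#include <stdio.h>

/* Assumption: id = seq * 32768 + index (not confirmed in this thread). */
#define SEQ_MULTIPLIER_GUESS 32768

struct ipc_id_parts {
    int seq;    /* reuse counter */
    int index;  /* slot in the kernel's table */
};

/* Split a non-negative id into its assumed seq and index parts. */
struct ipc_id_parts split_ipc_id(int id)
{
    struct ipc_id_parts p = { id / SEQ_MULTIPLIER_GUESS,
                              id % SEQ_MULTIPLIER_GUESS };
    return p;
}

/* Print one id in decimal and hex together with the assumed parts. */
void print_ipc_id(int id)
{
    struct ipc_id_parts p = split_ipc_id(id);
    printf("id %6d = 0x%05x -> seq %d, index %d\n",
           id, (unsigned) id, p.seq, p.index);
}
```

Under that assumption, 0x8000 is seq 1 / index 0, 163840 is seq 5 / index 0, 32769 is seq 1 / index 1, and 0x10001 is seq 2 / index 1. An id of 0 would simply be seq 0 / index 0, i.e. the first slot before any reuse, which fits with 0 being a legal id.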

I also see cases in my slime trail that when msgget() was returning non-zero, for a while it returned 0x10001. In all cases I am using an int to hold the msgQ_id. The key = 0xF0 returns 0, not 0x8000, so truncation is not an issue. I have not tried switching back to a key = 0xF0. I will try looking on another system running the same code (ie using 0xF0) to see what ipcs shows.

Another thing: 0 is supposed to be a legal return:
Quote:
Upon successful completion, msgget() returns a non-negative integer, namely a message queue identifier. Otherwise, it returns -1 and errno is set to indicate the error.
So - I don't know why msgget() will start returning 0. Honestly, I had another bug which (for a while) masked what msgsnd() was doing - a "(u)" instead of a "(%lu)" in a printf was throwing SIGSEGV (sigh) and I fixed both at the same time (ie new key) - this is a non-trivial system to run a code build on && one wants to do as much as one can between runs.

The only suggestion I can make is have the system come up with a unique key using ftok() every time, and remove old message queues. A good start on a key would be the parent process PID.
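A rough sketch of that suggestion, under stated assumptions: the helper name fresh_queue is invented, the path handed to ftok() must name an existing file or directory, and the caller must be allowed to remove any old queue found under the key. Note that ftok() only uses the low 8 bits of its second argument, so a PID passed there would be truncated.

```c
#include <stdio.h>
#include <sys/types.h>
#include <sys/ipc.h>
#include <sys/msg.h>

/*
 * Hypothetical helper: derive a key with ftok(), remove any queue left
 * over under that key, then create a brand new one.
 * Returns the new queue id, or -1 on failure (errno set by the failing call).
 */
int fresh_queue(const char *path, int proj_id)
{
    key_t key = ftok(path, proj_id);       /* path must exist */
    if (key == (key_t) -1)
        return -1;

    int old = msgget(key, 0);              /* open only; fails if no queue exists */
    if (old != -1 && msgctl(old, IPC_RMID, NULL) == -1)
        return -1;                         /* stale queue exists but cannot be removed */

    /* IPC_EXCL makes sure we really get a new queue, not a leftover */
    return msgget(key, 0666 | IPC_CREAT | IPC_EXCL);
}
```

Every process that wants the queue has to call ftok() with the same path and project id, and only one of them should do the remove-and-create step.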

(please forgive the chopped links - apparently I am not yet blessed to give raw links yet :^)
# 2  
Old 06-23-2014
We can't tell why your code is breaking down either. Certainly not without seeing it.
# 3  
Old 06-24-2014
I would gently point out that I provided a link to a post WITH the code. Here it is. The only two changes between the codeguru example and mine are that I changed the key and (obviously, one should ALSO check for < 0) added that check.

Code:
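msgid = msgget((key_t) 0xF0, 0666 | IPC_CREAT);

/* msgget() reports failure with -1; errno is meaningful only then */
if (msgid < 0) {
    printf(" Error in creating queue!!, errno = %d\n", errno);
    exit(1);
}

/* this branch flags the msgid == 0 case under discussion */
if (msgid == 0) {
    printf(" Got msgid == 0!!, errno = %d\n", errno);
    exit(0);
}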

Also, I was able to look on another development machine. It uses the same key as above (0xF0). ipcs reports the queue Id = 0x8000

Also - please note I am mainly posting this so that some poor programmer in the future with this problem can find this post.

The only folks who could provide any real answers, no offense, are the maintainers of msgget() and family. A link or contact point with them would be most appreciated.
This User Gave Thanks to mr_bandit For This Post:
# 4  
Old 06-24-2014
What OS?

And what's the value of errno after you get a zero back from msgget()? (And remember to set errno to zero before calling msgget()...)
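Something along these lines would capture what is being asked for. It is only a sketch: msgget_report is an invented name, and keep in mind that errno is only guaranteed to be meaningful when msgget() returns -1, so a nonzero value after a successful call proves little by itself.

```c
#include <errno.h>
#include <stdio.h>
#include <string.h>
#include <sys/types.h>
#include <sys/ipc.h>
#include <sys/msg.h>

/*
 * Hypothetical helper: clear errno, call msgget(), and record errno
 * immediately afterwards so later library calls cannot overwrite it.
 * Returns whatever msgget() returned; *errno_after gets the saved errno.
 */
int msgget_report(key_t key, int flags, int *errno_after)
{
    errno = 0;
    int id = msgget(key, flags);
    *errno_after = errno;                  /* save before calling printf() */

    if (id == -1)
        printf("msgget failed: errno = %d (%s)\n",
               *errno_after, strerror(*errno_after));
    else
        printf("msgget returned id %d, errno = %d\n", id, *errno_after);
    return id;
}
```

The same pattern (clear, call, save) applies to msgsnd() and msgrcv() when chasing the failures reported with a zero queue id.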
# 5  
Old 06-24-2014
msgid is NOT a message; it is a message queue id (a shared IPC object, not an individual message). A return of msgid == 0 means success. Any number > -1 is success.

Since you really did not post much code --
You should be calling msgrcv like this (note infinite loop is NOT required):
Code:
#include <stdio.h>
#include <stdlib.h>
#include <errno.h>
#include <sys/types.h>
#include <sys/ipc.h>
#include <sys/msg.h>

struct my_msgbuf 
{
 long mtype;
 char mtext[200];
};

int main(void)
{
   struct my_msgbuf buf;
   int msqid=0;
   key_t key;
   if ((key = ftok("[your key code goes here]", 'B')) == (key_t) -1)  /* same key as other program; ftok() returns -1 on error */
   {
     perror("ftok");
     exit(1);
   }
   if ((msqid = msgget(key, 0666)) == -1)  /* connect to the queue */
   {
      perror("msgget");
      exit(1);
   }
   printf("Ready to receive messages\n");
   for(;;) 
   { 
      if (msgrcv(msqid, &buf, sizeof(buf.mtext), 0, 0) == -1)
      {
        perror("msgrcv");
        exit(1);
      }
      printf("%s\n", buf.mtext);
   }
   return 0;
}

The above snippet works correctly, I use it in other code....

Last edited by jim mcnamara; 06-24-2014 at 11:25 PM..
This User Gave Thanks to jim mcnamara For This Post:
# 6  
Old 06-24-2014
Read this:

strange problem in using msgget() in Linux

OP is stating that he's seeing the same problem as posted at codeguru years ago: when msgget() returns 0, the message queue doesn't work.

Hence my asking about errno values for when the msgget() returns 0, and when msgsnd()/msgrcv() fail with the zero message queue ID.

# 7  
Old 06-24-2014
errno after msgget() returns 0

To be honest, I did not look at errno after msgget() returned 0 (It was in the wee early hours and I had bigger fish..). I will try that and get back with the results. This will also tell me if the original key returns 0.

The interesting thing is we loaded a *way* earlier version of the code to test a completely different thing. I just did an ipcs and got

Code:
------ Message Queues --------
key        msqid      owner      perms      used-bytes   messages
0x000000f0 0          gfi        666        0            0

where you can see msgqid == 0 and the system seems to be performing happily (at least this part of it). This would seem to reinforce the observation / theory that once the msgqid becomes 0, it stays 0. No complaints from msgsnd() or msgrcv().
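A quick way to confirm that a given queue id works, whatever its numeric value, is a send/receive round trip. The sketch below is an illustration only; round_trip and struct demo_msg are invented names, and it assumes the queue is empty and that the caller may both write to and read from it.

```c
#include <stdio.h>
#include <string.h>
#include <sys/types.h>
#include <sys/ipc.h>
#include <sys/msg.h>

struct demo_msg {
    long mtype;          /* must be > 0 for msgsnd() */
    char mtext[64];
};

/*
 * Hypothetical helper: send `text` (with its terminating NUL) to the
 * queue and read one message straight back into `out`.
 * Returns 0 on success, -1 on any failure. IPC_NOWAIT keeps the calls
 * from blocking if the queue is full or empty.
 */
int round_trip(int msqid, const char *text, char *out, size_t out_size)
{
    struct demo_msg snd = { .mtype = 1 };
    struct demo_msg rcv;
    size_t len = strlen(text) + 1;

    if (len > sizeof snd.mtext || len > out_size)
        return -1;
    memcpy(snd.mtext, text, len);

    if (msgsnd(msqid, &snd, len, IPC_NOWAIT) == -1)
        return -1;

    ssize_t n = msgrcv(msqid, &rcv, sizeof rcv.mtext, 0, IPC_NOWAIT);
    if (n == -1 || (size_t) n > out_size)
        return -1;

    memcpy(out, rcv.mtext, (size_t) n);
    return 0;
}
```

If round_trip() succeeds on a queue whose id is 0, that supports the view that 0 is an ordinary id and the earlier failures had another cause.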

So - this may be a brainfart on my part. After all, I did have another bug throwing a SIGSEGV at the same time. And it was very late/early. If it is a brainfart, my apologies.

Again, this is an unpatched Ubuntu 8.

@achenle has an interesting suggestion. I had not thought of setting errno to a value before making the call. The question then becomes: why not set it to (-1)? errno values are positive, at least on Linux. (I seem to remember them being negative numbers on BSD 4.1, but that was a *long* time ago...)