Speculative Shell Feature Brainstorming


 
# 15  
Old 03-30-2011
Quote:
Originally Posted by tetsujin
That allows you to use object lifetime to control resource allocation - including allocation of resources the shell may not have been explicitly designed to manage.
I'm actually not against that. As long as a variable always acts like a variable, the special cases I keep harping about don't exist. That's the whole point of polymorphism, yes? Lets you use different things the same way?

Perl does a really bad job of polymorphism. It's specialized itself into a big mess of special types and operators that are all context-sensitive. Depending on what kind of variable it is, print $variable, "\n" could print a string, a blank line, a mess of crap, the length of something, a pointer dump like ARRAY(0x123851023), a syntax error, or a host of other things. You can spend hours just trying to figure out what kind of variable someone's perl module hands off to you, let alone how to use it.

The shell has avoided this by keeping the meaning of its operators consistent and having only one interface for variables: strings. ${something} means the value. ${#something} means the length. ${*} means all arguments. "${@}" means all arguments, each kept as a separate word instead of being split on IFS. They work alone or in combination, on arrays or strings or environment variables or local variables. The shell knows the difference and that's enough. In the few cases where you're not allowed to combine them, you always get a syntax error -- not a silent substitution of nothing, not a length you didn't ask for, not a garbled mess, not a pointer dump. You can always expect to access a shell variable the same way no matter what it is, and the interface is flexible enough to encompass most possibilities. That's polymorphism.
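A quick illustration with stock bash, for the sake of argument:

Code:
STR="hello"
ARR=(one two three)
set -- a b c                    # set the positional parameters

echo "${STR}" "${#STR}"         # hello 5  -- value and length
echo "${ARR[1]}" "${#ARR[@]}"   # two 3    -- element and element count
echo "${#}"                     # 3        -- argument count
printf '%s\n' "${@}"            # one line per argument, no IFS splitting

Same operators, same meanings, different underlying types.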

That's also why I think expanding the concept of a variable would work better than tacking the new concept of scoped files onto the side of it. Let the variables be accessed the same way, as strings: it's already convenient for the programmer, fits nicely with the existing syntax for redirecting files, and wouldn't need much new syntax. All that needs to be different about the variable is the ability to close a file when it goes out of scope. The scoping doesn't have to be something that makes the programmer learn new syntax, any more than a C programmer uses a stack variable differently than a global one.
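(As an aside, newer bash -- 4.1 or so -- already has half of this: a redirection can pick a free FD and drop the number into an ordinary string variable. The only piece missing is the scoped close:)

Code:
exec {fd}< /etc/hosts           # bash picks a free FD, stores the number in $fd
read -u "$fd" FIRSTLINE         # the variable gets used like any other string
echo "read via FD $fd: $FIRSTLINE"
exec {fd}<&-                    # but closing is still manual, not scoped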

---------- Post updated at 04:41 PM ---------- Previous update was at 03:52 PM ----------

Quote:
Originally Posted by tetsujin
In the design I've suggested here, the shell doesn't take responsibility for fixing that. Rather, it merely provides a mechanism that allows an external program to fix that... The shell creates the PTYs for each thread (if the user requests TTY sharing) and connects them appropriately, but the job of determining how those PTY masters are used is left up to the program specified by the user.
Does that have to be done as part of the parallel loop? That sounds like something you might want to do in general, inside or outside a loop. Let the parallel loop be the parallel loop and the TTY sharer be the TTY sharer.
Quote:
In the design I described, "screen" windows don't get created and destroyed during the loop.

Rather, when writing the loop, if the user has explicitly requested a terminal-sharing mechanism be used to synchronize display between the multiple loop iterations being run concurrently, then there will be one terminal created for each thread, not one for each iteration.
That could be confusing if the threads end up doing different things on each iteration.
Quote:
This doesn't create a perfect display, because as each loop iteration ends, a new one takes its place on the display. There's no display of history, basically. But it's a quick & dirty way for the user to get a display that's at least readable...
I agree it could be useful but I'm not sure it should be part of the shell script. You end up with shell scripts that require expect to run, and start spewing garbage to stdout if you try to disentangle them.
Quote:
Could you explain the password spoofing issue to me? Depending on the nature of the issue it could obviously be serious...
Utilities like ssh and su require password input to be from a terminal, but PTYs count. Something that gives you quick and easy access to PTYs, let alone in a high-performance parallel manner, isn't the sort of tool you want to just leave lying around -- another reason expect's a last resort.
Quote:
I consider a shell to be a programming language and a UI. To me, it's pretty much the unique defining characteristic of a shell.
True, but look at the way these features are made. A shell script isn't going to blow up because someone disabled tab completion -- shell scripts don't use tab completion, that's a user thing. Nor is it going to send PgUp to recall the last command and blow up when history features turn out to be unavailable; that's a user thing too. You cannot write scripts that depend on these features, and that's intentional -- the shell might care whether it's in a terminal or not, but the program usually doesn't.

That's what I mean by not building it in -- making it something like a debug flag that changes behavior to let it pop up these little windows. The script doesn't have to know or care whether you're snooping on its threads' outputs; that can be left up to the shell. After all, what would know better about the current terminal than the shell?
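(For reference, the check itself is a one-liner:)

Code:
# How a script or shell can tell whether there's a terminal at all:
if [ -t 0 ]             # is stdin a terminal?
then    echo "interactive terminal: $(tty)"
else    echo "no terminal on stdin"
fi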
Quote:
Considering the "usual" case is still useful, though, for deciding what things should be well-supported and convenient in the syntax.
It doesn't have to be all-or-nothing. I think it's possible to make a syntax that's both convenient and general-purpose.
Quote:
If you wanted to limit how many of them run at once (after all, running a substantially higher number of jobs than your number of CPU cores at best is going to get you around some I/O blocking, at worst it's going to slow you down via VM thrashing) - that's more complicated.
Not THAT complicated.

Code:
#!/bin/bash

THREADS=5       # maximum concurrent jobs
WAIT=0          # index of the oldest job still running
END=0           # index of the next job to launch

# Wait for the oldest outstanding job.
function waitnext
{       # Needs BASH/KSH (arrays, arithmetic)
        wait "${PIDS[$(( (WAIT++) % THREADS ))]}"
}

for ((N=0; N<100; N++))
do
        # If THREADS jobs are already running, reap the oldest first.
        [ "$((END-WAIT))" -ge "$THREADS" ] && waitnext
        sleep 1 & PIDS[$(( (END++) % THREADS ))]=$!
done
# wait for ALL remaining processes
wait

There's a shell feature missing that'd make this better and easier -- the ability to wait for any one process without saying which one. Right now you have a choice between waiting for one specific process or waiting for all of them. [edit] Or trapping SIGCHLD. That'd work, but it would work better if the shell had something like semaphores.
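In the meantime, a polling workaround does the job -- crude next to a real wait-for-any, but it needs nothing beyond stock bash/ksh:

Code:
#!/bin/bash
THREADS=5

# Busy-wait until the number of running jobs drops below the limit.
waitany()
{
        while [ "$(jobs -pr | wc -l)" -ge "$THREADS" ]
        do
                sleep 1         # polling interval, tune to taste
        done
}

for ((N=0; N<100; N++))
do
        waitany
        sleep 1 &
done
# wait for ALL remaining processes
wait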

Quote:
Going back to the issue of shell threading:
I hadn't thought about the interaction between fork() and threads - But, then, "threads" in an interpreted language don't have to be interpreted as actual execution threads: Python (the C implementation) for instance, implements threads internally. From the perspective of the OS these threads don't exist (they're not separate entities in the scheduler) but within the context of the language itself they work as any other threads implementation would.
That's just timesharing. If you want actual benefits from threading, you must do multithreading and/or multiprocessing. Threads the OS scheduler can't see never get run on more than one core.
Quote:
If the implementation did use real threads, another option for dealing with forking would be to fork off a process that does nothing but listen to a pipe that tells it what to fork and run, and a Unix domain socket that feeds it file descriptors to attach to the new processes...
Kind of what I said, but in more detail.
Quote:
Of course it'd also have to communicate back information about when jobs terminate... Apart from the fact that it solves the thread problem pretty handily, it seems like kind of an ugly solution, really.
It is, and could cause other problems. If the parent closes a file descriptor, this child launcher will have to follow its lead somehow.

There's a better way to do this but I don't quite remember it. Something to do with fork callbacks (pthread_atfork(), perhaps).
Quote:
As for the other impacts of threading - parens could still be used specifically to specify a subshell context (after all, both bash and ksh provide curly braces as a way to group commands without creating a subshell context)
...What? Really?

That's perfect for scoping your files! Allow local variables in braces like that. Your files could be some of these variables, containing FD numbers. When they go out of scope, the files close.
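For anyone else who didn't know, the difference in action:

Code:
X=1
{ X=2; }        # brace group: runs in the current shell, change sticks
echo "$X"       # prints 2
( X=3 )         # subshell: the change dies with it
echo "$X"       # still prints 2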

# 16  
Old 03-31-2011
Quote:
Originally Posted by Corona688
Quote:
In the design I've suggested here, the shell doesn't take responsibility for fixing that. (TTY sharing in a loop) Rather, it merely provides a mechanism that allows an external program to fix that...
Does that have to be done as part of the parallel loop? That sounds like something you might want to do in general, inside or outside a loop.
I'd considered that: the multiplexing/TTY sharing stuff would be useful outside the context of loops (or, if you prefer, at least as useful as it is in the context of loops) so addressing it as a general mechanism rather than something specific to loops would have advantages, make the shell language more orthogonal, etc.

One approach to doing this would be to have a separate step in which the PTYs and/or pipes are created to serve as /dev/tty, stdout, stdin for various jobs that are going to be run asynchronously: the file descriptors for these could be stored in array variables and passed to the loop construct, which would then pass them on to individual loop steps. Then probably one could come up with other cases where those FD arrays could be useful...

Quote:
You end up with shell scripts that require expect to run, and start spewing garbage to stdout if you try to disentangle them.
"expect" shouldn't be needed. Probably the sensible default for this "TTY sharing" stuff is to simply not do the TTY sharing thing if there's no TTY. ('course, there would be exceptions - cases where you might want TTY sharing to do its thing even if there's no TTY... If you're displaying the TTYs with xterms, for instance.)

Quote:
Utilities like ssh and su require password input to be from a terminal, but PTYs count. Something that gives you quick and easy access to PTYs, let alone in a high-performance parallel manner, isn't the sort of tool you want to just leave lying around
Hm, so is the problem just that people could use this capability to spoof input to programs that interact on /dev/tty?

I can live with that. Anyone who wants to do damage with that kind of capability can do it regardless of whether the shell serves up the functionality or they have to write their own program for it.

Quote:
Quote:
(Interpreted languages can implement threading without threads)
That's just timesharing. If you want actual benefits from threading, you must do multithreading and/or multiprocessing.
Well, yeah, but my aim with "shell threading" is mostly just to keep things from getting booted into a "subshell context" simply because they're occupying a particular position in the command. But I'm starting to feel like it's probably not the best way to go. It'd mean keeping track of all the "threads" that are blocking on something, maybe stuffing all that into a big select() call...

Actually, though (I had to look this up): it turns out that when you fork() in a POSIX-threaded app, the new process has just one thread initially. The threads aren't cloned. So a little care would be necessary to make sure the environment's in a usable state prior to calling fork(), but otherwise using POSIX threads shouldn't be a problem.
# 17  
Old 03-31-2011
Quote:
Originally Posted by tetsujin
"expect" shouldn't be needed.
I meant screen, sorry. Probably doesn't change the answer though.
Quote:
Probably the sensible default for this "TTY sharing" stuff is to simply not do the TTY sharing thing if there's no TTY.
But where would the output go instead?
Quote:
Hm, so is the problem just that people could use this capability to spoof input to programs that interact on /dev/tty?

I can live with that. Anyone who wants to do damage with that kind of capability can do it regardless of whether the shell serves up the functionality or they have to write their own program for it.
Sure, if given access to them. Just something to keep in mind.
Quote:
Well, yeah, but my aim with "shell threading" is mostly just to keep things from getting booted into a "subshell context" simply because they're occupying a particular position in the command.
External commands have to be in a separate context. There's just no other way to run them. Builtins could be run in threads, so they wouldn't need a separate context.

Even if you fork(), it's possible to share memory by other means. Memory you've created with mmap() can be shared between such processes. So just being in a subshell doesn't mean you'd have to lose access to variables. Environment variables, though, come preallocated in the process image, so they can't be shared that way without torturing your process environment in cruel and unusual ways.

Some builtins, like echo, might not need any context at all. When you know the amount of text is smaller than the pipe's buffer, you know the write will never block -- so just cram the text into the write end and close it in advance. You don't need a new thread just to do that.
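You can watch that happen in any bash. Below, the writer finishes immediately while the reader is still asleep, because the text fits in the pipe buffer:

Code:
{ echo "small text"; echo "writer finished" >&2; } | { sleep 2; cat; }
# "writer finished" shows up about two seconds before "small text" does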
Quote:
But I'm starting to feel like it's probably not the best way to go. It'd mean keeping track of all the "threads" that are blocking on something, maybe stuffing all that into a big select() call...
Why not let the mutexes do the blocking?
Quote:
Actually, though (I had to look this up): it turns out that when you fork() in a POSIX-threaded app, the new process has just one thread initially. The threads aren't cloned.
Reference, please? Things I've seen suggest quite differently. Then again, that was something I saw inside the Linux pthreads library, so it might be things the library does to prevent threads being cloned, rather than things I'd have to do...

If true, that would make it a ton easier.

# 18  
Old 04-15-2011
Another way to handle anonymous files could be similar to the awk way. When you redirect to a file in awk, it stays open; referring to the same filename later just gets you the same handle over and over.
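For reference, that's real awk, not proposed syntax -- the filename string names the handle, opened on first use and reused until close():

Code:
# Compare two files line by line, the awk way (assumes both files exist).
awk 'BEGIN {
        while ((getline a < "otherfile") > 0 && (getline b < "cmpfile") > 0)
                if (a != b) print "mismatch:", a, "vs", b
        close("otherfile"); close("cmpfile")
}'

A shell version of the same idea might look like this: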

Code:
N=1
# Read lines from "otherfile" and "cmpfile" one by one, without
# redirecting the entire loop's stdin.
# Also set the close-on-exec flag for files opened in this fashion.
while read LINE <<"otherfile" && read OTHERLINE <<"cmpfile"
do
        if [ "$LINE" != "$OTHERLINE" ]
        then
                echo "Line $N doesn't match"
                echo "What would you like to do?"

                read RESPONSE
                case "${RESPONSE}" in
                *)      # todo: something
                        ;;
                esac
        fi
        ((N++))
done

close "cmpfile" "otherfile"

Using << for 'keep the file open' seems a nice opposite of the >> 'append to file' redirection. This breaks the syntax for here-documents though. Not quite sure how to get the FD out of that either, maybe $<"filename" ?

# 19  
Old 04-16-2011
Quote:
Originally Posted by Corona688
Quote:
Probably the sensible default for this "TTY sharing" stuff is to simply not do the TTY sharing thing if there's no TTY.
But where would the output go instead?
Same place it would go to normally. Whatever the shell sees as FD #1.

Obviously if you're using stdout as some kind of formatted value stream, you don't want a bunch of processes writing data to it concurrently without some kind of synchronization: this is why I described a stdout sharing mechanism (separate from the /dev/tty sharing mechanism).

If somebody ran a multithreaded loop whose jobs produced newline-delimited value streams, that's a common enough formatting convention that it could just be built into the shell. If those processes were writing out JSON or some XML schema, then merging is something the user should probably be handling, by providing a program that takes those output streams and merges them into one.
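A quick sketch of why newline streams merge so painlessly: whole-line writes smaller than PIPE_BUF land on a shared pipe intact, so nothing interleaves mid-line even though ordering across jobs is arbitrary. (The jobs here are just placeholders.)

Code:
for N in 1 2 3 4 5
do
        ( sleep $((RANDOM % 3)); echo "result from job $N" ) &
done | cat      # cat sees whole lines, in whatever order the jobs finish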


Quote:
Quote:
Well, yeah, but my aim with "shell threading" is mostly just to keep things from getting booted into a "subshell context" simply because they're occupying a particular position in the command.
External commands have to be in a separate context. There's just no other way to run them.
Yeah, mainly by "things" I meant builtins like variable assignment or "read". Things you would do to modify the environment, in cases where it's pretty reasonable for the user to expect that it'll modify the main shell's environment...

Quote:
Even if you fork(), it's possible to share memory by other means. Memory you've created with mmap() can be shared between such processes. So just being in a subshell doesn't mean you'd have to lose access to variables. Environment variables, though, come preallocated in the process image, so they can't be shared that way without torturing your process environment in cruel and unusual ways.
Hm, might have to think about that one. Of course you could get around it by simply copying all the env. variables to the shell's process memory and operating on those copies of the variables - a bit wasteful but a pretty simple dodge...

Quote:
Why not let the mutexes do the blocking?
Because I'm not talking about cases where threaded built-ins are blocking on some shared resource, I'm talking about them blocking on I/O, probably to a pipe as part of a larger job...

APUE (section 12.9) explains the situation with pthreads and fork():
"Inside the child process, only one thread exists"
Of course, I'd be willing to bet there are some platforms, or versions of platforms, that got that wrong... APUE does cover implementation differences in places, but it'd take more time to tell you whether it lists any for fork() in pthreads...
# 20  
Old 04-17-2011
Quote:
Originally Posted by tetsujin
Hm, might have to think about that one. Of course you could get around it by simply copying all the env. variables to the shell's process memory and operating on those copies of the variables - a bit wasteful but a pretty simple dodge...
That's partway to just creating a new process then, since it would have some of the same side-effects -- changes from one wouldn't propagate back and vice versa.
# 21  
Old 04-17-2011
Quote:
Originally Posted by Corona688
That's partway to just creating a new process then, since it would have some of the same side-effects -- changes from one wouldn't propagate back and vice versa.
Well, we're talking about using shared memory to export this copy of the shell variables to fork()'ed shell processes - so as long as the memory is shared writable, and as long as access to it is synchronized adequately, those shell processes would be able to propagate their variable changes back to the main shell...