Perl thread question

Eric · Feb 16, 2007

Hello,

I was given a task that requires me to launch multiple executions of
the same script using different args as input. Instead of waiting for
the first to complete before starting the second one, I need to run
them in their own thread (i.e. parallel).

Here is what I've done so far. The script I am executing is called
'log_on_off.exp', which is an Expect script that logs on, then off of
a console, which is provided as input to the command:

vvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvv
use Config;
use threads;

my @cmdline = ("log_on_off.exp mach003-con",
"log_on_off.exp mach022-con",
"log_on_off.exp mach030-con");

foreach (@cmdline) {
my $thr = threads->new(\&runExpectScript, $_);
}

sub runExpectScript {
my $cmd = $_;
my $runScript = `$cmd`;
}
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^

After running this script, I can see that all three executions are
running as separate processes, which is what I would expect:

$ ps
PID TTY TIME CMD
2150 pts/1 00:00:23 bash
30151 pts/1 00:00:00 log_on_off.exp
30153 pts/1 00:00:00 log_on_off.exp
30155 pts/1 00:00:00 log_on_off.exp
30268 pts/1 00:00:00 ps
$

It turns out that I need the result of the subroutine to determine if
the login, logout was successful. So I tried using the join function
as follows:

vvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvv
use Config;
use threads;

my @cmdline = ("log_on_off.exp mach003-con",
"log_on_off.exp mach022-con",
"log_on_off.exp mach030-con");

foreach (@cmdline) {
my $thr = threads->new(\&runExpectScript, $_);
my $retResponse = $thr->join;
}

sub runExpectScript {
my $cmd = $_;
my $runScript = `$cmd`;

if ($runScript =~ m/successful/) {
print "SUCCESS\n";
} else {
print "FAILURE\n";
}
}
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^

When I run this Perl script in the background, I never see more than
one log_on_off.exp process running at any time. According to the
document I am using as a learning tool (http://perldoc.perl.org/
perlthrtut.html), the join function waits for the thread to exit
before continuing. I interpret this as saying that the second command
will not be started until the first returns, which defeats the purpose
of my using Perl threads to begin with, since these commands are being
executed serially (which is no different than running them in a loop
without even using Perl threads). Is my interpretation correct?

Is there any way I can execute these commands and parse the input in
their own thread instead of waiting for the previous command to
finish? Of course, if this can be done, I need to be able to determine
what thread returned what value.

Thanks in advance to all that respond.

Eric

J. Gleixner · Feb 16, 2007

Parallel::ForkManagerEric said:
Hello,

I was given a task that requires me to launch multiple executions of
the same script using different args as input. Instead of waiting for
the first to complete before starting the second one, I need to run
them in their own thread (i.e. parallel).

Maybe take a look at Parallel::ForkManager.

Eric · Feb 16, 2007

Eric said:
Eric said:

Hello,

Click to expand...

I was given a task that requires me to launch multiple executions of
the same script using different args as input. Instead of waiting for
the first to complete before starting the second one, I need to run
them in their own thread (i.e. parallel).
[...]

It turns out that I need the result of the subroutine to determine if
the login, logout was successful. So I tried using the join function
as follows:

Click to expand...

vvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvv
use Config;
use threads;

Click to expand...

my @cmdline = ("log_on_off.exp mach003-con",
"log_on_off.exp mach022-con",
"log_on_off.exp mach030-con");

Click to expand...

foreach (@cmdline) {
my $thr = threads->new(\&runExpectScript, $_);
my $retResponse = $thr->join;
}

Click to expand...

sub runExpectScript {
my $cmd = $_;
my $runScript = `$cmd`;

Click to expand...

if ($runScript =~ m/successful/) {
print "SUCCESS\n";
} else {
print "FAILURE\n";
}
}
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^

Click to expand...

When I run this Perl script in the background, I never see more than
one log_on_off.exp process running at any time. According to the
document I am using as a learning tool (http://perldoc.perl.org/
perlthrtut.html), the join function waits for the thread to exit
before continuing. I interpret this as saying that the second command
will not be started until the first returns, which defeats the purpose
of my using Perl threads to begin with, since these commands are being
executed serially (which is no different than running them in a loop
without even using Perl threads). Is my interpretation correct?

Click to expand...

Yes. You're mixing up two different things here: the execution
of the threads and the collection of their results. In your example
you're in fact waiting for the result of one thread before starting
the next one and making the use of threads obsolete.

If you want to parallelise threads, you have to first start all of
them, then collect all of them, like with

my @thrdlist;

foreach( 0 .. $#cmdline )
{
$thrdlist[$_] = threads->new( \&runExpectScript, $cmdline[$_] );

}

my @responses;

foreach( 0 .. $#cmdline )
{
$responses[$_] = $thrdlist[$_]->join();

}

In that example the threads are all fired as quickly as Perl permits,
then the result collection loop runs until all of them are finished
and their results can be fetched.

Note that it does hardly matter in what order the threads finish, as
when one of the first threads takes longer than later ones they simply
wait in the queue until they are join()ed. So the speed loss is just
the execution time of the remaining loop iterations (fetching the
return value and destroying the thread).

Is there any way I can execute these commands and parse the input in
their own thread instead of waiting for the previous command to
finish? Of course, if this can be done, I need to be able to determine
what thread returned what value.

Click to expand...

That threads topic can give a lot of headache, that I know from
experience, but usually things tend to be a lot simpler than one
would have thought

btw., I noticed that in your script you have
sub runExpectScript {
my $cmd = $_;

You surely want to use
my $cmd = shift;
or
my $cmd = $_[0];
here, as it's more of an accident that $_ holds the correct value at
this point and the correct place to look for subroutine args is @_.

-Chris- Hide quoted text -

- Show quoted text -

Thanks for your response, Chris. After I submitted my initial entry, I
had an enlightenment and tried the following:

vvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvv
#!/usr/bin/perl

use strict;
use warnings;

use Config;
use threads;

$Config{useithreads} or die "Recompile Perl with threads to run this
program.";

# Put all of the command line args in an array.
my @cmdline = ("log_on_off.exp mach003_con",
"log_on_off.exp mach022_con",
"log_on_off.exp mach030_con");

# Execute each command in the array in it's own thread.
foreach (@cmdline) {
my $thr = threads->new(\&runExpectScript, $_);
sleep 1;
print "Value of \$thr is: $$thr\n";
my $ps = system("ps"); ##
}

# Specify the join function for each thread in the list.
for my $t (threads->list) {
print "Value of \$t is: $$t\n"; ##
$t->join;
}

sub runExpectScript {
my $cmd = shift;
print "The command line is: $cmd\n";
my $runScript = `$cmd`;
#print "The contents of \$runScript are: $runScript\n";
if ($runScript =~ m/successful/) {
print "SUCCESS: $_\n";
} else {
print "FAILURE: $_\n";
}
}
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^

This seems to work. Here is the output I get:

vvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvv
$ m.pl
The command line is: log_on_off.exp pdp003 pdp003-ilo
Value of $thr is: 140820824
PID TTY TIME CMD
2150 pts/1 00:00:24 bash
4194 pts/1 00:00:00 m.pl
4196 pts/1 00:00:00 log_on_off.exp
4235 pts/1 00:00:00 ps
The command line is: log_on_off.exp pdp022 pdp022-ilo
Value of $thr is: 141287696
PID TTY TIME CMD
2150 pts/1 00:00:24 bash
4194 pts/1 00:00:00 m.pl
4196 pts/1 00:00:00 log_on_off.exp
4237 pts/1 00:00:00 log_on_off.exp
4276 pts/1 00:00:00 ps
The command line is: log_on_off.exp osdc-pdp030 osdc-pdp030-ilo
Value of $thr is: 141228056
PID TTY TIME CMD
2150 pts/1 00:00:24 bash
4194 pts/1 00:00:00 m.pl
4196 pts/1 00:00:00 log_on_off.exp
4237 pts/1 00:00:00 log_on_off.exp
4278 pts/1 00:00:00 log_on_off.exp
4316 pts/1 00:00:00 ps
Value of $t is: 140820824
SUCCESS: log_on_off.exp pdp003 pdp003-ilo
Value of $t is: 141287696
SUCCESS: log_on_off.exp osdc-pdp030 osdc-pdp030-ilo
SUCCESS: log_on_off.exp pdp022 pdp022-ilo
Value of $t is: 141228056
[ecarlson@ecarlson-dev1 remboot]$
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^

As you can see, it appears that the processes are all started and
running as separate processes, as I would expect. The results are not
given in any particular order (which is ok), but appear to be
associated with the correct process based on command line. (I tried
some failure conditions and found that they match up.) The main
difference between my and your approach is that you went to extra
effort to identify each process by a number, which is probably a more
sure way than what I did. Do you feel that my approach will work
despite the fact that I didn't do this?

Thanks.

Eric

Eric · Feb 16, 2007

Maybe take a look at Parallel::ForkManager.

Thanks for your response. This is certainly worth my looking into, if
anything for future reference.

Eric

Eric · Feb 16, 2007

Eric wrote:

[fire-first-collect-later code snipped]

PID TTY TIME CMD
2150 pts/1 00:00:24 bash
4194 pts/1 00:00:00 m.pl
4196 pts/1 00:00:00 log_on_off.exp
4237 pts/1 00:00:00 log_on_off.exp
4278 pts/1 00:00:00 log_on_off.exp
4316 pts/1 00:00:00 ps
Value of $t is: 140820824
SUCCESS: log_on_off.exp pdp003 pdp003-ilo
Value of $t is: 141287696
SUCCESS: log_on_off.exp osdc-pdp030 osdc-pdp030-ilo
SUCCESS: log_on_off.exp pdp022 pdp022-ilo
Value of $t is: 141228056
[ecarlson@ecarlson-dev1 remboot]$
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^

Click to expand...

As you can see, it appears that the processes are all started and
running as separate processes, as I would expect. The results are not
given in any particular order (which is ok), but appear to be
associated with the correct process based on command line. (I tried
some failure conditions and found that they match up.) The main
difference between my and your approach is that you went to extra
effort to identify each process by a number, which is probably a more
sure way than what I did. Do you feel that my approach will work
despite the fact that I didn't do this?

Click to expand...

If you're only going for some lines of text to the screen, I don't
see why not. Though you could also return a more complex data
structure that holds both the task identification and the result and
avoid the counter this way - like always, TMTOWTDI[1] - if you
want to work on the results some more later on in your script.

But I see in your code that you're still using $_ in the subroutine
and laying out traps for yourself (and I admit not being completely
clean there with my own example). Especially with threads, where
things may happen in random order, it's always wise to keep from
$_ as far as possible. Shift into variables in the sub, and use a
named iterator in the loops, like

foreach my $count ( 0 .. $#cmdline ) {
do_something_with( $count );

}

or

foreach my $nextcommand ( @commandline ) {
do_something_with( $nextcommand );

}

or tracing back random errors may become a hell of a job once the
projects get a bit more complex.

-Chris

1) There's More Than One Way To Do It, the Perl(5?) philosophy.- Hide quoted text -

- Show quoted text -

Good point on the $_, Chris. I usually try to take the Perl shortcut
to doing things. But sometimes that may not always be the best
approach, and possibly threads is one such case where it is not the
wisest thing to do.

Eric

Perl calling ps - COLUMNS ignored?	1	Jul 10, 2013
C# How to convert date into en-US when thread culture is ar-SA	1	Feb 18, 2021
thread problem	1	Sep 6, 2013
Brocade Switch Perl Script	1	Aug 19, 2016
Perl threads - capturing value returned from sub	6	Feb 27, 2007
Python Gurobi Optimizing Cost has no errors but I get no sensible solution	0	Aug 30, 2022
Question from the Perl Thread Tutorial	6	Dec 16, 2006
java thread question	9	Apr 3, 2014

Perl thread question

Eric

J. Gleixner

Eric

Eric

Eric

Ask a Question

Similar Threads

Members online

Forum statistics

Latest Threads