Threads, waiting for last one to finish

Roedy Green · Jan 14, 2013

I have difficulty seeing how you wouldn't be able
to figure out how many threads you propose to start.

I easily do it, it is just that in my case it takes an extra pass.

Arved Sandstrom · Jan 14, 2013

Yes, no, it depends.

It is implementation dependent so it can and do differ between
different vendors and platforms.

Some may provide an option for changing the default.

I have once read that SUN/Oracle Java on Windows and Linux uses
320 KB and 1024KB as default on 32 and 64 bit respectively.

SUN/Oracle Java do provide the -Xss option to change the
default.

Arne

I use the -Xss on occasion for some apps, sometimes it's part of their
tuning docs. Usually I don't worry about thread stack size, not if I've
got a relatively small number (25 certainly being small) of threads.

Not when I need GB of memory for heap, and hundreds of megs for perm gen.

My approach here is my usual approach - go for a simple design, and if a
quick gut-check of performance implications raises no red flags then go
with that for implementation.

AHS

Knute Johnson · Jan 14, 2013

I am surprised that this got just one mention so far.

Me too

.

Daniel Pitts · Jan 14, 2013

Can you be more specific about when shutdown and await does
not work?

Arne

I am also interested. However, I will say that I would have used the
ExecutorService and simply used the Future objects to wait for
completion of all tasks. This has the added benefit that you can
actually process the results of the Callables in a single thread, rather
than have them try to update some shared data structure. Using the
Future result is inherently easier to get correct.

Arne Vajhøj · Jan 15, 2013

I use the -Xss on occasion for some apps, sometimes it's part of their
tuning docs. Usually I don't worry about thread stack size, not if I've
got a relatively small number (25 certainly being small) of threads.

Not when I need GB of memory for heap, and hundreds of megs for perm gen.

My approach here is my usual approach - go for a simple design, and if a
quick gut-check of performance implications raises no red flags then go
with that for implementation.

I think I have only used it once and that was a demo to prove that
Tomcat could handle 1000 executing but waiting requests in parallel.

But Roedy seemed concerned and the option is there.

Arne

Kevin McMurtrie · Jan 15, 2013

Arne Vajhøj said:
Can you be more specific about when shutdown and await does
not work?

Arne

There are race conditions in determining the predicted thread states.
There's simply no way to check the work queue plus all of the states of
all of the threads in an atomic manner without locking or using a master
thread. Sun's implementation has almost no locking and no master thread
so it makes mistakes while threads are transitioning states. The
variable sized thread pool is the worst (glitches maybe 10% of the
time!) but the others have rare glitches too. These glitches are fixed
when a new task is queued so a steady stream of tasks appears to work
correctly. What happened with the very last task is a gamble. That's
why you shouldn't use shutdown() to wait for completion.

I'm not going to walk through the code here. It's awful stuff that's
difficult to follow and it changes a little with each JVM release. You
can step through it in a debugger and see that there's some wishful
thinking in there. Careful placement of breakpoints can produce a fault
that doesn't recover when all breakpoints are removed.

Arved Sandstrom · Jan 15, 2013

You can always join() them.

Click to expand...

I am surprised that this got just one mention so far. In absence of
Executor (which has some issues, as has been mentioned) this is the most
straightforward way to do it.

// untested and from memory
final Thread[] threads = new Thread[15];

for ( int i = 0; i < threads.length; ++i ) {
final Thread th = new Thread(...);
th.start();
threads = th;
}

for ( final Thread th : threads ) {
th.join();
}

Kind regards

robert

You're both right, of course. One of the hazards - speaking for myself -
of seizing on java.util.concurrent and reading JCIP through several
times is that one may tend to forget the original Java concurrency
constructs.

I think join() satisfies Roedy's problem statement the best; in any case
the latches and barriers have different endgame behaviour.

AHS

markspace · Jan 15, 2013

variable sized thread pool is the worst (glitches maybe 10% of the
time!) but the others have rare glitches too. These glitches are fixed
when a new task is queued so a steady stream of tasks appears to work
correctly. What happened with the very last task is a gamble. That's
why you shouldn't use shutdown() to wait for completion.

I'm not going to walk through the code here. It's awful stuff that's

Hmm, could you produce a code example that glitches 10% of the time?
Otherwise I'm afraid that I'll have to file your claims under "internet
b.s." There's no way I'm going to believe that Oracle's code is that
buggy just on hearsay.

Kevin McMurtrie · Jan 16, 2013

markspace said:
Hmm, could you produce a code example that glitches 10% of the time?
Otherwise I'm afraid that I'll have to file your claims under "internet
b.s." There's no way I'm going to believe that Oracle's code is that
buggy just on hearsay.

I can prove it to you when you're paying me for it. Here are some bugs
with the variable sized thread pool:

With short tasks on a system having many CPU cores, the time it takes a
thread to be seen as idle becomes large compared to the task execution
time. The result is the needless creation of more threads The surge of
extra threads steals CPU time from existing threads, making the problem
worse. You can demonstrate this by feeding a fixed number of rapidly
completing tasks into a thread pool and then checking the pool size. A
common symptom can be seen in Tomcat on very large servers, where the
connection handler pool suddenly jumps from ~100 to thousands of threads.

The idle timeout doesn't work. Threads are used round-robin so they're
never idle as long as tasks keep coming at a rate of timeout/threads.
For a timeout of 60 seconds, 100 tasks per second will keep a pool of at
least 6000 threads alive. Combine this with the previous bug and you
can see that the variable sized pool is very broken.

Scheduled tasks do not expand the pool size so they starve a thread pool
of workers. Read the code for ScheduledThreadPoolExecutor if you'd like.

java.util.concurrent.Delayed is defined in relative time and must
implement Comparable<Delayed>. It's impossible to sort tasks using
relative time so Sun hacked it. Look at
java.util.concurrent.ScheduledThreadPoolExecutor.ScheduledFutureTask.comp
areTo(Delayed). Any implementation other than
ScheduledFutureCallableLink does not work reliably.

Fixed size pools have some problems too but they're less frequent and
unobtrusive. You'd probably never hit them unless you're using
shutdown() rather than Future.get() to wait for task completion.

Kevin McMurtrie · Jan 19, 2013

markspace said:
Hmm, could you produce a code example that glitches 10% of the time?
Otherwise I'm afraid that I'll have to file your claims under "internet
b.s." There's no way I'm going to believe that Oracle's code is that
buggy just on hearsay.

I hit another one today and it's directly related to the original post.
The shutdown() method causes a spin loop bug in
ThreadPoolExecutor.getTask().

This is wrong:
if (state == SHUTDOWN) // Help drain queue
r = workQueue.poll();

It should be:
if (state == SHUTDOWN) // Help drain queue
return workQueue.poll();

This was very frustrating because CPU was at 400% but it wouldn't show
up in any Java profiler. I had to pause all threads and start them up
one at a time.

shutdownNow() doesn't cause the bug.

Robert Klemme · Jan 20, 2013

Fixed size pools have some problems too but they're less frequent and
unobtrusive. You'd probably never hit them unless you're using
shutdown() rather than Future.get() to wait for task completion.

I was going to add that most of the issues of the implementation we are
talking about stem from the fact that the number of active threads is
adjusted all the time, i.e. threads are created and die (or at least
that was the intention) all the time.

For most real world problems I am aware of it is sufficient to create
all threads upfront and let them block waiting for work. You are
expecting to use them anyway at some point in time (that's why you set a
specific max) so at least when load is highest the biggest number of
threads is active anyway. Other resources (memory) have to be adjusted
to that fact anyway and if the pool is idle there will be enough memory
to keep those blocking threads around.

The only other scheme that seems to be reasonable to me is to create
threads on demand and never let them die before pool shutdown. That
would at least ensure low overhead for low load situations. But even
this scheme has some overhead in terms of locking already.

My preferred way for shutdown of thread pools is a stop element of which
as many instances as threads around are inserted into the queue. Every
worker threads which reads it terminates itself. That way you do not
need any additional locking and just join all active threads before you
let the shutdown method return. The details of proper shutdown are
still tricky because one can introduce a number of nasty race conditions
which will leave unprocessed items in the queue.

There are many applications though where production of work for the pool
is stopped before the pool is shut down so shutdown race conditions are
not an issues for those types of applications.

Kind regards

robert

Robert Klemme · Jan 26, 2013

Could be more specific?

See his reply from January 16. in this thread.

Kind regards

robert

How do I wait for a coroutine to finish?	1	May 18, 2023
I have to finish this code for my assignment but I cant figure out how to solve it	1	Jun 27, 2023
Merge sort and threads	0	Aug 16, 2024
Get await function in loop to finish before script ends	0	Oct 14, 2021
thread pool with waiting option	22	Apr 10, 2014
Help for my project in the last minute	0	Apr 23, 2022
How can I calculate the last payment for Reprofiled Amount column with 2 decimal places to make the sum of all payments to be the same as RC amount?	2	Jul 13, 2023
waiting threads until it gets value	2	Dec 27, 2009

Threads, waiting for last one to finish

Roedy Green

Arved Sandstrom

Knute Johnson

Daniel Pitts

Arne Vajhøj

Kevin McMurtrie

Arved Sandstrom

markspace

Kevin McMurtrie

Kevin McMurtrie

Robert Klemme

Robert Klemme

Ask a Question

Similar Threads

Members online

Forum statistics

Latest Threads