Limit number of concurrent running threads in pool

J

Joe Martin

Hi
I created a pool of threads (say, 500 threads) to process. However, due
to the weight of each thread, I want to limit the number of threads that
run concurrently.

So how would I go about putting a limit on the number of threads that
run at any given time? I would like to take, say, 5 threads from the
pool and run them, and as each one completes, it is removed from the
pool and is replaced with a new thread from the pool.

Could this be done with a "spy" thread, in that it constantly loops to
check how many threads are running at once, and if the number of running
threads falls below the limit of 5, it takes the next thread out of the
pool and runs it? Not sure how I would go about doing this, pretty new
to multithreading.

Thanks!
 
R

Roger Pack

I created a pool of threads (say, 500 threads) to process. However, due
to the weight of each thread, I want to limit the number of threads that
run concurrently.

So how would I go about putting a limit on the number of threads that
run at any given time? I would like to take, say, 5 threads from the
pool and run them, and as each one completes, it is removed from the
pool and is replaced with a new thread from the pool.

http://github.com/spox/actionpool

might help.
-r
 
R

Robert Klemme

2010/3/4 Joe Martin said:
I created a pool of threads (say, 500 threads) to process. =A0However, du= e
to the weight of each thread, I want to limit the number of threads that
run concurrently.

So how would I go about putting a limit on the number of threads that
run at any given time? =A0I would like to take, say, 5 threads from the
pool and run them, and as each one completes, it is removed from the
pool and is replaced with a new thread from the pool.

Could this be done with a "spy" thread, in that it constantly loops to
check how many threads are running at once, and if the number of running
threads falls below the limit of 5, it takes the next thread out of the
pool and runs it? =A0Not sure how I would go about doing this, pretty new
to multithreading.

Why do you create a pool much larger than the load you want to accept?
Usually the pool size is used to limit concurrency. Actually that is
the main purpose of thread pools.

If you have different tasks for which you want to have different
limits on concurrency you could also create several pools with
different sizes.

Kind regards

robert

--=20
remember.guy do |as, often| as.you_can - without end
http://blog.rubybestpractices.com/
 
C

Chuck Remes

Hi
I created a pool of threads (say, 500 threads) to process. However, due
to the weight of each thread, I want to limit the number of threads that
run concurrently.

So how would I go about putting a limit on the number of threads that
run at any given time? I would like to take, say, 5 threads from the
pool and run them, and as each one completes, it is removed from the
pool and is replaced with a new thread from the pool.

Could this be done with a "spy" thread, in that it constantly loops to
check how many threads are running at once, and if the number of running
threads falls below the limit of 5, it takes the next thread out of the
pool and runs it? Not sure how I would go about doing this, pretty new
to multithreading.

I've had very good success using the Threadz gem.

http://github.com/nanodeath/threadz

It's quite easy to understand and works very well with MRI and JRuby.

cr
 
J

Joe Martin

Robert said:
Why do you create a pool much larger than the load you want to accept?
Usually the pool size is used to limit concurrency. Actually that is
the main purpose of thread pools.

If you have different tasks for which you want to have different
limits on concurrency you could also create several pools with
different sizes.

Kind regards

robert

When running this program, I will provide a list of items that need
processing. In some cases, this list can be as long as 250 items, in
other cases well over 50,000. The processing of each item can take
anywhere from 15 to 60 seconds per item, so you can see there is a
benefit to multithreading here. In processing each item, there are also
a number of database calls that occur, so I would like to put a cap on
the number of actively running threads to avoid overwhelming the
database. Am I going about this the wrong way? Is there a more
effecient more suitable way of doing this?
 
R

Robert Klemme

2010/3/5 Joe Martin said:
When running this program, I will provide a list of items that need
processing. =C2=A0In some cases, this list can be as long as 250 items, i= n
other cases well over 50,000. =C2=A0The processing of each item can take
anywhere from 15 to 60 seconds per item, so you can see there is a
benefit to multithreading here. =C2=A0In processing each item, there are = also
a number of database calls that occur, so I would like to put a cap on
the number of actively running threads to avoid overwhelming the
database. =C2=A0Am I going about this the wrong way? =C2=A0Is there a mor= e
effecient more suitable way of doing this?

For this scenario a thread pool with fixed size seems sufficient.

queue =3D Queue.new # or bounded queue

def cont(item) !item.nil? end

threads =3D (1..10).map do
Thread.new do
while (cont(item =3D queue.deq))
# .. process
end
end
end

queue.enc "Task"

threads.size.times do
queue.enq nil # terminate
end

threads.each {|th| th.join}

As simple as that.

Kind regards

robert


--=20
remember.guy do |as, often| as.you_can - without end
http://blog.rubybestpractices.com/
 
C

Chuck Remes

=20
When running this program, I will provide a list of items that need=20
processing. In some cases, this list can be as long as 250 items, in=20=
other cases well over 50,000. The processing of each item can take=20
anywhere from 15 to 60 seconds per item, so you can see there is a=20
benefit to multithreading here. In processing each item, there are = also=20
a number of database calls that occur, so I would like to put a cap on=20=
the number of actively running threads to avoid overwhelming the=20
database. Am I going about this the wrong way? Is there a more=20
effecient more suitable way of doing this?

The Threadz gem let's you create a thread pool and then wait on its =
completion before you add more to it. This mechanism will help you cap =
the number of threads making database calls.

cr
 
J

Joe Martin

Robert said:
For this scenario a thread pool with fixed size seems sufficient.

Very good. This works quite nicely as well.

Just wondering, are there any performance benefits of using one method
over the other?
 
C

Caleb Clausen

Very good. This works quite nicely as well.

Just wondering, are there any performance benefits of using one method
over the other?

Yes. Threads use memory (quite a lot of it, in fact). Mostly this goes
to the thread's stack. Limiting the number of threads saves quite a
bit of memory. I'm not sure there's any improvement in the amount of
cpu time either way, other than perhaps some fewer cache misses
resulting from using less memory.
 
R

Robert Klemme

Very good. This works quite nicely as well.

Just wondering, are there any performance benefits of using one method
over the other?

Which other method are you referring to?

Kind regards

robert
 
J

Joe Martin

Caleb said:
Yes. Threads use memory (quite a lot of it, in fact). Mostly this goes
to the thread's stack. Limiting the number of threads saves quite a
bit of memory. I'm not sure there's any improvement in the amount of
cpu time either way, other than perhaps some fewer cache misses
resulting from using less memory.

Thanks! I figured this was the case.
 
J

Joe Martin

Robert said:
Which other method are you referring to?

Well my original thinking was to create a thread for every single item I
pass to the program. Caleb answered this for me in that threads
themselves are quite heavy, so it further leads me to go with your
solution which creates a low number of threads and re-uses them until
the queue is empty. It seems much more simple and effecient than my
original plan.
 
R

Robert Klemme

2010/3/8 Joe Martin said:
Well my original thinking was to create a thread for every single item I
pass to the program. =A0Caleb answered this for me in that threads
themselves are quite heavy, so it further leads me to go with your
solution which creates a low number of threads and re-uses them until
the queue is empty. =A0It seems much more simple and effecient than my
original plan.

Ah, OK. Then I was just confused because you had used the term
"thread pool" in your original posting. That term is commonly use for
exactly the type of solution I posted (fixed or at least limited
number of threads which get their tasks from some form of queue). You
probably weren't aware of this.

Kind regards

robert


--=20
remember.guy do |as, often| as.you_can - without end
http://blog.rubybestpractices.com/
 

Ask a Question

Want to reply to this thread or ask your own question?

You'll need to choose a username for the site, which only take a couple of moments. After that, you can post your question and our members will help you out.

Ask a Question

Members online

Forum statistics

Threads
473,982
Messages
2,570,190
Members
46,740
Latest member
AdolphBig6

Latest Threads

Top