ruby Thread in CGI application?

ako... · Jan 12, 2006

Hello,

i heard that ruby's Thread is not a native system thread but is
implemented in the interpreter itself. is that true? does this mean
that is is lighter than native threads? does anyone know if
Thread.start on every web request in my CGI application would severely
reduce my web server's scalability?

thanks
konstantin

David Vallner · Jan 12, 2006

ako... said:
Hello,

i heard that ruby's Thread is not a native system thread but is
implemented in the interpreter itself. is that true? does this mean
that is is lighter than native threads? does anyone know if
Thread.start on every web request in my CGI application would severely
reduce my web server's scalability?

thanks
konstantin

Err. CGI already does start a new process to process each request,
spinning off a worker thread is superfluous.

David Vallner

ako... · Jan 12, 2006

right. i mean FastCGI. a separate thread might be needed to execute a
code fragment with a safety level which is different from the safety
level of the rest of the CGI script.

konstantin

Zed Shaw · Jan 12, 2006

Hi konstantin,

As the author of SCGI (http://www.zedshaw.com/projects/scgi_rails/) I
can tell you that this is possible, but you have to be careful not to
step on anyone's toes. As David Vallner mentioned previously it's
pretty pointless in a plain CGI. In FastCGI, SCGI, or mod_ruby it's
viable but tricky.

Ruby threads are implemented using a select IO loop, so they're
fairly lightweight. You can also use Ruby's fork to give yourself
N:M threading if you want multiple processors to handle more requests
or if you want more than select can handle. Generally the Ruby
threading model is good but is a little weird in places.

From the SCGI perspective you want to avoid the following:

1) Leaving IO objects open after you're done with your request. It
will eventually get cleaned out (and I'm working on cleaning more),
but it's a total waste. Be good and keep your house clean.
2) Putting stuff in the thread local store. This is generally OK in
your own threads, but again this stuff doesn't go away until the GC
collects it which in Ruby means quite a while. If you seem to get
weird leaks then investigate this.
3) Wildly making calls to Rails functions inside the threads. Rails
isn't very thread safe, in fact I'd say it's not thread safe at all.
If you're running multiple threads that try to hit AR for example,
then you'll get tons of DB connections since AR by default uses
thread local storage for it's DB connection pooling. Other examples
are how modules get loaded--if a model or controller isn't loaded
when your thread fires off and you try to use it then Ruby throws a fit.
4) Expecting threads to solve problems in your workflow design.
Typically I see people who want to use threads to offload some
processing in the background so that the user can continue using the
web app while their big work is being done. What invariably ends up
happening is the background thread doesn't run reliably and causes
major havoc with the rest of the application's processing. This is
especially true if you use fork or exec. A better approach is to
either redesign the workflow to not need this or write a separate DRb
server that runs stand-alone and processes these requests. The DRb
solution is also really powerful since you can then offload the
processing to tons of other machines if you need.

Anyway, good luck on it. Feel free to hit me up if you have thread
problems.

Zed A. Shaw
http://www.zedshaw.com/

David Vallner · Jan 12, 2006

That said, it might also be questionable just how useful it is to create =
=20
request processor threads even in FastCGI / SCGI, because that's precisel=
y =20
the purpose of those. You're very likely better off tweaking the settings=
=20
of those to get what you might want done.

Now, a longer rant for the sake of completion: Using worker threads is =20
more viable, but that is only useful in certain scenarios, for example:

1. In interactive long-running programs, where you want to maintain a =20
responsive user interface while doing background work. This is _very_ =20
rarely the case with a web application, because you usually need to =20
process each request in finite (and rather short) time.

2. On a multiple CPU system, where algorithms used in processing the =20
request might benefit from parallelism running multiple processors. Thoug=
h =20
then again, you probably wouldn't do CPU-intensive operations as these =20
algorithms tend to be in Ruby anyway. As more realistic examples, =20
asynchronous database / remote calls come to mind if you want to implemen=
t =20
your own timeout policy to those to handle badly written third party =20
libraries graciously to prevent your application from hanging up.

Unless you have due cause to do so, do avoid multithreaded programming or=
=20
keep it very simple, there's just way too many ways to shoot yourself in =
=20
the foot.

David Vallner

Stefan Kaes · Jan 14, 2006

David said:
2. On a multiple CPU system, where algorithms used in processing the
request might benefit from parallelism running multiple processors.
Though then again, you probably wouldn't do CPU-intensive operations
as these algorithms tend to be in Ruby anyway. As more realistic
examples, asynchronous database / remote calls come to mind if you
want to implement your own timeout policy to those to handle badly
written third party libraries graciously to prevent your application
from hanging up.

Currently you won't benefit from multiple CPUs as Ruby doesn't use
native threads.

-- stefan

David Vallner · Jan 14, 2006

Currently you won't benefit from multiple CPUs as Ruby doesn't use =20
native threads.

-- stefan

Slipped my mind. I was talking in general anyway. And doesn't the pthread=
=20
binding let you use native threads? (Didn't really do anything with it =20
himself)

David Vallner

Stefan Kaes · Jan 14, 2006

David said:
Slipped my mind. I was talking in general anyway. And doesn't the
pthread binding let you use native threads? (Didn't really do anything
with it himself)

David Vallner

I don't know enough about Ruby's pthread support to have an informed
opinion on this question :-(

-- stefan

Robert Klemme · Jan 14, 2006

2006/1/14 said:
I don't know enough about Ruby's pthread support to have an informed
opinion on this question :-(

AFAIK there is no support for any native threading in standard Ruby
1.x. The interpreter just isn't designed for this. For web apps
Ruby's threading might be sufficient in the general case because there
is a lot IO and multiplexing on IO should work find.

Btw, native thread support for me is one of the key improvements of Ruby 2.

Kind regards

robert

How to call Ruby from non-interpreter native thread?	1	Apr 1, 2011
Ruby a good choice for CGI?	23	Nov 25, 2009
Monitor + killing thread => thread in an aborting state	1	Feb 22, 2009
deque and thread-safety	0	Oct 12, 2012
Installing Ruby Application	6	Jan 26, 2010
The desktop application on Ruby	1	Mar 5, 2010
rb_funcall() Ruby code callback invoked from within a native thread?	0	Oct 4, 2006
boost::thread	6	Aug 1, 2011

ruby Thread in CGI application?

ako...

David Vallner

ako...

Zed Shaw

David Vallner

Stefan Kaes

David Vallner

Stefan Kaes

Robert Klemme

Ask a Question

Similar Threads

Members online

Forum statistics

Latest Threads