Reading Data from a Website

A

anon1m0us

Hi;
No clue how to do this. My program to go to a website and read data and
process it. Don't kow where to even begin! How do I go to a website in
RUBY? How to I start reading the data?
 
A

Andy Lester

Hi;
No clue how to do this. My program to go to a website and read data
and
process it. Don't kow where to even begin! How do I go to a website in
RUBY? How to I start reading the data?

Look at WWW::Mechanize.
 
A

anon1m0us

Is that a website? Where do I see that stuff?
In addition;
i need to view the Source of the website since the information are
contained in tables on the website.
 
P

Peter Szinek

anon1m0us said:
Hi;
No clue how to do this. My program to go to a website and read data and
process it. Don't kow where to even begin! How do I go to a website in
RUBY? How to I start reading the data?

You could check out my older (but still fine I guess) article on this:

http://www.rubyrailways.com/data-extraction-for-web-20-screen-scraping-in-rubyrails

It would need some polishing and adding HPricot there (working on it
actually), but even like this it could provide some help.

btw. I am just releasing (in 2-3-4 something days) a powerful web
extraction language written in Ruby. It is based on Mechanize and
Hpricot and it really does a lot of heavy lifting (although I may be a
little bit biased for obvious reasons :) - well you will see it
yourself next week)

Peter
__
http://www.rubyrailways.com
 
G

Gavin Baker

btw. I am just releasing (in 2-3-4 something days) a powerful web
extraction language written in Ruby. It is based on Mechanize and
Hpricot and it really does a lot of heavy lifting (although I may
be a little bit biased for obvious reasons :) - well you will see
it yourself next week)

After finding your article on screen scraping *very* useful, I'm
really looking forward to this!

Gav
 
P

Peter Szinek

Gavin said:
On 18 Jan 2007, at 21:34, Peter Szinek wrote:
After finding your article on screen scraping *very* useful, I'm really
looking forward to this!

I am happy to hear this... Web scraping can be very-very-very tedious,
(even with a superb tool like scRUBYt! :)) so I will need a lot of
users to try it on a lot of pages to help find and report the problems
and come out with a really stable system. On the pages I am testing it
works perfectly (and it already has a decent feature set), however, so
far nearly always when I went to a previously unknown page there were
some problems...

However, as you will see it will worth the time to report problems etc.
because in the case of complex scenarios the solution will be much-much
faster and robust than with a hand-coded stuff...

Back to coding :)

Cheers,
Peter

__
http://www.rubyrailways.com
 

Ask a Question

Want to reply to this thread or ask your own question?

You'll need to choose a username for the site, which only take a couple of moments. After that, you can post your question and our members will help you out.

Ask a Question

Members online

No members online now.

Forum statistics

Threads
474,222
Messages
2,571,142
Members
47,757
Latest member
PDIJaclyn

Latest Threads

Top