VB
Hi,
I'm building a metasearch engine based on data mining techniques, but
that is not important here.
My question is about the performance of scraping search engine
results from an HTML response page.
I see that some metasearch engines (Mamma, DogPile, Vivisimo & Co.)
present the top 50 results from 3-5 search engines in about 1 second.
With my Perl script I can retrieve the top 100 results from Google in
about 1.5 seconds, but from only one search engine!
Can somebody very skilled in Perl tell me some advanced technique
(parallelism, threads, or whatever else) to retrieve results from 3-5
search engines very fast? (Hardware is not the issue here; I have fast
hardware.)
Excuse my English (I'm Italian) and my poor Perl skills.
Thanks,
VB