downloading web page as HTML accessed through WATIR

M

michael

is there anyway to download the page accessed through WATIR as html
page??

for instance,


require 'watir'

ie = Watir::IE.start("http://www.yahoo.com")


above codes will open www.yahoo.com.. what should we do to download
this page as yahoo.htm??

any suggestion or hints will be deeply appreciated..


michael
 
L

lrlebron

ie.html is what you are looking for

aFile = File.new("yahoo.htm" , "w")
aFile << ie.html
aFile.close

and if you want to view the file then

ie.goto("yahoo.htm")
 
D

David Vallner

--------------enig6BF3527D94378911740CEE77
Content-Type: text/plain; charset=ISO-8859-1
Content-Transfer-Encoding: quoted-printable
is there anyway to download the page accessed through WATIR as html
page??
=20
for instance,
=20
=20
require 'watir'
=20
ie =3D Watir::IE.start("http://www.yahoo.com")
=20
=20
above codes will open www.yahoo.com.. what should we do to download
this page as yahoo.htm??
=20
any suggestion or hints will be deeply appreciated..
=20
=20

If this is all you need, you might as well use Net:HTTP, or open-uri -
this should have smaller overhead since you're not instantiating an IE
control.

require 'open-uri'

open('http://www.yahoo.com/') { |html|
open('yahoo.html', 'w') { |out|
out.print(html.read)
}
}

David Vallner


--------------enig6BF3527D94378911740CEE77
Content-Type: application/pgp-signature; name="signature.asc"
Content-Description: OpenPGP digital signature
Content-Disposition: attachment; filename="signature.asc"

-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1.4.5 (MingW32)

iD8DBQFFO2+zy6MhrS8astoRAkUKAJ96rYWkLXiM9KUrLflYhKCZLtVGKgCdFtFu
J1DKU0MjpGzSRGjxX8HSEJ0=
=5vlt
-----END PGP SIGNATURE-----

--------------enig6BF3527D94378911740CEE77--
 
C

Chris McMahon

ie.html is what you are looking for

aFile = File.new("yahoo.htm" , "w")
aFile << ie.html
aFile.close

and if you want to view the file then

ie.goto("yahoo.htm")

Be very very careful here, make sure you understand what you are doing.
Watir does *not* see the HTML on the page-- Watir only sees the DOM in
Internet Explorer. If the HTML is missing a "/p>", for instance, Watir
can't see it. The #html method will *always* yield valid HTML, because
it it interpreting the DOM, regardless of how awful the original HTML
may or may not be.
 

Ask a Question

Want to reply to this thread or ask your own question?

You'll need to choose a username for the site, which only take a couple of moments. After that, you can post your question and our members will help you out.

Ask a Question

Members online

No members online now.

Forum statistics

Threads
473,995
Messages
2,570,226
Members
46,815
Latest member
treekmostly22

Latest Threads

Top