How to do screen scraping where the site requires a log in

A

Alan Silver

Hello,

I would like to pull some information off a site that requires a log in.
I have a subscription to a premium content site, and I would like to be
able to do a few automatic requests instead of having to load the site
manually in a browser.

I have seen plenty articles that explain how to do screen scraping in
..NET, others that describe how to do it via a POST, but I couldn't find
any that covered my scenario.

Basically the problem is that the code would first have to call the home
page, then fill in the log in entries and post the page back. Then, the
code would need to hang on to the cookie (which is what I assume they
are using) so that when it does another request (GET would be fine
here), the site will allow the request and not think the requester is
not logged in.

This all works fine in a browser, as the browser handles the cookie for
you, but the code examples I have seen seem to use completely stateless
requests (ie no cookies preserved), so it wouldn't work for a site like
this.

Any ideas? TIA
 
Joined
Sep 5, 2006
Messages
11
Reaction score
0
Hi Alan,

All you need in this case is a global CookieContainer object that you assign to each request you make. The assignment is ByRef, so the request will add its cookies to the global instance of the container. This way, all subsequent requests will have whatever cookies are gathered during login.

If you need some code for this, I can dig it up from my archives; it's been a couple of years, but I know I have it floating around somewhere. The concept is pretty simple, though, so you may be able to get it working with no further help.
 
A

Alan Silver

You can try SWExplorerAutomation (SWEA) (http:\\webunittesting.com).

Thanks, looks interesting. The only shame is that I prefer to write my
own code rather than use someone else's. You don't get to understand
what's going on when you use a 3rd party app to do the grunt work.
 

Ask a Question

Want to reply to this thread or ask your own question?

You'll need to choose a username for the site, which only take a couple of moments. After that, you can post your question and our members will help you out.

Ask a Question

Members online

No members online now.

Forum statistics

Threads
474,002
Messages
2,570,259
Members
46,858
Latest member
FlorrieTuf

Latest Threads

Top