S
Steven D'Aprano
I'm trying to scrape a Wikipedia page from Python. Following instructions
here:
http://en.wikipedia.org/wiki/Wikipedia:Database_download
http://en.wikipedia.org/wiki/Special:Export
I use the URL "http://en.wikipedia.org/wiki/Special:Export/Train" instead
of just "http://en.wikipedia.org/wiki/Train". But instead of getting the
page I expect, and can see in my browser, I get an error page:
....
Our servers are currently experiencing a technical problem. This is
probably temporary and should be fixed soon
....
(Output is obviously truncated for your sanity and mine.)
Is there a trick to downloading from Wikipedia with urllib?
here:
http://en.wikipedia.org/wiki/Wikipedia:Database_download
http://en.wikipedia.org/wiki/Special:Export
I use the URL "http://en.wikipedia.org/wiki/Special:Export/Train" instead
of just "http://en.wikipedia.org/wiki/Train". But instead of getting the
page I expect, and can see in my browser, I get an error page:
....
Our servers are currently experiencing a technical problem. This is
probably temporary and should be fixed soon
....
(Output is obviously truncated for your sanity and mine.)
Is there a trick to downloading from Wikipedia with urllib?