S
slugger3113
Hi, I'm trying to get full/absolute URLs from relative links in HTML
documents. I've been trying to fudge this using File::Basename,
WWW::Mechanize, etc. but was wondering if there's a more ready-made
way to do this.
For example, if my main doc is:
http://www.abc.com/x/y/z/mydoc.html
and it contains a relative link to:
.../../otherdir/yourdoc.html
how do I get the absolute URL to "yourdoc.html"? Using the above
modules I've been able to get:
http://www.abc.com/x/y/z/../../otherdir/yourdoc.html
when what I want is:
http://www.abc.com/x/otherdir/yourdoc.html
Of course I could try and parse all of the possible variations for
relative paths, but it's making my head hurt and I was wondering if
there's a module that could help with this. Any thoughts would be
appreciated.
thanks
Scott
documents. I've been trying to fudge this using File::Basename,
WWW::Mechanize, etc. but was wondering if there's a more ready-made
way to do this.
For example, if my main doc is:
http://www.abc.com/x/y/z/mydoc.html
and it contains a relative link to:
.../../otherdir/yourdoc.html
how do I get the absolute URL to "yourdoc.html"? Using the above
modules I've been able to get:
http://www.abc.com/x/y/z/../../otherdir/yourdoc.html
when what I want is:
http://www.abc.com/x/otherdir/yourdoc.html
Of course I could try and parse all of the possible variations for
relative paths, but it's making my head hurt and I was wondering if
there's a module that could help with this. Any thoughts would be
appreciated.
thanks
Scott