C
cskilbeck
Hi,
I need to extract everything between <table> and </table> on a website
(there's only one table on the page. So far I have:
require 'open-uri'
page = open('http://xxx.html').read
page.gsub!(/\n/,"")
page.gsub!(/\r/,"")
inner = page.scan(%r{.*<table.*>(.*)</table>.*}m)
print inner
but inner is empty - any ideas?
If I substitute line 2 with
page = '123<table>456</table>789
I get inner = 456, which is correct.
I need to extract everything between <table> and </table> on a website
(there's only one table on the page. So far I have:
require 'open-uri'
page = open('http://xxx.html').read
page.gsub!(/\n/,"")
page.gsub!(/\r/,"")
inner = page.scan(%r{.*<table.*>(.*)</table>.*}m)
print inner
but inner is empty - any ideas?
If I substitute line 2 with
page = '123<table>456</table>789
I get inner = 456, which is correct.