Get title from URL?

Cisco Ri · Apr 24, 2009

Anybody have a code snippet that extracts the title from the <title> tag
from a given URL?

Heesob Park · Apr 24, 2009

2009/4/24 Cisco Ri said:
Anybody have a code snippet that extracts the title from the <title> tag
from a given URL?

require 'rubygems'
require 'mechanize'
title = WWW::Mechanize.new.get('http://google.com').title
=> "Google"

Regards,
Park Heesob

Cisco Ri · Apr 24, 2009

Thanks, both work great.

RÃ¼diger Brahns · Apr 27, 2009

Cisco said:
Anybody have a code snippet that extracts the title from the <title> tag
from a given URL?

Without installing special things:

require 'open-uri'
open('http://google.com/').read =~ /<title>(.*?)<\/title>/
p $1

Cisco Ri · Apr 27, 2009

Heesob said:
require 'rubygems'
require 'mechanize'
title = WWW::Mechanize.new.get('http://google.com').title
=> "Google"

Regards,
Park Heesob

I used this method for a while, and it was fine for most sites.
However, with wikipedia.org it errored out with a 403 Forbidden error.
The Hpricot/open-uri method works for most sites, including
wikipedia.org, but for thesixtyone.com (Javascript intensive site) it
errors out with a 500 Internal Server error.

I haven't tried out the open-uri only method yet.

Thanks for the help everyone.

Heesob Park · Apr 28, 2009

2009/4/28 Cisco Ri said:
I used this method for a while, and it was fine for most sites.
However, with wikipedia.org it errored out with a 403 Forbidden error.
The Hpricot/open-uri method works for most sites, including
wikipedia.org, but for thesixtyone.com (Javascript intensive site) it
errors out with a 500 Internal Server error.

You can work around like this:

require 'rubygems'
require 'mechanize'
agent = WWW::Mechanize.new
agent.user_agent_alias = 'Mac Safari'
title = agent.get('http://wikipedia.org').title

Regards,
Park Heesob

Im having trouble containing a title an image and a button	2	Oct 26, 2022
Working on mobile css menu with plenty of frustration!	2	Dec 29, 2022
CSS Grid. Im having trouble containing a title an image and a button	1	Oct 25, 2022
I'm about to get in trouble with the HTML <body></body> tags	10	Aug 12, 2023
Why doesn't the function get called?	1	Nov 20, 2023
How to push data from one HTML page to another	4	Jan 3, 2024
New to coding Looking to make friends	5	Oct 25, 2024
Song requests	4	Aug 16, 2023

Get title from URL?

Cisco Ri

Heesob Park

Cisco Ri

RÃ¼diger Brahns

Cisco Ri

Heesob Park

Ask a Question

Similar Threads

Members online

Forum statistics

Latest Threads