Parsing HTML using TreeBuilder - how to get the "next" tag?

Bruce Horrocks · Jun 12, 2005

I have a large (6 MB) HTML file, generated by a software application's
"document" function, which I am trying to parse using
HTML::TreeBuilder. It consists of lots of lines in the form:

<p> Text text text text text
<p> Text text text text text
....
<p> Text text text text text
<h1>Section Heading</h1>
<p> Blah blah blah blah
<p> Blah blah blah blah
<p> Blah blah blah blah
....

I can use $tree->look_down() to find the h1 heading but then, how do I
get the next line? All the examples assume that the thing you want is a
*child* of the heading, not the *next* tag.

This requirement seems to be so basic that I must be missing something
but I can't see what. Perl is ActiveState 5.8.6 on Win32.

Thanks in advance

Bruce Horrocks · Jun 12, 2005

Bruce Horrocks said:
I can use $tree->look_down() to find the h1 heading but then, how do I
get the next line? All the examples assume that the thing you want is a
*child* of the heading, not the *next* tag.

Okay, found it (I think): HTML::Element->right() looks to be what I'm
after. Sorry for the noise.
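
For anyone who finds this later, here is a minimal sketch of how that might look. The sample HTML is made up to mirror the layout described above, and it assumes TreeBuilder's default behaviour of closing an open <p> when it meets a heading, so that the <h1> and the paragraphs end up as siblings. In list context right() returns all the right-hand siblings, and those can be plain text strings as well as elements, hence the grep on ref.

```perl
use strict;
use warnings;
use HTML::TreeBuilder;

# Made-up sample that mirrors the structure described in the question.
my $html = <<'END_HTML';
<p> Text text text text text
<p> Text text text text text
<h1>Section Heading</h1>
<p> Blah blah blah blah
<p> More blah
END_HTML

my $tree = HTML::TreeBuilder->new_from_content($html);

# Find the first h1 anywhere in the tree.
my $h1 = $tree->look_down( _tag => 'h1' )
    or die "No h1 found\n";

# List context: every sibling to the right of the heading.
# Text nodes come back as plain strings, so keep only element objects.
my @following = grep { ref } $h1->right;
my $next      = $following[0];

if ($next) {
    printf "Next tag after the heading: <%s> %s\n",
        $next->tag, $next->as_trimmed_text;
}
else {
    print "The heading has no element after it\n";
}
```

Call $tree->delete once you are finished with $next and the tree (older versions of the module need this to free the memory, which matters with a 6 MB file). If the generated file wraps things differently, for example with the heading nested inside another element, the siblings will not be the paragraphs and you would need to walk up with parent() first.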

Regards,
