Parsing content between sibling nodes in poorly formed documents

alex.cline · Sep 18, 2007

I'm trying to write an XSL parser to extract all information between
two sibling nodes. The problem is, the documents are very poorly
formatted and that's where I'm running into difficulty. Here is an
example of the layout of the document:

<root>
<header>
content
</header>
<node />

content to extract.

<node />
<footer>
content
</footer>
</root>

I'm trying to extract all content between the two <node /> elements.
The data can include either text or other nested elements.

Thank you.

-- Alex

David Carlisle · Sep 18, 2007

I'm trying to write an XSL parser to extract all information between
two sibling nodes. The problem is, the documents are very poorly
formatted and that's where I'm running into difficulty. Here is an
example of the layout of the document:

<root>
<header>
content
</header>
<node />

content to extract.

<node />
<footer>
content
</footer>
</root>

I'm trying to extract all content between the two <node /> elements.
The data can include either text or other nested elements.

Thank you.

-- Alex

/root/*[preceding-sibling::node and following-sibling::node]

David

Joseph Kesselman · Sep 18, 2007

I'm trying to write an XSL parser

Uhm. Terminology problem there. Do you mean an XML parser, or an XSLT
processor, or an XSLT stylesheet?

Hard to give you a good answer without being sure what question you're
asking.

alex.cline · Sep 27, 2007

/root/*[preceding-sibling::node and following-sibling::node]

David

Thanks. Your solution led me to mine:

/root/*[preceding-sibling::node and following-sibling::node]|/root/
text()[preceding-sibling::node and following-sibling::node]

XPath question: selecting content between two nodes	3	Mar 7, 2006
Using a variable to tell xsl:for-each which nodes to select	0	Oct 4, 2008
XSLT Extract Text from Nodes	9	Oct 10, 2006
Selecting a set of nodes	1	Oct 13, 2006
XSLT Compare two documents and output differences	4	Jun 22, 2007
combining two documents	2	Jul 31, 2003
HTML Parsing Question	2	Dec 31, 2006
MiniQuiz : Renesting Nodes (OWLScratch)	1	Jun 23, 2005

Parsing content between sibling nodes in poorly formed documents

alex.cline

David Carlisle

Joseph Kesselman

alex.cline

Ask a Question

Similar Threads

Members online

Forum statistics

Latest Threads