Disambiguating separated node sets

Andy Dingley · Oct 20, 2005

I have a publishing application. The database layer queries various
sources and produces an XML document, then XSLT processes this into HTML
or RSS.

In this particular case, several queries fetch the latest articles from
various "chapters" (news, reviews, etc.), and the results are placed as
sub-trees in the XML. I then need to generate a single list of articles
by selecting the articles from each sub-tree.

It's possible for an article to be "multi-homed", so it might appear in
two chapters. My output must filter these, so that an article appears no
more than once.

As the XML document contains duplicated articles (duplicates of single
database articles), it's not possible to use generate-id() here. Instead
I must use the @articleid attribute. I'm also finding it impractical to
use a preceding-sibling:: axis, because these articles come from
disjoint sub-trees and so aren't siblings. For platform portability
reasons, I'm reluctant to use node-set().

What's the best way to disambiguate these?

At present it works, but the code is a mess. Rather than filtering them
and then passing the filtered set neatly to the output routine, I'm
having to pass the set with duplicates to a named template, then filter
it inside that, using position(). I'd prefer to decouple the filter and
the loop processing, for other reasons of good code structure.

Thanks for any comments

<xsl:variable name="items" select="$item-headline-article
| $items-news [position () &lt;= 4]
| $items-reviews [position () &lt;= 3]
| $items-competition" />

[...]

<xsl:for-each select="$items" >
<xsl:variable name="entry" select="." />
<xsl:variable name="articleid" select="$entry/@articleid" />
<xsl:variable name="idx" select="position ()" />

<xsl:if test="not ($entries [($articleid = ./@articleid)
and (position() &lt; $idx) ] ) " >

[...]
</xsl:if>
</xsl:for-each>
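
One possible way to separate the filter from the loop without node-set() is a key on @articleid combined with a set-membership test. The sketch below is only an illustration: the /page/news/article and /page/reviews/article paths, the key name by-articleid and the unique-items variable are assumptions, not taken from the application described above. It keeps an item only when it is the first node in $items, in document order, carrying its articleid, so a duplicate that was left out by the position() limits cannot mask one that was selected.

```xml
<?xml version="1.0"?>
<xsl:stylesheet version="1.0"
    xmlns:xsl="http://www.w3.org/1999/XSL/Transform">

  <!-- Index every element that carries an articleid by that value. -->
  <xsl:key name="by-articleid" match="*[@articleid]" use="@articleid" />

  <xsl:template match="/">
    <!-- Hypothetical source layout: /page/news/article, /page/reviews/article. -->
    <xsl:variable name="items"
        select="/page/news/article[position() &lt;= 4]
              | /page/reviews/article[position() &lt;= 3]" />

    <!-- For each item: take all nodes with the same articleid, keep those
         that are members of $items, pick the first in document order,
         and test whether the item is that very node. -->
    <xsl:variable name="unique-items"
        select="$items[count(. | key('by-articleid', @articleid)
                                   [count(. | $items) = count($items)][1]) = 1]" />

    <list>
      <xsl:for-each select="$unique-items">
        <item articleid="{@articleid}" />
      </xsl:for-each>
    </list>
  </xsl:template>

</xsl:stylesheet>
```

Because $unique-items is an ordinary node-set of source nodes, it can be passed to a named template as a parameter, and the loop there no longer needs its own position() test.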

Gomolyako Eduard · Oct 20, 2005

I had the same problem, and here is how I handled it:

xml:
<root>
<item id="1" />
<item id="2" />
<item id="3" />
<item id="2" />
</root>

As I understand it, you want to get something like this:
<another_root>
<item id="1" />
<item id="2" />
<item id="3" />
</another_root>

xslt:

<xsl:template match="root">
<another_root>
<xsl:apply-templates select="item[1]">
<xsl:with-param name="handled-items" select="string('')" />
</xsl:apply-templates>
</another_root>
</xsl:template>

<xsl:template match="item">
<xsl:param name="handled-items" />

<!-- Delimit the id on both sides so that ".1." cannot match inside ".12." -->
<xsl:variable name="id" select="concat('.', @id, '.')" />

<xsl:if test="not(contains(string($handled-items), $id))">
<xsl:copy-of select="." />
</xsl:if>

<!-- Always move on to the next sibling, so that a duplicate in the
middle of the list does not stop the walk -->
<xsl:apply-templates select="following-sibling::item[1]">
<xsl:with-param name="handled-items"
select="concat(string($handled-items), $id)" />
</xsl:apply-templates>
</xsl:template>
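
When all the candidates really are siblings, as in this small example, the recursive template above can probably be replaced by a single predicate. This is a minimal sketch for that sibling-only case, reusing the root, item and another_root names from the example. It does not address disjoint sub-trees.

```xml
<?xml version="1.0"?>
<xsl:stylesheet version="1.0"
    xmlns:xsl="http://www.w3.org/1999/XSL/Transform">

  <xsl:template match="root">
    <another_root>
      <!-- Copy an item only when no earlier sibling carries the same id. -->
      <xsl:copy-of select="item[not(@id = preceding-sibling::item/@id)]" />
    </another_root>
  </xsl:template>

</xsl:stylesheet>
```

The comparison @id = preceding-sibling::item/@id is true as soon as any earlier sibling has an equal id, so only the first occurrence of each id is copied.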

I hope this helps you.

Best, Ed.

Dimitre Novatchev · Oct 20, 2005

Why do so many people forget to provide a source XML document (as
minimal as possible)?

Cheers,
Dimitre Novatchev.
