Iain
Folks,
I'm having a problem with charset encodings that I desperately need some
help with. I don't even pretend to know the basics about charsets, so
please forgive my ignorance.
I am transforming XML source into XHTML using an encoding of iso-8859-1
and when I browse (using Mozilla 1.x) I see strange, accented 'A'
characters preceding some characters generated from an entity
reference. If I use utf-8, things get a lot worse: even my
characters get prefixed with the accented junk.
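For illustration only, here is a minimal sketch of one way a stray accented 'A' can appear. It assumes (not confirmed for my case) that the output bytes are UTF-8 while the browser decodes them as iso-8859-1, and that the entity involved is a non-breaking space; the sample string is hypothetical, and I've used Python rather than Perl just to keep the sketch short.

```python
# Hypothetical sample text: 'a', a non-breaking space (U+00A0), 'b'.
# Assumption: the entity in question expands to U+00A0.
text = "a\u00a0b"

# Serialised as UTF-8, U+00A0 becomes the two bytes 0xC2 0xA0.
utf8_bytes = text.encode("utf-8")

# If those bytes are then decoded as iso-8859-1, every byte is treated
# as a character of its own, so 0xC2 shows up as U+00C2 (A with circumflex).
misread = utf8_bytes.decode("iso-8859-1")

# Decoding with the encoding that was actually used restores the text.
roundtrip = utf8_bytes.decode("utf-8")

print(repr(text), repr(misread), repr(roundtrip))
```

If that assumption holds, the junk would come from a mismatch between the encoding the bytes were written in and the encoding the browser was told (or guessed) to use, rather than from the transform itself.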
My resultant XHTML source has the usual XML preamble at the top,
complete with encoding specification; however, it doesn't use <meta/> to
specify the charset -- could this be the cause of my problem?
Basically, because I don't understand this, and because I'd like to, can
someone recommend the practices I should be following when doing these
transforms, especially when using Perl with the XML::LibXML and
XML::LibXSLT modules to manage them?
Ideally, I'd like to use utf-8 (I'm guessing that's the best approach)
but it's been a bit of a non-starter for me.
Hoping someone in c.t.xml or c.l.perl.misc can point me in the best
direction.
Many thanks,
Iain.