D
Darel Finkbeiner
This may be the wrong group, so let me know.
My "problem" is this: I am writing my commentary in DocBook 5 and
using the program xsltproc and the docbook5 XSL stylesheets to produce
XHTML output. Since it is a commentary, it has both English and
polytonic Greek with combining diacritics in it. My console and VIM
are both perfectly configured to allow me to edit such documents in a
very natural and easy way, and one in which I can actually read the
Greek that I've typed in.
After processing with xsltproc, all of my beautiful UTF-8 encoded
Greek is being transformed into butt-ugly entity references.
Now, I suppose, "technically speaking", this isn't an issue when
viewing the html document in a browser.... maybe. But I like to be
able to view and "debug" the resulting file in a text editor as I want
to ... additionally, how am I to be sure that the "correct" UTF-8
code points are being used for crucial combining marks ( and by
"correct", I mean the exact code points that I have chosen to use,
since there are alternatives in the unicode standard )? I
specifically chose XHTML output because it is natively UTF-8, so why
convert them to entities in the first place?
My question is, how do I turn off this "feature"? Or can I? Or
should I use a different XSLT processor?
My "problem" is this: I am writing my commentary in DocBook 5 and
using the program xsltproc and the docbook5 XSL stylesheets to produce
XHTML output. Since it is a commentary, it has both English and
polytonic Greek with combining diacritics in it. My console and VIM
are both perfectly configured to allow me to edit such documents in a
very natural and easy way, and one in which I can actually read the
Greek that I've typed in.
After processing with xsltproc, all of my beautiful UTF-8 encoded
Greek is being transformed into butt-ugly entity references.
Now, I suppose, "technically speaking", this isn't an issue when
viewing the html document in a browser.... maybe. But I like to be
able to view and "debug" the resulting file in a text editor as I want
to ... additionally, how am I to be sure that the "correct" UTF-8
code points are being used for crucial combining marks ( and by
"correct", I mean the exact code points that I have chosen to use,
since there are alternatives in the unicode standard )? I
specifically chose XHTML output because it is natively UTF-8, so why
convert them to entities in the first place?
My question is, how do I turn off this "feature"? Or can I? Or
should I use a different XSLT processor?