SAXParser and preserving special characters

User · Oct 31, 2003

I am trying to use JDOM's SAXBuilder to parse an XML document that contains
Latin-1 characters encoded as numeric character references. After I parse the
document, each reference seems to be replaced with the Unicode character it
names (e.g., the String "&#174;" is replaced with a single character that has
a decimal value of 174); I was expecting that the SAXBuilder would preserve
the String "&#174;". Is it possible to instruct the SAX parser to preserve
these character references?

The following is sample code that illustrates the issue that I am observing:

import java.io.ByteArrayInputStream;

import org.jdom.Document;
import org.jdom.input.SAXBuilder;
import org.jdom.output.XMLOutputter;

public class TestProductBuilder {

    public static void main(String[] args) {
        ByteArrayInputStream bis = null;
        try {
            // The registered-trademark sign is written as a character reference.
            String product = "<?xml version=\"1.0\"?>" +
                "<product>" +
                " <name>My Product &#174;</name>" +
                "</product>";

            bis = new ByteArrayInputStream(product.getBytes());
            SAXBuilder builder = new SAXBuilder(false); // no validation
            Document productDoc = builder.build(bis);

            // Tab indentation, with newlines between elements.
            XMLOutputter outputter = new XMLOutputter("\t", true);
            String productFromSAXBuilder = outputter.outputString(productDoc);
        } catch (Exception e) {
            System.err.println(e.getMessage());
        } finally {
            // Ignore any failure while closing the stream.
            if (bis != null) { try { bis.close(); } catch (Exception e) {} }
        }
    }
}

The following is the value for "productFromSAXBuilder":
<?xml version="1.0" encoding="UTF-8"?>
<product>
<name>My Product ®</name>
</product>
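
For comparison, the same behaviour can be reproduced without JDOM, using only the SAX parser that ships with the JDK. The following is a minimal sketch, not JDOM's API: the class name CharRefDemo and the helpers parseText and escapeNonAscii are hypothetical names made up for this illustration. It shows that the parser hands "&#174;" to the application as the single character U+00AE, and that the reference can only be written back by re-escaping non-ASCII characters on output.

```java
import java.io.ByteArrayInputStream;
import java.nio.charset.StandardCharsets;

import javax.xml.parsers.SAXParserFactory;

import org.xml.sax.helpers.DefaultHandler;

public class CharRefDemo {

    // Hypothetical helper: parse with the JDK's SAX parser and collect all
    // character data reported through the characters() callback.
    public static String parseText(String xml) throws Exception {
        final StringBuilder text = new StringBuilder();
        DefaultHandler handler = new DefaultHandler() {
            @Override
            public void characters(char[] ch, int start, int length) {
                // Character references have already been resolved here.
                text.append(ch, start, length);
            }
        };
        SAXParserFactory.newInstance().newSAXParser().parse(
                new ByteArrayInputStream(xml.getBytes(StandardCharsets.UTF_8)),
                handler);
        return text.toString();
    }

    // Hypothetical helper: rewrite every non-ASCII code point as a decimal
    // character reference. It does not escape '&' or '<'; a real serializer
    // would have to handle those as well.
    public static String escapeNonAscii(String s) {
        StringBuilder out = new StringBuilder();
        for (int i = 0; i < s.length(); ) {
            int cp = s.codePointAt(i);
            if (cp > 127) {
                out.append("&#").append(cp).append(';');
            } else {
                out.appendCodePoint(cp);
            }
            i += Character.charCount(cp);
        }
        return out.toString();
    }

    public static void main(String[] args) throws Exception {
        String xml = "<?xml version=\"1.0\"?>"
                + "<product><name>My Product &#174;</name></product>";
        String parsed = parseText(xml);
        // Numeric value of the last parsed character.
        System.out.println((int) parsed.charAt(parsed.length() - 1));
        // The text with non-ASCII characters turned back into references.
        System.out.println(escapeNonAscii(parsed));
    }
}
```

Run as written, this prints 174 followed by "My Product &#174;". The parsed text itself never contains the reference, which suggests the original spelling is not available after parsing and has to be regenerated when the document is written out.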
