Xerces parse aborted on IOError

J

Jim Cobban

Due to a problem which I am discussing on another thread the UTF-8 text in
my XML file is being corrupted.

The problem that leaves me with is that as soon as the Xerces parser hits
the bad UTF-8 character it throws:
java.io.UTFDataFormatException: invalid byte 2 of 3-byte UTF-8 sequence

and the parse is aborted.

This seems overkill.

Short of going in and modifying my copy of Xerces is there any way to get it
to keep on parsing the XML file past this error? Since this is an IOError,
not a SAXParseException it is not reported to the ErrorHandler interface.

--
Jim Cobban (e-mail address removed)
34 Palomino Dr.
Kanata, ON, CANADA
K2M 1M1
+1-613-592-9438
 

Ask a Question

Want to reply to this thread or ask your own question?

You'll need to choose a username for the site, which only take a couple of moments. After that, you can post your question and our members will help you out.

Ask a Question

Members online

No members online now.

Forum statistics

Threads
473,995
Messages
2,570,226
Members
46,815
Latest member
treekmostly22

Latest Threads

Top