vc
Hi,
I'm looking for an XML parser that doesn't stop when it finds a minor error
in its input. I need to parse HTML files, and a lot of HTML pages
don't, for instance, enclose attribute values in quotes.
Also, most HTML pages don't seem to have a single root element (which
could be "html"). Instead, they have a "doctype" declaration before "html", at
the same level, and the XML parsers report a "no root tag found" error.
I have tried 3-4 SAX parsers, but none of them works :-(
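
To make the failure concrete, here is a minimal sketch of the kind of thing I see. It assumes the SAX parser that ships with the JDK (obtained through JAXP); the class name StrictParseDemo, the helper parseError and the sample markup are made up for illustration and are not taken from the pages I actually need to parse.

```java
import java.io.StringReader;
import javax.xml.parsers.SAXParser;
import javax.xml.parsers.SAXParserFactory;
import org.xml.sax.InputSource;
import org.xml.sax.SAXParseException;
import org.xml.sax.helpers.DefaultHandler;

public class StrictParseDemo {

    // Returns null if the parser accepts the markup,
    // otherwise the message of the fatal error it raised.
    static String parseError(String markup) throws Exception {
        SAXParser parser = SAXParserFactory.newInstance().newSAXParser();
        try {
            // DefaultHandler rethrows fatal errors, so parsing stops at the first one.
            parser.parse(new InputSource(new StringReader(markup)), new DefaultHandler());
            return null;
        } catch (SAXParseException e) {
            return e.getMessage();
        }
    }

    public static void main(String[] args) throws Exception {
        // Hypothetical samples: well-formed, unquoted attribute value, unclosed <br>.
        String[] samples = {
            "<html><body><a href=\"page.html\">link</a></body></html>",
            "<html><body><a href=page.html>link</a></body></html>",
            "<html><body><p>one<br>two</p></body></html>"
        };
        for (String sample : samples) {
            String error = parseError(sample);
            System.out.println(error == null ? "accepted" : "rejected: " + error);
        }
    }
}
```

Only the first sample gets through; the other two are rejected at the first well-formedness error, which is the behaviour I would like to avoid for HTML input.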
It would be great if you could recommend a C++ or Java XML parser
(preferably SAX 2.0 compliant).
Thank you in advance,
vc