Does anyone know of any Java packages that can read HTML (from a
file or as a string input) and return it as 'well-formatted' HTML?
TagSoup, JTidy and NekoHTML can all be used this way. IME, TagSoup is the
most liberal parser, and NekoHTML is the most strict, but still deals well
with HTML as it is found in the wild.
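
As a rough illustration of the idea, here is a minimal sketch using JTidy. It assumes a JTidy jar is on the classpath; the class name TidyExample and the sample input are made up for the example, and the option settings are just one reasonable choice, not the only way to configure it.

```java
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.nio.charset.StandardCharsets;

import org.w3c.tidy.Tidy;

// Hypothetical helper class; requires the JTidy library (not part of the JDK).
public class TidyExample {

    /** Parses possibly malformed HTML and returns it re-serialized as XHTML. */
    public static String tidy(String html) {
        Tidy tidy = new Tidy();
        tidy.setXHTML(true);            // write the result as XHTML
        tidy.setQuiet(true);            // suppress the summary banner
        tidy.setShowWarnings(false);    // do not print warnings about the repairs
        tidy.setInputEncoding("UTF-8");
        tidy.setOutputEncoding("UTF-8");

        ByteArrayInputStream in =
                new ByteArrayInputStream(html.getBytes(StandardCharsets.UTF_8));
        ByteArrayOutputStream out = new ByteArrayOutputStream();

        // Parse the input and pretty-print the repaired document to 'out'.
        tidy.parse(in, out);

        return new String(out.toByteArray(), StandardCharsets.UTF_8);
    }

    public static void main(String[] args) {
        // Unclosed <b> and <p>, no <html>/<body>: typical "wild" markup.
        System.out.println(tidy("<p>Hello <b>world"));
    }
}
```

To read from a file instead, a FileInputStream can be passed to parse() in place of the ByteArrayInputStream. TagSoup and NekoHTML work differently: they act as SAX/DOM parsers, so you would feed their output to a serializer of your choice rather than get a string back directly.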
tom
--
Formal logical proofs, and therefore programs - formal logical proofs
that particular computations are possible, expressed in a formal system
called a programming language - are utterly meaningless. To write a
computer program you have to come to terms with this, to accept that
whatever you might want the program to mean, the machine will blindly
follow its meaningless rules and come to some meaningless conclusion. --
Dehnadi and Bornat