xml html codes

  • Thread starter Jayme Assuncao Casimiro

Jayme Assuncao Casimiro

I am writing a wrapper to extract data from web sites. I am storing the
data in XML format, but the output is not valid XML because of HTML codes
like the ones below, which are typical of Latin languages:
&aacute;
&atilde;
&oacute;
&ouml;
&eacute;

What should I do to overcome this problem?

+---------------------------------------------+
| Jayme Assuncao Casimiro |
| Graduado em Ciência da Computação |
| Estudante de Mestrado em Computação |
| Universidade Federal de Minas Gerais - UFMG |
+---------------------------------------------+
 

Martin Honnen

Jayme said:
I am writing a wrapper to extract data from web sites. I am storing the
data in XML format, but the output is not valid XML because of HTML codes
like the ones below, which are typical of Latin languages:
&aacute;
&atilde;
&oacute;
&ouml;
&eacute;

What should I do to overcome this problem?

These are references to entities that XHTML defines, so you could make
your DTD declare those entities.
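
As a rough illustration of declaring the entities, here is a minimal Python sketch that puts the declarations in an internal DTD subset and then parses the result. The helper names (entity_declarations, wrap_with_doctype), the root element name "data", and the sample text are made up for the example; the code points come from the standard library's html.entities table. It assumes the extracted text contains only the entities listed in NAMES.

```python
import xml.etree.ElementTree as ET
from html.entities import name2codepoint

# Entity names from the question; extend this list as needed.
# Note: "lt" and "amp" are predefined in XML and should not be redeclared this way.
NAMES = ["aacute", "atilde", "oacute", "ouml", "eacute"]


def entity_declarations(names):
    """Return one <!ENTITY ...> declaration per HTML entity name,
    mapping it to the matching numeric character reference."""
    return "\n".join(
        f'  <!ENTITY {name} "&#{name2codepoint[name]};">' for name in names
    )


def wrap_with_doctype(root_tag, body, names):
    """Wrap already-escaped body text in a root element, preceded by a
    DOCTYPE whose internal subset declares the given entities."""
    subset = entity_declarations(names)
    return f"<!DOCTYPE {root_tag} [\n{subset}\n]>\n<{root_tag}>{body}</{root_tag}>"


# Hypothetical extracted text that still contains HTML entity references.
document = wrap_with_doctype("data", "S&atilde;o Jos&eacute;", NAMES)

# The parser expands the declared entities while reading the document.
root = ET.fromstring(document)
print(root.text)
```

If the entities are not needed in the stored file at all, another option is to decode them with html.unescape before writing the XML, so the file holds the plain characters in its declared encoding.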
 
