HTML entity to Unicode


zunbeltz

Hi,

I'm parsing HTML. I have a page with a lot of HTML entities for Hebrew
characters. When I print them, what I get are blanks, dots and commas. How
can I decode these entities to Unicode characters?

TIA

Zunbeltz
 

Peter Maas

From the Python documentation:

13.4 htmlentitydefs -- Definitions of HTML general entities

[...]

name2codepoint
A dictionary that maps HTML entity names to the Unicode codepoints. New in version 2.3.

codepoint2name
A dictionary that maps Unicode codepoints to HTML entity names. New in version 2.3.

Peter Maas Aachen
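
A minimal sketch of how the name2codepoint dictionary quoted above could be used, written for Python 3, where the htmlentitydefs module was renamed to html.entities. The helper name decode_entities and the regular expression are illustrative assumptions, not part of the standard library. Hebrew letters generally appear as numeric references such as &#1513; rather than as named entities, so the sketch handles decimal and hexadecimal references as well as named ones, and leaves anything it does not recognise untouched.

```python
import re
from html.entities import name2codepoint

# Matches decimal (&#1513;), hexadecimal (&#x5E9;) and named (&eacute;) references.
_ENTITY_RE = re.compile(r"&(#[0-9]+|#[xX][0-9a-fA-F]+|[A-Za-z][A-Za-z0-9]*);")


def decode_entities(text):
    """Replace HTML character references in text with Unicode characters.

    Hypothetical helper for illustration only.
    """
    def replace(match):
        ref = match.group(1)
        try:
            if ref[:2] in ("#x", "#X"):
                return chr(int(ref[2:], 16))      # hexadecimal reference
            if ref.startswith("#"):
                return chr(int(ref[1:]))          # decimal reference
            return chr(name2codepoint[ref])       # named entity
        except (KeyError, ValueError, OverflowError):
            return match.group(0)                 # unknown or invalid: keep as-is
    return _ENTITY_RE.sub(replace, text)


decoded = decode_entities("&#1513;&#1500;&#1493;&#1501;")
# Show code points instead of the raw text, so the output does not
# depend on the terminal's encoding.
print([hex(ord(c)) for c in decoded])  # ['0x5e9', '0x5dc', '0x5d5', '0x5dd']
```

On Python 3.4 and later, html.unescape from the standard library performs this decoding directly and is probably the simpler choice. If the decoded text still prints as blanks or dots, the terminal encoding or font is a likely cause rather than the decoding step.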
 
