unicode, C++, python 2.2

Guest · Sep 9, 2005

I am currently writing a python interface to a C++ library. Some of the
functions in this library take unicode strings (UTF-8, mostly) as arguments.

However, when getting these data I run into a problem on Python 2.2
(RHEL3) - while the data is all nice UCS4 in 2.3, in 2.2 it seems to be
UTF-8 on top of UCS4. That is, UTF-8 encoded in UCS4, meaning that 3 bytes
of each UCS4 char are 0 and the first one contains one byte of the
string's UTF-8 encoding.

Is there a trick to get python 2.2 to do UCS4 more cleanly?
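
One possible reading of the symptom described above, sketched in modern Python 3 rather than 2.2: each byte of the UTF-8 encoding ends up as its own UCS4 code unit, which is what happens when UTF-8 bytes are widened as if they were Latin-1 characters. The helper names `widen_utf8_bytes` and `repair_widened` are hypothetical, and whether this matches what the C++ interface actually does is an assumption.

```python
# Illustration only: reproduces the described symptom in Python 3.
# It is not the poster's Python 2.2 / C++ code; helper names are made up.

def widen_utf8_bytes(text):
    """Return a string whose code points are the UTF-8 bytes of `text`."""
    # Decoding UTF-8 bytes as Latin-1 maps each byte 0xNN to code point U+00NN.
    return text.encode("utf-8").decode("latin-1")


def repair_widened(text):
    """Undo widen_utf8_bytes: reinterpret the code points as UTF-8 bytes."""
    return text.encode("latin-1").decode("utf-8")


original = "\u00e5"                    # one code point: LATIN SMALL LETTER A WITH RING
broken = widen_utf8_bytes(original)    # two code points: U+00C3, U+00A5

# Serialized as little-endian UCS4, every unit holds one UTF-8 byte
# followed by three zero bytes.
ucs4_units = broken.encode("utf-32-le")
print(ucs4_units.hex())                # c3000000a5000000
print(repair_widened(broken) == original)
```

If the data really has this shape, the round trip through Latin-1 recovers the intended string; if it does not, the first thing to check is where the UTF-8 bytes are being converted to a Unicode object without being decoded as UTF-8.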

Guest · Sep 11, 2005

Trond said:
I am currently writing a python interface to a C++ library. Some of the
functions in this library take unicode strings (UTF-8, mostly) as
arguments.

However, when getting these data I run into a problem on Python 2.2
(RHEL3) - while the data is all nice UCS4 in 2.3, in 2.2 it seems to be
UTF-8 on top of UCS4. That is, UTF-8 encoded in UCS4, meaning that 3 bytes
of each UCS4 char are 0 and the first one contains one byte of the
string's UTF-8 encoding.

Is there a trick to get python 2.2 to do UCS4 more cleanly?

It's hard to tell from your message what your problem really is, as we
have no clue what "these data" are. How do you know they are "nice
UCS4" in 2.3? Are you looking at the internal representation at the
C level, or are you looking at something else? Do you use byte strings
or Unicode strings?

You tried to explain what "UTF8 encoded in UCS4" might be, but I'm
not sure I understand the explanation: what precise sequence of
statements did you use to create such a thing, and what precisely
does it look like (what exact byte is first, what is second, and so
on)?

Regards,
Martin
