unicode encoding problem

garykpdx · Apr 28, 2005

Every time I think I understand unicode, I prove I don't.

I created a variable in interactive mode like this:
s = u'ä'
where this character is the a-umlaut
that worked alright. Then I encoded it like this:
s.encode( 'latin1')

and it printed out a sigma (totally wrong)

then I typed this:
s.encode( 'utf-8')

Then it gave me two weird characters +ñ

So how do I tell what encoding my unicode string is in, and how do I
retrieve that when I read it from a file?

=?ISO-8859-1?Q?=22Martin_v=2E_L=F6wis=22?= · Apr 28, 2005

So how do I tell what encoding my unicode string is in, and how do I
retrieve that when I read it from a file?

In interactive mode, you best avoid non-ASCII characters in a Unicode
literal.

In theory, Python should look at sys.stdin.encoding when processing
the interactive source. In practice, various Python releases ignore
sys.stdin.encoding, and just assume it is Latin-1. What is
sys.stdin.encoding on your system?

Regards,
Martin

Christos TZOTZIOY Georgiou · May 11, 2005

In theory, Python should look at sys.stdin.encoding when processing
the interactive source. In practice, various Python releases ignore
sys.stdin.encoding, and just assume it is Latin-1. What is
sys.stdin.encoding on your system?

The difference between theory and practice is that in theory there is no
difference.

Preserving unicode filename encoding	1	Oct 20, 2012
A few questiosn about encoding	103	Jun 9, 2013
escaping/encoding/formatting in python	9	Apr 6, 2012
Encoding of surrogate code points to UTF-8	14	Oct 8, 2013
Unicode questions	17	Oct 19, 2010
Ascii to Unicode.	16	Jul 28, 2010
Unicode/ascii encoding nightmare	19	Nov 6, 2006
Ascii to Unicode.	4	Jul 28, 2010

unicode encoding problem

garykpdx

=?ISO-8859-1?Q?=22Martin_v=2E_L=F6wis=22?=

Christos TZOTZIOY Georgiou

Ask a Question

Similar Threads

Members online

Forum statistics

Latest Threads