utf8

Helmut Richter · May 16, 2013

Doesn't this 'delete content and reply to a more convenient
fabrication' trick become boring over time?

,----
| The sole purpose it is supposed to fulfil here is to
| suggest that an opinion about something which happens to conflict with
| some other opinion would somehow conflict with the mentioned 'good
| programming practice' without detailing how exactly.
`----

I did not want to respond to your allegation of motives that are not mine.

I'm going to ignore the rest of this text because you aren't telling
the truth, you know that, I know that, and you know that I know that.

Interesting twist.

Helmut Richter · May 16, 2013

[...]

I'm going to ignore the rest of this text because you aren't telling
the truth, you know that, I know that, and you know that I know that.

Addition: A discussion of the relative merits of either approach for
handling 'extended characters' could be interesting. However, I'm not
interested in trying to argue for both sides, i.e., against my own
standpoint, and these "the Gods have chosen wisely and now it is for
the mortals to obey" declarations of faith (or fandom) are pointless.

My wording of "the Gods have chosen wisely and now it is for the mortals
to obey" was "I, too, have doubts that they chose the best solution."

I only have much more serious doubts that your idea of publishing
implementation details as an interface would have been better.

Another example: I mostly use Emacs as my text editor. I know that
when I type the character "ä" or "§" while entering text, exactly this
character will appear in the file, in the encoding I choose when saving
the file. I have no idea how this character is stored internally while
Emacs is running. And that's absolutely fine with me. Why should Perl not
do likewise?

Rainer Weikusat · May 16, 2013

Helmut Richter said:
[...]

I'm going to ignore the rest of this text because you aren't telling
the truth, you know that, I know that, and you know that I know that.

Addition: A discussion of the relative merits of either approach for
handling 'extended characters' could be interesting. However, I'm not
interested in trying to argue for both sides, i.e., against my own
standpoint, and these "the Gods have chosen wisely and now it is for
the mortals to obey" declarations of faith (or fandom) are pointless.

My wording of "the Gods have chosen wisely and now it is for the mortals
to obey" was "I, too, have doubts that they chose the best solution."

I only have much more serious doubts that your idea of publishing
implementation details as an interface would have been better.

But this isn't my idea; that's just a totally generic label you have
chosen to attach to a certain standpoint regarding how 'unicode
strings' should be handled. It is also wrong to refer to this as 'my
idea', since it isn't my idea, and to refer to it as unpublished,
because it *is* part of the published documentation of perl. For
instance, to this day, the perlguts manpage contains the following
text:

To fix this, some people formed Unicode, Inc. and produced a
new character set containing all the characters you can
possibly think of and more. There are several ways of
representing these characters, and the one Perl uses is called
UTF-8. UTF-8 uses a variable number of bytes to represent a
character.
http://perldoc.perl.org/perlguts.html#Unicode-Support
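
To make the "variable number of bytes" point concrete, here is a minimal
sketch using the core Encode module. The helper name utf8_length and the
sample characters are made up for this illustration; it only shows the
length of the encoded form, not how perl stores a string internally.

```perl
use strict;
use warnings;
use Encode qw(encode);

# Hypothetical helper: number of bytes the UTF-8 encoding of a
# character string occupies.
sub utf8_length {
    my ($chars) = @_;
    return length(encode('UTF-8', $chars));
}

# Characters are given by code point so the example does not depend
# on the encoding of this source file.
my @samples = (
    [ 'LATIN A',      "\x{41}"    ],
    [ 'A DIAERESIS',  "\x{E4}"    ],   # the "ä" from the Emacs example
    [ 'SECTION SIGN', "\x{A7}"    ],   # the "§" from the Emacs example
    [ 'EURO SIGN',    "\x{20AC}"  ],
    [ 'GRINNING FACE', "\x{1F600}" ],
);

for my $sample (@samples) {
    my ($name, $char) = @$sample;
    printf "U+%04X %-14s %d byte(s)\n", ord($char), $name, utf8_length($char);
}
```

Run as is, this reports 1, 2, 2, 3 and 4 bytes for the five samples.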

Another example: I mostly use Emacs as my text editor. I know that
when I type the character "ä" or "§" while entering text, exactly this
character will appear in the file, in the encoding I choose when saving
the file. I have no idea how this character is stored internally while
Emacs is running. And that's absolutely fine with me. Why should Perl not
do likewise?

Because Perl is a programming language and not a text editor, and
depending on the kind of program, different strategies for UTF-8
decoding might make sense. A nice discussion of this is available in
the 'Converting the tools' section of this paper:

http://plan9.bell-labs.com/sys/doc/utf.html
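
As a rough illustration of what "different strategies" can mean in Perl
terms (my own sketch, not taken from the paper; the sample data and
variable names are invented): a program that only splits on an ASCII
delimiter can work on the raw octets, while one that counts characters
has to decode first.

```perl
use strict;
use warnings;
use Encode qw(decode);

# Raw UTF-8 octets for "ä:§", written out byte by byte.
my $octets = "\xC3\xA4:\xC2\xA7";

# Strategy 1: no decoding at all. Splitting on an ASCII delimiter is
# safe on raw UTF-8 because bytes below 0x80 never occur inside a
# multi-byte sequence.
my @fields = split /:/, $octets;

# Strategy 2: decode first, then work with characters.
my $text  = decode('UTF-8', $octets);
my $chars = length $text;      # counts characters
my $bytes = length $octets;    # counts octets

printf "%d fields, %d characters, %d bytes\n",
    scalar(@fields), $chars, $bytes;
# prints "2 fields, 3 characters, 5 bytes"
```

Which of the two is appropriate depends on what the program does with
the text, which is the point being made above.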
