Python, Dutch, English, Chinese, Japanese, etc.

S

Steve Howell

The never-ending debate about PEP 3131 got me thinking
about natural languages with respect to Python, and I
have a bunch of mostly simple observations (some
factual, some anecdotal). I present these mostly as
food for thought, but I do make my own
continent-by-continent recommendations at the bottom
of the email. (My own linguistic biases are also
disclosed at the bottom of the email.)

Nationality of various technologists who use English
to some degree (keywords in their languages, etc.):

van Rossum -- Dutch-born, now lives in California
Wall -- American
Matz -- Japanese
Ritchie -- American
Stroustroup -- Danish-born, lives in Texas
Gosling -- Canadian
McCarthy -- American
Torvalds -- Finnish-born (but family spoke
Swedish), lives in Oregon
Stallman -- American
Berners-Lee -- English-born, did major work in
Geneva

A sampling of largish countries where English is
fairly widely known:

United States (82% of inhabitants speak it at
home), Canada, United Kingdom, Australia, South
Africa, India

about China:
largest country in the world by population
Mandarin Chinese has 850 million speakers
written Chinese dates back 4000 years, employs
5000 characters

about India:

second largest country in the world by population
official languages: Hindi, English, and 21 others
major software outsourcing center (anecdotal)
Hindi is Indo-European language with distinctively
different alphabet from English

about Japan:
10th largest population
world leader in robotics
Japanese language mostly spoken in Japan
major linguistic influences: Chinese, English,
Dutch
kanji = Chinese characters
hiragana and katakana -- syllabic scripts
Latin alphabet often used in modern Japanese (see
wikipedia)

some European alphabets:

Spanish -- accented, includes digraphs ch and ll
German -- accented
French -- accented
Italian -- accented, no J/K/W/X/Y

Bringing Python to the world (all opinion here):

Even in English-speaking countries, Python is
greatly underutilized.

Even in environments where programmers commonly use
ASCII encoding, Python is greatly underutilized.

Any focus on the current English/ASCII bias of
Python should mostly concern Asia, due its large
population, the 80/20 rule, the prevalence of
different writing systems in large Asian countries,
Asia's influence on technology in general, etc. (not
to mention Ruby!)

Asia:

Python should be *completely* internationalized for
Mandarin, Japanese, and possibly Hindi and Korean.
Not just identifiers. I'm talking the entire
language, keywords and all.

Europe:

Lobby EU for more funding for PyPy. Promote
cultural acceptance of English-ized spelling in the
context of writing software programs.

North America:

Marketing, marketing, marketing.

South America:

Focus first on translating Python documents, books,
etc. to Spanish.

Africa:

write Python code for the XO-1 (aka $100 laptop)

Australia:

no worries

Antartica:

more Penguins than people

My linguistic biases:

1) I speak American English natively.
2) I live in a very multilingual city.
3) I took 6 years of French in high school, but I
get very little exposure to the language in my
day-to-day life.
4) I hear a LOT of Spanish in day-to-day life, and
I have first semester literacy.
5) I have never learned Arabic, Mandarin,
Japanese, just to name a few major world languages.
6) I have written software that has been
translated from English to other languages, but I only
once been the primary person to do the actual
internationalization, and it was a small project.
7) Lots of U.S.-based programmers that I have
worked with speak English as their second or third
language.











____________________________________________________________________________________
Yahoo! oneSearch: Finally, mobile search
that gives answers, not web links.
http://mobile.yahoo.com/mobileweb/onesearch?refer=1ONXIC
 
R

Ross Ridge

Steve Howell said:
about Japan:
major linguistic influences: Chinese, English,
Dutch

English and Dutch are minor linguistic influences.
kanji = Chinese characters
hiragana and katakana -- syllabic scripts
Latin alphabet often used in modern Japanese (see
wikipedia)

The Latin alphabet is generally only used for western or westernized
names, like Sony.
Asia:

Python should be *completely* internationalized for
Mandarin, Japanese, and possibly Hindi and Korean.
Not just identifiers. I'm talking the entire
language, keywords and all.

This would be more convincing if it came from someone who spoke Mandarin,
Japanese, Hindi or Korean.

btw. Mandarin is a spoken dialect Chinese, what you're actually asking
for is a Simplified-Chinese version of Python.

Ross Ridge
 
M

montyphyton

Steve Howell je napisao/la:
some European alphabets:

Spanish -- accented, includes digraphs ch and ll
German -- accented
French -- accented
Italian -- accented, no J/K/W/X/Y

what about slavic languages?
in croatian you have five accented letters plus three letters for
digrahps. russian, bulgarian, serbian, macedonian, ukranian etc. use
cyrilic alphabet (lets not forget that russia isn't that small -
around 150 million people), polish also has some of its own
characters...
all in all, it is estimated that some 400 million people speak slavic
languages...
 
M

Méta-MCI

Et le klingon ?

Please, don't forget klingons
SVP, n'oubliez pas les klingons

;o)
 
M

montyphyton

Agreed, but FWIW, if you compared Slavic-writing
people to Chinese-writing people, I would think that a
higher percentage of Slavic-writing people would be
bilingual in terms of their ability to write code in
non-Slavic alphabets, due to various
cultural/geographical factors.

of course. but maybe it would be a nice effort to enable writing code
in cyrillic, since it is a whole new alphabet. for the accented
letters from slavic (or other) languages, i agree that one wouldn't
gain much from enabling their use in source code.
but my point being, if we are going to add chinese and japanese, why
not do everything right and add all languages/alphabets? after all,
after adding chinese, how hard can it be to add a few accedented
letters :)
I don't predict a huge upswing in Slavic-writing
Python programmers after PEP 3131, even among
children.

you are probably right.
 
M

montyphyton

Méta-MCI je napisao/la:
Et le klingon ?

Please, don't forget klingons
SVP, n'oubliez pas les klingons

;o)

je pense que le klingon utilise les mems lettres comme l'anglais
 
C

Cameron Laird

English and Dutch are minor linguistic influences.
.
.
.
Korean's arguably more important in Japanese philology than
Dutch. Portuguese and Spanish are also present, at a level
somewhat below that of Dutch.
 
O

olive

Lol!

What is a "sharp hair boss" ?

My boss does not look like a punk !

But he does want me to dance "la Java".
 
A

ahlongxp

English and Dutch are minor linguistic influences.
Obviously. But language evolves.
This would be more convincing if it came from someone who spoke Mandarin,
Japanese, Hindi or Korean.
I'm a Chinese.
Language/English is really a big problem for Chinese programmers.
If python can be written in Chinese, it may become the most popular
program language in China(though popular alreay).
Considering the potential large amount of users in China, the effort
of internationalization for Chinese will totally worth.
btw. Mandarin is a spoken dialect Chinese, what you're actually asking
for is a Simplified-Chinese version of Python.

Mandarin is not a friendly way of saying Chinese and it is totally
unacceptable in some area.
Either Simplified Chinese or Traditional Chinese will be better.

and last but not least, python ROCKS.
 
C

Cameron Laird

.
.
.
I'm a Chinese.
Language/English is really a big problem for Chinese programmers.
If python can be written in Chinese, it may become the most popular
program language in China(though popular alreay).
Considering the potential large amount of users in China, the effort
of internationalization for Chinese will totally worth.
.
.
.
Tcl can be (more-or-less) written in Chinese now. How popular is it
among Chinese-speaking developers?
 
A

ai

I am not a native English speaker. But I totally do not support PEP
3131. If a program is written in English and commented by other
language, I am read it. But if a program is written in other language,
it will be full unreadable by me even it is commented by English. I
think language is just a tool used for intercommunion. The best
situation is all people use only one language.
 

Ask a Question

Want to reply to this thread or ask your own question?

You'll need to choose a username for the site, which only take a couple of moments. After that, you can post your question and our members will help you out.

Ask a Question

Members online

No members online now.

Forum statistics

Threads
473,968
Messages
2,570,153
Members
46,699
Latest member
AnneRosen

Latest Threads

Top