Bamako said:
Is there still a problem with class names using non ASCII [sic] characters?
There's no problem, they're just forbidden by the language spec.
According to the JLS[1]:
An identifier is an unlimited-length sequence of Java letters and Java digits, the first of which must be a Java letter. ,,,
The Java letters include uppercase and lowercase ASCII Latin letters A-Z (\u0041-\u005a), and a-z (\u0061-\u007a), and, for historical reasons, the ASCII underscore (_, or \u005f) and dollar sign ($, or \u0024). The $ character should be used only in mechanically generated source code or, rarely, to access preexisting names on legacy systems.
include does not mean limited to...
http://java.sun.com/docs/books/jls/third_edition/html/lexical.html#3.8
Letters and digits may be drawn from the entire Unicode character set,
which supports most writing scripts in use in the world today,
including the large sets for Chinese, Japanese, and Korean. This
allows programmers to use identifiers in their programs that are
written in their native languages.
My question is how well implemented is the following section (and
whether its interpretations are mutually compatible) to be ensure
portability:
A package name component or class name might contain a character that
cannot correctly appear in a host file system's ordinary directory
name, such as a Unicode character on a system that allows only ASCII
characters in file names. As a convention, the character can be
escaped by using, say, the @ character followed by four hexadecimal
digits giving the numeric value of the character, as in the \uxxxx
escape (§3.3), so that the package name:
children.activities.crafts.papierM\u00e2ch\u00e9
which can also be written using full Unicode as:
children.activities.crafts.papierMâché
might be mapped to the directory name:
children/activities/crafts/papierM@00e2ch@00e9
If the @ character is not a valid character in a file name for some
given host file system, then some other character that is not valid in
a identifier could be used instead.