downcase/uppercase for non-English characters

Victor Shepelev

Hello.

Can this (see the subject line) be done for 1-byte (non-Unicode) encodings,
particularly windows-1251 (Russian)?

Are there any interpreter options or third-party libraries for
locale-specific operations?

Thanks.

Victor.
 
serbulentu

Hi

As Turkish users, we have the same problems, not only in windows-1254 but also
in UTF-8, and I think Matz, as a Japanese user, has the same problems too. I
read a solution on a Turkish forum; maybe you can use it. They say that for
Unicode you should use jcode=u and write your own regex-based downcase/uppercase
functions. If you do that in your Ruby code, all libraries that use those
functions will use yours. But this problem will be fixed in version 2.0 of
Ruby.
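
For readers following along, the "write your own downcase/uppercase" idea could look roughly like the sketch below for Turkish text in UTF-8. The module name `TurkishCase` and its methods are made up for illustration and do not come from any library. The letters are written as `\u` escapes so the source stays plain ASCII, and the sketch assumes a Ruby whose strings carry an encoding (1.9 or later). It maps only the Turkish-specific letters and leaves the rest to the built-in mapping.

```ruby
# Hypothetical helper module; the name is illustrative, not a real library.
module TurkishCase
  # I, I-with-dot, C-cedilla, G-breve, O-umlaut, S-cedilla, U-umlaut
  UPPER = "I\u0130\u00C7\u011E\u00D6\u015E\u00DC"
  # dotless i, i, c-cedilla, g-breve, o-umlaut, s-cedilla, u-umlaut
  LOWER = "\u0131i\u00E7\u011F\u00F6\u015F\u00FC"

  # Map the Turkish-specific letters first, then let String#downcase
  # handle the remaining letters.
  def self.downcase(str)
    str.tr(UPPER, LOWER).downcase
  end

  # Same idea in the other direction.
  def self.upcase(str)
    str.tr(LOWER, UPPER).upcase
  end
end

puts TurkishCase.downcase("ISPARTA")
puts TurkishCase.upcase("izmir")
```

Keeping the functions in a separate module avoids redefining `String#downcase` globally. Reopening `String` instead, as the forum suggestion implies, would make other libraries pick up the new behaviour, at the cost of changing it for everything else too.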

Serbulent UNSAL
 
Victor Shepelev

> As Turkish users, we have the same problems, not only in windows-1254 but also
> in UTF-8, and I think Matz, as a Japanese user, has the same problems too.

Hmm... It seems to me that Japanese has no upper/lower case.

> I read a solution on a Turkish forum; maybe you can use it. They say that for
> Unicode you should use jcode=u and write your own regex-based downcase/uppercase
> functions. If you do that in your Ruby code, all libraries that use those
> functions will use yours.


Yes, I know how to use Unicode (but I don't want to), and I know how to write
a custom upper/downcase via String#tr (and I've already written one). What I
can't handle is case-insensitive Regexp matching :-\
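
A String#tr-based mapping of the kind mentioned above might look like this for windows-1251. This is only a sketch with hypothetical function names. It assumes a Ruby whose strings carry an encoding (1.9 or later) and whose `String#encode` knows "Windows-1251". It sidesteps the case-insensitive Regexp problem by lowercasing the subject and requiring the pattern to be written in lowercase, rather than by making `/i` itself locale-aware.

```ruby
CP1251 = "Windows-1251"

# Ranges are written as \u escapes and transcoded once, so this source
# file stays ASCII. In windows-1251 the 32 basic letters are contiguous;
# YO (U+0401 / U+0451) sits outside that range and is listed separately.
CYR_UPPER = "\u0410-\u042F\u0401".encode(CP1251)
CYR_LOWER = "\u0430-\u044F\u0451".encode(CP1251)

# Hypothetical helpers: both expect a string tagged as windows-1251.
def cp1251_downcase(str)
  str.tr(CYR_UPPER, CYR_LOWER).downcase
end

def cp1251_upcase(str)
  str.tr(CYR_LOWER, CYR_UPPER).upcase
end

# Case-insensitive matching by normalisation: the pattern must already be
# written in lowercase, and the subject is lowercased before matching.
def cp1251_match_ci(lowercase_pattern, subject)
  lowercase_pattern.match(cp1251_downcase(subject))
end

word    = "\u041F\u0440\u0438\u0432\u0435\u0442".encode(CP1251)  # "Privet", capitalised
pattern = Regexp.new("\u043F\u0440\u0438\u0432".encode(CP1251))  # "priv", lowercase
puts "matched" if cp1251_match_ci(pattern, word)
```

The obvious limitation is that the returned match refers to the lowercased copy, so the matched text loses its original case, although offsets still line up because every character stays one byte.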
> But this problem will be fixed in version 2.0 of Ruby.

I would hope so, but I can't wait :)

In any case, thanks for your help.
> Serbulent UNSAL

Victor.
 
Paul Duncan

* Victor Shepelev ([email protected]) said:
> Hello.
>
> Can this (see the subject line) be done for 1-byte (non-Unicode) encodings,
> particularly windows-1251 (Russian)?
>
> Are there any interpreter options or third-party libraries for
> locale-specific operations?

There is Unicode property support that can be compiled into PCRE. I know
there are PCRE bindings for Ruby as well, although I have no idea how
functional they are.

Obviously in order for this to work, you'd need to use Iconv to
convert the data into PCRE-friendly UTF-8.
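
The route suggested above (transcode to UTF-8, then match with a Unicode-aware engine) can be sketched with what later Ruby versions ship by default. Note the substitutions: `String#encode` stands in for Iconv, and Ruby's built-in Regexp with the ignore-case flag stands in for the PCRE bindings, so this approximates the idea rather than reproducing the setup described. The function name is hypothetical.

```ruby
# Hypothetical helper: treat raw_bytes as windows-1251, transcode to UTF-8,
# then do a case-insensitive literal search using the built-in Regexp.
def match_ci_via_utf8(needle_utf8, raw_bytes)
  utf8 = raw_bytes.encode("UTF-8", "Windows-1251")
  pattern = Regexp.new(Regexp.escape(needle_utf8), Regexp::IGNORECASE)
  pattern.match(utf8)
end

# "PRIVET" in uppercase Cyrillic, stored as windows-1251 bytes.
data = "\u041F\u0420\u0418\u0412\u0415\u0422".encode("Windows-1251")
# "privet" in lowercase Cyrillic, given as a UTF-8 needle.
m = match_ci_via_utf8("\u043F\u0440\u0438\u0432\u0435\u0442", data)
puts "found at character #{m.begin(0)}" if m
```

If the result has to go back into a windows-1251 pipeline, the matched text can be transcoded again with `m[0].encode("Windows-1251")`.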

-- 
Paul Duncan <[email protected]> pabs in #ruby-lang (OPN IRC)
http://www.pablotron.org/ OpenPGP Key ID: 0x82C29562

 
