baltic charset

  • Thread starter Bart van den Burg
  • Start date
B

Bart van den Burg

Hi

I'm trying to learn latvian a bit, and I wanted to put some latvian text on
the web, but I can't figure out how to specify the baltic charset... Can
anyone tell me how to do it? I figured the META tag should do it, but it
doesn't :/

thanks
Bart



<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.01 Transitional//EN"
"http://www.w3.org/TR/html4/loose.dtd">
<HTML>
<HEAD>
<META HTTP-EQUIV="Content-Type" content="text/html; charset=windows-1257">
<TITLE>Lets textje</TITLE>
</HEAD>
<BODY>
<DIV>Es negrib peldeties, jo udens ir pârâk auksts, but es gribu pîpçt. Es
neesmu nogiris. Es ienîstu kukaiòi.</DIV>
</BODY>
</HTML>
 
D

DU

Bart said:
Hi

I'm trying to learn latvian a bit, and I wanted to put some latvian text on
the web, but I can't figure out how to specify the baltic charset... Can
anyone tell me how to do it? I figured the META tag should do it, but it
doesn't :/

thanks
Bart



<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.01 Transitional//EN"
"http://www.w3.org/TR/html4/loose.dtd">
<HTML>
<HEAD>
<META HTTP-EQUIV="Content-Type" content="text/html; charset=windows-1257">
<TITLE>Lets textje</TITLE>
</HEAD>
<BODY>
<DIV>Es negrib peldeties, jo udens ir pârâk auksts, but es gribu pîpçt. Es
neesmu nogiris. Es ienîstu kukaiòi.</DIV>
</BODY>
</HTML>

Try:

<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.01 Transitional//EN"
"http://www.w3.org/TR/html4/loose.dtd">
<HTML lang="lv">
<HEAD>
<META HTTP-EQUIV="Content-Type" content="text/html; charset=iso-8859-13">
<meta http-equiv="Content-Language" content="lv">
<meta http-equiv="Content-Style-Type" content="text/css">
<TITLE>Lets textje</TITLE>
</HEAD>
<BODY>
<DIV>Es negrib peldeties, jo udens ir pârâk auksts, but es gribu pîpçt.
Es neesmu nogiris. Es ienîstu kukaiòi.</DIV>
</BODY>
</HTML>

References:

http://www.loc.gov/standards/iso639-2/langcodes.html#kl

http://www.w3.org/International/O-charset-lang.html

http://www.vaelen.org/cgi-bin/vaelen/vaelen.cgi?topic=languagemenu-languagepacks

http://www.unicode.org/Public/MAPPINGS/ISO8859/8859-13.TXT

I don't know of browser support though. MSDN mentions iso-8859-4 instead.

DU
--
Javascript and Browser bugs:
http://www10.brinkster.com/doctorunclear/
- Resources, help and tips for Netscape 7.x users and Composer
- Interactive demos on Popup windows, music (audio/midi) in Netscape 7.x
http://www10.brinkster.com/doctorunclear/Netscape7/Netscape7Section.html
 
B

Bart van den Burg

DU said:
Try:

<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.01 Transitional//EN"
"http://www.w3.org/TR/html4/loose.dtd">
<HTML lang="lv">
<HEAD>
<META HTTP-EQUIV="Content-Type" content="text/html; charset=iso-8859-13">
<meta http-equiv="Content-Language" content="lv">
<meta http-equiv="Content-Style-Type" content="text/css">
<TITLE>Lets textje</TITLE>
</HEAD>
<BODY>
<DIV>Es negrib peldeties, jo udens ir pârâk auksts, but es gribu pîpçt.
Es neesmu nogiris. Es ienîstu kukaiòi.</DIV>
</BODY>
</HTML>

Nope... doesn't work
IE, Moz and Opera still display the wrong characters :(

Bart
 
D

DU

DU said:
Try:

<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.01 Transitional//EN"
"http://www.w3.org/TR/html4/loose.dtd">
<HTML lang="lv">
<HEAD>
<META HTTP-EQUIV="Content-Type" content="text/html; charset=iso-8859-13">
<meta http-equiv="Content-Language" content="lv">
<meta http-equiv="Content-Style-Type" content="text/css">
<TITLE>Lets textje</TITLE>
</HEAD>
<BODY>
<DIV>Es negrib peldeties, jo udens ir pârâk auksts, but es gribu pîpçt.
Es neesmu nogiris. Es ienîstu kukaiòi.</DIV>
</BODY>
</HTML>

References:

http://www.loc.gov/standards/iso639-2/langcodes.html#kl

http://www.w3.org/International/O-charset-lang.html

http://www.vaelen.org/cgi-bin/vaelen/vaelen.cgi?topic=languagemenu-languagepacks


http://www.unicode.org/Public/MAPPINGS/ISO8859/8859-13.TXT

I don't know of browser support though. MSDN mentions iso-8859-4 instead.

I can see pictures and read the Lithuanian text on this page with MSIE 6:
http://www.topolis.lt


DU
--
Javascript and Browser bugs:
http://www10.brinkster.com/doctorunclear/
- Resources, help and tips for Netscape 7.x users and Composer
- Interactive demos on Popup windows, music (audio/midi) in Netscape 7.x
http://www10.brinkster.com/doctorunclear/Netscape7/Netscape7Section.html
 
D

DU

Bart said:
Nope... doesn't work
IE, Moz and Opera still display the wrong characters :(

Bart

â is not a standard Latvian character AFAIK.

Then try iso-8859-4. It seems that Opera 7.x does not support iso-8859-13.
In MSIE 6 for Windows,
Start/Settings/Control Panel/Regional and language options/Advanced
tab/Code page conversion tables/28594 (ISO-8859-4 Baltic)
checkbox should be checked.

Finally, Mozilla supports iso-8859-13 character set. So you must first
make sure you're using an unicode compliant editor. (â is not a standard
Latvian character.) Unipad is a good one. You can use a built-in Latvian
virtual keyboard windows to write your webpages.

DU
--
Javascript and Browser bugs:
http://www10.brinkster.com/doctorunclear/
- Resources, help and tips for Netscape 7.x users and Composer
- Interactive demos on Popup windows, music (audio/midi) in Netscape 7.x
http://www10.brinkster.com/doctorunclear/Netscape7/Netscape7Section.html
 
D

DU

DU said:
â is not a standard Latvian character AFAIK.

Then try iso-8859-4. It seems that Opera 7.x does not support iso-8859-13.
In MSIE 6 for Windows,
Start/Settings/Control Panel/Regional and language options/Advanced
tab/Code page conversion tables/28594 (ISO-8859-4 Baltic)
checkbox should be checked.

Finally, Mozilla supports iso-8859-13 character set. So you must first
make sure you're using an unicode compliant editor. (â is not a standard
Latvian character.) Unipad is a good one. You can use a built-in Latvian
virtual keyboard windows to write your webpages.

DU
--
Javascript and Browser bugs:
http://www10.brinkster.com/doctorunclear/
- Resources, help and tips for Netscape 7.x users and Composer
- Interactive demos on Popup windows, music (audio/midi) in Netscape 7.x
http://www10.brinkster.com/doctorunclear/Netscape7/Netscape7Section.html

This document
http://members.optusnet.com.au/~waldis/latviski2.htm
claims that Mozilla, NS 7.x and Opera 7 renders accordingly iso-8859-13
character set. So it seems to me that the problem is to use an unicode
compliant editor and then use the markup code I initially proposed.

"(...)With the new Netscape 7 and Mozilla 1.3 composers you can easily
create genuine Latvian www pages. They also print them correctly. Email
is no problem either.(...)"
"(...) This identifies the document as an iso-8859-13 html-document. It
can be displayed with Netscape 7, Mozilla, Galeon, Opera 7, and
konqueror, all of which understand iso-8859-13.(...)"

DU
--
Javascript and Browser bugs:
http://www10.brinkster.com/doctorunclear/
- Resources, help and tips for Netscape 7.x users and Composer
- Interactive demos on Popup windows, music (audio/midi) in Netscape 7.x
http://www10.brinkster.com/doctorunclear/Netscape7/Netscape7Section.html
 
D

DU

http://www10.brinkster.com/doctorunclear/Netscape7/Lets_textje.html

The only missing characters are from symbols gotten from <AltGr> or
<Ctrl> and are not specific Latvian characters. I used Unipad 1.0 to
create the page.
I see no problem in using iso-8859-13. You must use an unicode compliant
editor though.

DU
--
Javascript and Browser bugs:
http://www10.brinkster.com/doctorunclear/
- Resources, help and tips for Netscape 7.x users and Composer
- Interactive demos on Popup windows, music (audio/midi) in Netscape 7.x
http://www10.brinkster.com/doctorunclear/Netscape7/Netscape7Section.html
 
D

DU

DU said:
http://www10.brinkster.com/doctorunclear/Netscape7/Lets_textje.html

The only missing characters are from symbols gotten from <AltGr> or
<Ctrl> and are not specific Latvian characters. I used Unipad 1.0 to
create the page.
I see no problem in using iso-8859-13. You must use an unicode compliant
editor though.

http://www10.brinkster.com/doctorunclear/Netscape7/Lets_textje.html

I fixed the remaining problem and validated the page. The whole Latvian
keyboard, alphabet, glyphs can be rendered perfectly in iso-8859-13 in
MSIE 6 for Windows, Mozilla 1.5b, NS 7.1 and Opera 7.20.

DU
--
Javascript and Browser bugs:
http://www10.brinkster.com/doctorunclear/
- Resources, help and tips for Netscape 7.x users and Composer
- Interactive demos on Popup windows, music (audio/midi) in Netscape 7.x
http://www10.brinkster.com/doctorunclear/Netscape7/Netscape7Section.html
 
T

Toby A Inkster

DU said:
Then try iso-8859-4. It seems that Opera 7.x does not support iso-8859-13.
In MSIE 6 for Windows,
Start/Settings/Control Panel/Regional and language options/Advanced
tab/Code page conversion tables/28594 (ISO-8859-4 Baltic)
checkbox should be checked.

etc, etc, etc....

That's why we have UTF-8.
 
B

Bart van den Burg

----- Original Message -----
From: "DU" <[email protected]>
Newsgroups: alt.html
Sent: Monday, September 01, 2003 3:20 AM
Subject: Re: baltic charset

http://www10.brinkster.com/doctorunclear/Netscape7/Lets_textje.html

I fixed the remaining problem and validated the page. The whole Latvian
keyboard, alphabet, glyphs can be rendered perfectly in iso-8859-13 in
MSIE 6 for Windows, Mozilla 1.5b, NS 7.1 and Opera 7.20.

/me scratches his head
I don't understand why yours works and mine doesn't...
If i set it to Baltic charenc manually in IE, it shows fine, but yours does
automatically...

http://bart.tvreclames.nl/lets.html

Bart
 
B

Bart van den Burg

Bart van den Burg said:
----- Original Message -----
From: "DU" <[email protected]>
Newsgroups: alt.html
Sent: Monday, September 01, 2003 3:20 AM
Subject: Re: baltic charset



/me scratches his head
I don't understand why yours works and mine doesn't...
If i set it to Baltic charenc manually in IE, it shows fine, but yours does
automatically...

http://bart.tvreclames.nl/lets.html

Bart

Ah never mind
The validator told me that the server sent another header than the meta tag
did, so i figured it was something server-side, and it was, cause there was
a default charset :)
thanks
Bart
 
J

Jukka K. Korpela

Bart van den Burg said:
The validator told me that the server sent another header than the
meta tag did, so i figured it was something server-side, and it was,
cause there was a default charset :)

This illustrates well why people should post URLs in their questions.
Posting a URL is more useful than posting a copy of a demo document,
since HTTP headers are often very relevant.

By the specifications, a charset parameter specified in an HTTP header
shall override anything you might say within the document itself, such as
a <meta> tag that simulates an HTTP header.

Regarding the encoding, on the WWW, I would normally recommend using an
internationally standardized encoding, rather than one vendor's special
encoding, even if that vendor is currently a market leader. Practical
considerations may sometimes lead to a different position.

The world of 8-bit ISO encodings (ISO 8859) is rather confusing as
regards to support to Latvian and, worse still, IE seems to have its own
strange ideas of it. Unless I'm missing something, IE refuses to
recognize ISO 8859-13 but knows windows-1257, which is Microsoft's
proprietary variant of the same basic idea (with some code positions that
are specifically assigned to control codes in international standards
have been taken into use for some special characters).

I would suggest considering the use of UTF-8 especially if you author for
a worldwide audience, since UTF-8 support is rather widespread whereas
support to various 8-bit encodings in browsers may vary a lot.
Alternatively, you could consider writing all non-Latin-1 characters
using character references. For general notes on selecting an encoding,
see http://ppewww.ph.gla.ac.uk/~flavell/charset/checklist
 
D

DU

Jukka said:
This illustrates well why people should post URLs in their questions.
Posting a URL is more useful than posting a copy of a demo document,
since HTTP headers are often very relevant.

I certainly agree on the posting url issue: you just brought up another
reason I did not think of.
By the specifications, a charset parameter specified in an HTTP header
shall override anything you might say within the document itself, such as
a <meta> tag that simulates an HTTP header.

Regarding the encoding, on the WWW, I would normally recommend using an
internationally standardized encoding, rather than one vendor's special
encoding, even if that vendor is currently a market leader. Practical
considerations may sometimes lead to a different position.

The world of 8-bit ISO encodings (ISO 8859) is rather confusing as
regards to support to Latvian and, worse still, IE seems to have its own
strange ideas of it. Unless I'm missing something, IE refuses to
recognize ISO 8859-13 but knows windows-1257, which is Microsoft's
proprietary variant of the same basic idea (with some code positions that
are specifically assigned to control codes in international standards
have been taken into use for some special characters).

As far as I can say, MSIE 6 for Windows does support iso-8859-13. But I
can not find it listed in
Start/Settings/Control Panel/Regional and language options/Advanced
tab/Code page conversion tables
My guess is that MSIE 6 just as Mozilla-based browsers support
iso-8859-4, iso-8859-13 and windows-1257.
I would suggest considering the use of UTF-8 especially if you author for
a worldwide audience, since UTF-8 support is rather widespread whereas
support to various 8-bit encodings in browsers may vary a lot.
Alternatively, you could consider writing all non-Latin-1 characters
using character references. For general notes on selecting an encoding,
see http://ppewww.ph.gla.ac.uk/~flavell/charset/checklist

DU
--
Javascript and Browser bugs:
http://www10.brinkster.com/doctorunclear/
- Resources, help and tips for Netscape 7.x users and Composer
- Interactive demos on Popup windows, music (audio/midi) in Netscape 7.x
http://www10.brinkster.com/doctorunclear/Netscape7/Netscape7Section.html
 
A

Andreas Prilop

Bart van den Burg said:
I don't understand why yours works and mine doesn't...
If i set it to Baltic charenc manually in IE, it shows fine, but yours does
automatically...
http://bart.tvreclames.nl/lets.html

A better idea than to apply the Microsoft-proprietary encoding
"charset=Windows-1257", which does *not* work in Macintosh und Unix
Netscapes 4.x and probably other browsers, is to write all special,
non-ASCII characters in the form with "charset=UTF-8".
See http://www.unics.uni-hannover.de/nhtcapri/multilingual2.html
http://ppewww.ph.gla.ac.uk/~flavell/charset/checklist.html#s6
http://ppewww.ph.gla.ac.uk/~flavell/charset/quick#cons
Such encoded Baltic characters will be displayed even in Macintosh
and Unix Netscape 4.0.
 

Ask a Question

Want to reply to this thread or ask your own question?

You'll need to choose a username for the site, which only take a couple of moments. After that, you can post your question and our members will help you out.

Ask a Question

Members online

Forum statistics

Threads
474,077
Messages
2,570,568
Members
47,204
Latest member
abhinav72673

Latest Threads

Top