[6378] in www-talk@info.cern.ch

home help back first fref pref prev next nref lref last post

Re: Unicode and HTML

daemon@ATHENA.MIT.EDU (Christian L. Mogensen)
Thu Oct 27 17:22:12 1994

Date: Thu, 27 Oct 1994 14:19:53 +0100
Errors-To: listmaster@www0.cern.ch
Errors-To: listmaster@www0.cern.ch
Reply-To: mogens@CS.Stanford.EDU
From: "Christian L. Mogensen" <mogens@CS.Stanford.EDU>
To: Multiple recipients of list <www-talk@www0.cern.ch>

As I understood it both sides were right. :-)

LANGUAGE markup was important and an orthogonal issue to
charset marks.

So you could write: (wrong syntax but you get the idea)
<LANGUAGE FRENCH> Sacr&eacute; bleu!</LANGUAGE>
is a french exclamation.

The character set is the same, but the languages are different.

Similarly, charset markup is important.  This was not
completely resolved as far as I remember.

<LANGUAGE FRENCH CHARSET="ISO-8859-1">Sacr&eacute;</LANGUAGE>

<LANGUAGE GREEK CHARSET="ISO-xxxx-y">Zevs</LANGUAGE>

where Zevs would be the bytes rendered in the appropriate
greek character set.  Knowing the semantic nugget: this is in Greek
is a bonus, but not required for character sets to be marked.

Christian "feel free to correct this vague recollection" 


home help back first fref pref prev next nref lref last post