[85453] in tlhIngan-Hol
Re: recent boing boing post and Unicode
daemon@ATHENA.MIT.EDU (ghunchu'wI')
Sun May 17 10:10:52 2009
In-Reply-To: <667947.8068.qm@web82603.mail.mud.yahoo.com>
From: "ghunchu'wI'" <qunchuy@alcaco.net>
Date: Sun, 17 May 2009 10:08:22 -0400
To: tlhingan-hol@kli.org
Errors-to: tlhingan-hol-bounce@kli.org
Reply-to: tlhingan-hol@kli.org
On May 16, 2009, at 5:36 PM, Terrence Donnelly wrote:
> Writing an algorithm to convert romanized Hol to a pIqaD-friendly
> format is relatively trivial.
If you can assume the source document is well-formed, and that it
contains Klingon letters only, then it's easy. It's not much harder
to include a check for unreasonable alphabetic characters (e.g.
"k"). But what do you do with punctuation? Commas and periods have
a traditional mapping to up- and down-pointing triangles, but that's
it. Do you map question marks to periods and rely on grammar to
convey the difference? Do you count semicolons and dashes as comma-
like pauses as well? What do you do with quotations?
Mixed-language source introduces an entirely different set of
challenges, with the question of how to resolve them being somewhat
more philosophical than technical. http://www.kli.org/QQ/QQ0407.html?
mode=XIFAN shows one possibility, though there are still glitches.
-- ghunchu'wI'