[84312] in tlhIngan-Hol
Re: Klingon phonology in regular expressions
daemon@ATHENA.MIT.EDU (David Trimboli)
Thu Mar 27 12:34:49 2008
Date: Thu, 27 Mar 2008 12:32:56 -0400
From: David Trimboli <david@trimboli.name>
In-reply-to: <f60fe000803241929k7ada25fbre40be5f92c7e0578@mail.gmail.com>
To: tlhingan-hol@kli.org
Errors-to: tlhingan-hol-bounce@kli.org
Reply-to: tlhingan-hol@kli.org
Mark J. Reed wrote:
> Gah. Let me try that again.
>
> $ gzcat tlhIngan.gz |
> perl -ne 's/tlh\b/ts/g; s/\btlh/ghl/g; s/\bH/h/g; s/H\b//g; s/\bQ/H/g;
> print if /(success|Klingon) \(n\)|yes,
> true.*excl|with honor \(adv\)/'
>
>
> Which yields this result:
>
> hISla yes, true (answer to yes/no question) (excl)
> hIja' yes, true (answer to yes/no question) (excl)
> Hapla' success (n)
> bats honor (n); honored, with honor (adv)
> ghlIngan Klingon (n)
But that'll only work with words without prefixes or suffixes, right? It
won't deal correctly with, say, {jItlhutlh}, which will end up as
{jItlhuts} instead of {jIghluts}.
I think you can get everything except the {-oy} transformation if you
replace every \b with (\b|a|e|I|o|u)
--
SuStel
Stardate 8237.0