[95473] in tlhIngan-Hol


Re: [Tlhingan-hol] Fwd: RE: Klingon Scrabble

daemon@ATHENA.MIT.EDU (Rohan Fenwick - QeS 'utlh)
Sun Jan 6 07:49:57 2013

From: Rohan Fenwick - QeS 'utlh <qeslagh@hotmail.com>
To: <tlhingan-hol@kli.org>
Date: Sun, 6 Jan 2013 22:49:39 +1000
In-Reply-To: <6.2.5.6.2.20130105203737.0577da00@flyingstart.ca>
Errors-To: tlhingan-hol-bounces@stodi.digitalkingdom.org

--===============1883155973398604638==
Content-Type: multipart/alternative;
	boundary="_ec6e560e-50e8-450c-99f5-93e3192685f0_"

--_ec6e560e-50e8-450c-99f5-93e3192685f0_
Content-Type: text/plain; charset="iso-8859-1"
Content-Transfer-Encoding: quoted-printable


I've been out of the loop for a while so haven't been able to say anything
much in the thread, but:

jatlh Qov:
> It's not very many words you have there, so probably only 'ay''a' wa'. It
> would be interesting to compare to Hamlet or any other
> manuscript.

luq, qaH. :) For Qov's (and others') interest, I've just run a
letter-frequency count using maHvatlh's tool on what I've done so far for
mIl'oD veDDIr SuvwI'. This comprises 80 'ay'mey and around 20,700 words so
far. Since I've rendered all names into Klingon phonotactics, some of the
frequencies will be skewed slightly by names, but here are the results, in
decreasing order of frequency:

Total phonemes surveyed: 122,250

'      13473
a      13059
I       7731
e       7704
H       7182
o       7097
u       6421
j       6216
v       4958
m       4363
D       3949
l       3759
gh      3702
t       3528
q       3481
ch      3342
b       3199
S       3014
p       2820
n       2656
w       2411
y       2021
tlh     1807
r       1791
Q       1677
ng       889

/m/ is slightly overrepresented and /b/ and /D/ slightly underrepresented
because of a couple of short exchanges in Krotmag dialect, but there are
large enough gaps between /m/, /b/ and /D/ and their neighbours that
standardising the Krotmag exchanges won't alter the relative distributions. I
haven't included /N/-counts for this reason.
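
For anyone wanting to produce a comparable count on their own text, here is a
minimal sketch of a phoneme tally for romanised Klingon. It assumes the
standard orthography, in which tlh, ch, gh and ng each count as one letter.
It is not maHvatlh's tool, whose internals aren't described in this thread;
the names PHONEME, tokenize and phoneme_counts are hypothetical, and the
sample string is only the title mentioned above.

```python
import re
from collections import Counter

# Multi-character letters are listed first so they win over single letters.
# "ng(?!h)" stops a sequence such as n + gh from being read as ng + h.
PHONEME = re.compile(r"tlh|ch|gh|ng(?!h)|[abDeHIjlmnopqQrStuvwy']")


def tokenize(text):
    """Split romanised Klingon into phonemes, skipping spaces and punctuation."""
    return PHONEME.findall(text)


def phoneme_counts(text):
    """Return a Counter mapping each phoneme to its number of occurrences."""
    return Counter(tokenize(text))


if __name__ == "__main__":
    counts = phoneme_counts("mIl'oD veDDIr SuvwI'")
    # List phonemes from most to least frequent, as in the table above.
    for phoneme, n in counts.most_common():
        print(f"{phoneme:<4}{n}")
```

The sketch only recognises the plain ASCII apostrophe as qaghwI'; text that
uses typographic apostrophes would need normalising first.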

The comparison to the list from nuq bop bom is intriguing. For starters, in
my text qaghwI' is the most frequent of ALL letters! This agrees well with
Qov's assertion that the Scrabble distribution needs more qaghwI'mey. The
letter with the biggest discrepancy between our lists is /t/, which is 19th
in Qov's list but 14th in mine. I'm quite sure this is because three of the
main characters have names with /t/ in them: 'avtanDIl, taryel, and
tIna'tIn. Removing these makes /t/ drop in frequency to 16th (below /ch/).
The two other big discrepancies are /gh/ and /v/, both of which are more
common in my list: /v/ and /gh/ are 9th and 13th in mine, compared to 12th
and 17th in Qov's. I'd put this down to narrative style, as a great deal of
mIl'oD veDDIr SuvwI' is people talking to and about themselves: there's
likely a lot of vI- and -'egh causing this. The frequency of /j/ also agrees
with that, as does the fact that /I/ (in both /jI-/ and /vI-/) outranks both
/e/ and /o/, where it doesn't for nuq bop bom.
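
The rank positions quoted above can be checked against the counts in this
message with a short sketch like the following. The figures in COUNTS are
copied from the table in this message; the helper names ranked, rank_of and
share are hypothetical. Qov's list isn't reproduced here, so only the ranks
for this text are computed.

```python
# Phoneme counts as posted in this message.
COUNTS = {
    "'": 13473, "a": 13059, "I": 7731, "e": 7704, "H": 7182, "o": 7097,
    "u": 6421, "j": 6216, "v": 4958, "m": 4363, "D": 3949, "l": 3759,
    "gh": 3702, "t": 3528, "q": 3481, "ch": 3342, "b": 3199, "S": 3014,
    "p": 2820, "n": 2656, "w": 2411, "y": 2021, "tlh": 1807, "r": 1791,
    "Q": 1677, "ng": 889,
}


def ranked(counts):
    """Return the phonemes sorted from most to least frequent."""
    return sorted(counts, key=counts.get, reverse=True)


def rank_of(phoneme, counts):
    """Return the 1-based rank of a phoneme (1 = most frequent)."""
    return ranked(counts).index(phoneme) + 1


def share(phoneme, counts):
    """Return a phoneme's share of all surveyed phonemes, in percent."""
    return 100 * counts[phoneme] / sum(counts.values())


if __name__ == "__main__":
    print("total:", sum(COUNTS.values()))
    for p in ("'", "I", "e", "o", "v", "gh", "t", "ch", "ng"):
        print(f"{p:<3} rank {rank_of(p, COUNTS):>2}  {share(p, COUNTS):5.2f}%")
```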

All my other letters, though, differ by two places or less from Qov's list.
/ng/ is last for me too, by a very considerable margin. The fact that /ch/
is actually *less* frequent for me is odd, though, because I happen to know
that my natural form for "but" (one of the most frequent words in written
text) is /'ach/ while Qov more usually seems to use /'a/.

QeS

--_ec6e560e-50e8-450c-99f5-93e3192685f0_--


--===============1883155973398604638==
Content-Type: text/plain; charset="us-ascii"
MIME-Version: 1.0
Content-Transfer-Encoding: 7bit
Content-Disposition: inline

_______________________________________________
Tlhingan-hol mailing list
Tlhingan-hol@stodi.digitalkingdom.org
http://stodi.digitalkingdom.org/mailman/listinfo/tlhingan-hol

--===============1883155973398604638==--

