WikiDiscuss

WikiDiscuss


PEG Morphology Algorithm

posts: 1912



> > *** Sentence: muSTElaVIson 1
> > MISMATCH!
> > valfendi: >muSTE< -la VIson.
> > pegbased: -mu (STEla) VIson.
>
> It's invalid as an encoding of {mu stela vison} because the cmene is preceded

> by a brivla without a pause between them.

Is there a rule that says that a cmene can't be preceded by a brivla without
a pause between them? That would be odd, because cmene can practically
never appear after a brivla. Isn't the cmene morphology rule about the
syllable {la} rather than the cmavo {la}? And if it isn't, shouldn't it be?
The morphology should not care about what the words mean, only about their
form.

> {kybuladjan} is invalid because {ky} needs a pause after it. Both lexers,
> however, lex this as {ky bu la djan} (or so xorxes claims for camxes). The
> official rules state that the pause must be between the Cy and the next word
> that isn't Cy, but I figured out that it can be between the Cy and the next
> word that contains CVV, CV'V, or CCV, so I say {kybu.ladjan}.

Right. The official rules are more strict than they need to be here.
Both parsers have a bug with respect to the official rules, but this will
not be a bug with respect to the new official rules if they are approved.

> {kymoi}, {kybumoi}, {kybumlatu}, {lekymoi}, {lekybumoi}, and {lekybumlatu}
> are
> more phrases with the pause after the lervla missing. valfendi thinks they
> all contain brivla, but errors out trying to identify it, except for {ky bu
> mlatu}.

camxes should give:

ky moi
ky bu moi
ky bu mlatu
lekymoi (= lekmoi)
le ky bu moi
le ky bu mlatu

Again, these are bugs with respect to the official rules, which are
more strict than required for unambiguity. We could force these to be
errors, but that seems pointless.

mu'o mi'e xorxes





__
Do you Yahoo!?
Yahoo! Mail - now with 250MB free storage. Learn more.
http://info.mail.yahoo.com/mail_250