WikiDiscuss


PEG Morphology Algorithm


> On Wed, Dec 22, 2004 at 04:12:08PM -0800, Robin Lee Powell wrote:
> > On Wed, Dec 22, 2004 at 06:51:36PM -0500, Pierre Abbat wrote:
> > > If camxes and valfendi give different output on an invalid
> > > string
> > snip
> > > that is not necessarily a bug.
> >
> > That's a good point, but in many of these cases I'm going to need
> > you guys to tell me what is and is not a valid string.

I don't think that's right. If the string is invalid, both parsers
should say it is invalid. If they say anything else, it is a bug.
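
As a rough sketch of that criterion, in Python: the function name and the convention that None stands for "rejected as invalid" are assumptions made for illustration, not anything camxes or valfendi actually expose.

```python
from typing import Optional


def compare_results(a: Optional[str], b: Optional[str]) -> str:
    """Compare the output of two parsers on the same input string.

    Hypothetical convention: None means the parser rejected the
    string as invalid; any str is that parser's segmentation.
    """
    if a is None and b is None:
        # Both reject the string: this is the expected agreement.
        return "agree: invalid"
    if a is None or b is None:
        # One accepts what the other rejects: one of them has a bug.
        return "bug: validity disagreement"
    if a == b:
        return "agree: same parse"
    # Both accept the string but segment it differently.
    return "bug: different parses"
```

Under this reading, only the first and third outcomes are acceptable. Two different outputs on an invalid string still count as a mismatch that someone has to explain.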

> For example, in this case one of you thinks it's invalid, the other
> does not:
>
> *** Sentence: muSTElaVIson 1
> MISMATCH!
> valfendi: >muSTE< -la VIson.
> pegbased: -mu (STEla) VIson.
>
> Morphologically invalid, I mean. Both cases are grammatically
> invalid.

Make it grammatically valid:

lo'u muSTElaVIson le'u lojbo valsi

> I'm pretty sure camxes is wrong on this one.

I'm not so sure. I'm inclined to say it is not wrong, because the
rules for identifying cmene are purely *morphological*. They should
not rely on identifying the preceding "la" as a gadri. Any syllable
"la" will allow the cmene to skip the initial pause.

mu'o mi'e xorxes



