PLEASE NOTE: THIS IS AN OLD VERSION. The current version is linked from The Complete Lojban Language.

4. gismu

The gismu, or Lojban root words, are those brivla representing concepts most basic to the language. The gismu were chosen for various reasons: some represent concepts that are very familiar and basic; some represent concepts that are frequently used in other languages; some were added because they would be helpful in constructing more complex words; some because they represent fundamental Lojban concepts (like ``cmavo'' and ``gismu'' themselves).

The gismu do not represent any sort of systematic partitioning of semantic space. Some gismu may be superfluous, or appear for historical reasons: the gismu list was being collected for almost 35 years and was only weeded out once. Instead, the intention is that the gismu blanket semantic space: they make it possible to talk about the entire range of human concerns.

There are about 1350 gismu. In learning Lojban, you need only to learn most of these gismu and their combining forms (known as ``rafsi'') as well as perhaps 200 major cmavo, and you will be able to communicate effectively in the language. This may sound like a lot, but it is a small number compared to the vocabulary needed for similar communications in other languages.

All gismu have very strong form restrictions. Using the conventions defined in Section 1, all gismu are of the forms CVC/CV or CCVCV. They must meet the rules for all brivla given in Section 3; furthermore, they:

1)
always have five letters;
2)
always start with a consonant and end with a single vowel;
3)
always contain exactly one consonant pair, which is a permissible initial pair (CC) if it's at the beginning of the gismu, but otherwise only has to be a permissible pair (C/C);
4)
are always stressed on the first syllable (since that is penultimate).
The five letter length distinguishes gismu from lujvo and fu'ivla. (It is possible to have fu'ivla like ``spa'i'' that are five letters long, but they must have ``'''; no gismu contains ``'''.)

With the exception of five special brivla variables, ``broda'', ``brode'', ``brodi'', ``brodo'', and ``brodu'', no two gismu differ only in the final vowel. Furthermore, the set of gismu was specifically designed to reduce the likelihood that two similar sounding gismu could be confused. For example, because ``gismu'' is in the set of gismu, ``kismu'', ``xismu'', ``gicmu'', ``gizmu'', and ``gisnu'' cannot be.

Almost all Lojban gismu are constructed from pieces of words drawn from other languages, specifically Chinese, English, Hindi, Spanish, Russian, and Arabic, the six most widely spoken natural languages. For a given concept, words in the six languages that represent that concept were written in Lojban phonetics. Then a gismu was selected to maximize the recognizability of the Lojban word for speakers of the six languages by weighting the inclusion of the sounds drawn from each language by the number of speakers of that language. See Section 14 for a full explanation of the algorithm.

Here are a few examples of gismu, with rough English equivalents (not definitions):

3.1)  creka
    shirt

3.2)   lijda
    religion

3.3)   blanu
    blue

3.4)   mamta
    mother

3.5)   cukta
    book

3.6)   patfu
    father

3.7)   nanmu
    man

3.8)   ninmu
    woman

A small number of gismu were formed differently; see Section 15 for a list.