Lojban Logo Lojban: The Logical Language Lojban Logo

If you are new to Lojban or this site, please read the help page first.

Finding Lojbanic Information: [ Help | Resources | Popular Pages | Site Map | Search ]
Basic Information About Lojban: [ FAQ | About Lojban | General LLG Information | News ]
Learning Lojban: [ Just Starting Out | Experienced | LLG Publications | Texts In Lojban ]
Helping The Lojban Community: [ Projects | LLG Committees | How You Can Help | Give Feedback ]
Languages:  [Lojban] |  [Esperanto] |  [Français] |  [Deutsch] |  [Ivrit] |  [Norsk] |  [Español] |  [Swedish] |  [English]

Lojban Etymology Information

This directory contains various Lojban etymology files, some of which are in a format suitable for analysis by the GLOTTO software written by Jacques Guy <[email protected]>.

The file lojban.voc contains a list of Lojban gismu in no particular order; the other *.voc files contain the same words in each of the six Lojban source languages. Note that the source-language words are in Lojbanized spelling rather than conventional spelling, which makes them hard to recognize. Furthermore, conventional endings have been chopped off, and affricates ("tc", "dj", "ts", "dz") have generally been reduced to simple spirants ("c", "j", "s", "z" respectively), to prevent bogus mismatches. Thus, for example, the Spanish word "hijo" appears as "ix".

The file lojban.icg contains the same data, but merged into a single file. The order of words in this file is the same as that in the *.voc files, but the words have been brought together. For each word, the languages are listed in the order Lojban-Chinese-English-Hindi-Spanish-Russian-Arabic. Each word is preceded by the letter "L" if it is Lojban or contributed (score > 0) to the Lojban gismu, or else by the first letter of its language name ("C", "E", "H", "S", "R", or "A") if it made no contribution (score = 0).

All of this data was drawn from the file finprims, which contains complete information (except for original-language representations, which only exist in hardcopy form) on the Lojban gismu (primitive roots).

The file etysample.txt contains sample etymologies for a few gismu, and may be used to get the flavor of Lojban etymologizing.

In addition, the files langstat.94 and langstat.95 are reports on the number of speakers of various world languages, as of 1994 and 1995. Earlier versions of this data were used to make weighting decisions in gismu construction.

The file eaton.zip is old Eaton data from an earlier stage of the Loglan Project. Primarily of historical interest, it was an attempt at covering all of the words in Helen Eaton's 1930's list of the most frequently used concepts in 4 European languages. A low priority project is to replace this work with updated Lojban words for each concept. Contact [email protected] for further information.


Last modified: Mon Jun 27 23:10:43 PDT 2005

Please contact us with any comments, suggestions or concerns.