History: Lojban Etymology

Preview of version: 3

This directory
contains various Lojban etymology files, some of which are in a
format suitable for analysis by the GLOTTO software written by
Jacques Guy <jguy@trl.oz.au>.

The file
lojban.voc
contains a list of Lojban gismu in no particular order; the other *.voc
files contain the same words in each of the six Lojban source
languages. Note that the source-language words are in Lojbanized
spelling rather than conventional spelling, which makes them hard to
recognize. Furthermore, conventional endings have been chopped off,
and affricates ("tc", "dj", "ts", "dz") have generally been reduced
to simple spirants ("c", "j", "s", "z" respectively), to prevent
bogus mismatches. Thus, for example, the Spanish word "hijo" appears
as "ix".

The file
lojban.icg
contains the same data, but merged into a single file. The order of
words in this file is the same as that in the *.voc files, but the
words have been brought together. For each word, the languages are
listed in the order
Lojban-Chinese-English-Hindi-Spanish-Russian-Arabic. Each word is
preceded by the letter "L" if it is Lojban or contributed (score
> 0) to the Lojban gismu, or else by the first letter of its
language name ("C", "E", "H", "S", "R", or "A") if it made no
contribution (score = 0).

All of this data was drawn from the file
finprims,
which contains complete information (with transliterated/transcribed original-language forms) on the Lojban gismu (primitive roots).

The original-language representations exist only in hardcopy form, but Mublin has been able to reconstruct most of them, with only a few uncertain or missing etymologies. See https://www.dealloc.org/~mublin/ Mublin's site.

The file
etysample.txt
contains sample etymologies for a few gismu, and may be used to get
the flavor of Lojban etymologizing.

In addition, the files
langstat.94
and
langstat.95
are reports on the number of speakers of various world languages, as
of 1994 and 1995. Earlier versions of this data were used to make
weighting decisions in gismu construction.

The file
eaton.zip
is old Eaton data from an earlier stage of the Loglan Project.
Primarily of historical interest, it was an attempt at covering all
of the words in Helen Eaton's 1930's list of the most frequently
used concepts in 4 European languages. A low priority project is to
replace this work with updated Lojban words for each concept.
Contact
lojban@lojban.org
for further information.

History

Advanced
Information Version
Sat 15 of Dec, 2012 20:46 GMT zort from 70.29.75.104 Formatted the list nicer. 6
Sat 29 of Nov, 2008 15:08 GMT arj from 84.38.152.40 5
Sat 29 of Nov, 2008 15:04 GMT arj from 84.38.152.40 4
Sat 29 of Nov, 2008 15:02 GMT arj from 84.38.152.40 Added Mublin's original-form etymologies, which would otherwise not appear on Google 3
Sun 04 of Sep, 2005 05:48 GMT rlpowell from 64.81.49.171 2