History: BPFK Section: lerfu Shifts

Preview of version: 23

The lerfu shifts (BY1) consist of these cmavo: ga'e, ge'o, je'o, jo'o, lo'a, na'a, ru'o, se'e, to'a.

Usage observations


To survey past usage, I searched a corpus of 900 kilobytes of pure-Lojban text that I have collected. The corpus includes all texts published at lojban.org/files/texts, many large texts from the Wiki, IRC logs, and texts from the CVS server such as Alice.
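A search of this kind can be approximated with a simple word count. The following is only a sketch: the function name count_cmavo and the tokenizing rule (runs of lowercase letters and apostrophes) are assumptions for illustration, not the procedure actually used for the survey, and the rule would miss cmavo written together as compounds.

```python
import re
from collections import Counter

# The BY1 cmavo listed at the top of this page.
BY1 = ["ga'e", "ge'o", "je'o", "jo'o", "lo'a", "na'a", "ru'o", "se'e", "to'a"]

def count_cmavo(text, cmavo=BY1):
    """Count whole-word occurrences of each cmavo in a Lojban text.

    Assumption: words are runs of letters and apostrophes; dots,
    commas and whitespace are treated as separators.
    """
    tokens = re.findall(r"[a-z']+", text.lower())
    counts = Counter(t for t in tokens if t in cmavo)
    # Report zero for cmavo that never occur, so gaps are visible.
    return {c: counts.get(c, 0) for c in cmavo}
```

A count like this does not distinguish correct from incorrect usage, or quoted mentions such as "zo ru'o" from real use, so each hit still has to be read in context.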

ga'e, to'a (case shifts)


The only correct use of ga'e is in an algebra text, where it marks variables named with capital letters. The author assumed that the shift would apply across multiple lerfu strings.

to'a is never used correctly.

The text lapoi pelxu ku'o trajynobli contains the sentence ".itu'e ga'e ca cpedu fi do to'a". Grammatically, ga'e and to'a here both act as pro-sumti, which was probably not intended. The author more likely meant them to "capitalize" (emphasize) the words between them, but as lerfu modifiers they cannot change the emphasis of words.

ge'o, je'o, jo'o, lo'a, ru'o (alphabet shifts)


None of these are used anywhere in the corpus, except that the utterance "zo ru'o" appeared on IRC in response to a line of Russian text.

zai has not been assigned to this section, but it has a similar function to the above cmavo. It is also not used anywhere in the corpus.

na'a (cancel shifts)


This word is not used in the corpus, though nau was used in the algebra text where na'a was probably intended.

se'e (character code)


This word is not used in the corpus.

Proposed definitions

ga'e
Converts future letterals to uppercase. The change applies until it is shifted back with to'a or cancelled with na'a.
na'a
Cancels all shifts (font, case, etc.) currently applied to letterals. Any shifts that occur earlier in the text do not affect letterals from this point on.
se'e
Converts the next sequence of digits to a character code in ASCII, Unicode, or some other agreed-upon character set. The code includes all digits until the next non-digit, the end of the letteral sequence, or na'a.
to'a
Converts future letterals to lowercase. The change applies until it is shifted back with ga'e or cancelled with na'a.
ge'o
Converts future letterals to the Greek alphabet. The change applies until it is shifted by je'o, jo'o, lo'a, or ru'o, or cancelled with na'a.
je'o
Converts future letterals to the Hebrew alphabet. The change applies until it is shifted by ge'o, jo'o, lo'a, or ru'o, or cancelled with na'a.
jo'o
Converts future letterals to the Arabic alphabet. The change applies until it is shifted by je'o, ge'o, lo'a, or ru'o, or cancelled with na'a.
lo'a
Converts future letterals to the Lojban (Roman) alphabet. The change applies until it is shifted by je'o, jo'o, ge'o, or ru'o, or cancelled with na'a.
ru'o
Converts future letterals to the Russian (Cyrillic) alphabet. The change applies until it is shifted by je'o, jo'o, lo'a, or ge'o, or cancelled with na'a.

Proposed keywords


ga'e: uppercase shift
na'a: cancel shifts
se'e: character code
to'a: lowercase shift
ge'o: Greek shift
je'o: Hebrew shift
jo'o: Arabic shift
lo'a: Lojban shift, Roman shift
ru'o: Russian shift, Cyrillic shift

Changes

Clarification of scope


The scope of a letteral shift needs to be defined. I will elaborate on Arnt's specification in BPFK Section: lerfu Forming cmavo, also following the "Microsoft Word model" specified at Interpretive conventions for lerfu formatting cmavo.

A letteral shift lasts until another shift of the same type replaces it, or it is cancelled by na'a.
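The scope rule can be pictured as a small state machine with one slot per shift type. The sketch below is illustrative only: the function names, the default state (lowercase, Lojban alphabet) and the simplified handling of lerfu words are assumptions, not part of the proposal.

```python
CASE_SHIFTS = {"ga'e": "upper", "to'a": "lower"}
ALPHABET_SHIFTS = {
    "ge'o": "Greek", "je'o": "Hebrew", "jo'o": "Arabic",
    "lo'a": "Lojban", "ru'o": "Cyrillic",
}
DEFAULT_CASE = "lower"        # assumption: unshifted letterals are lowercase
DEFAULT_ALPHABET = "Lojban"   # assumption: unshifted letterals are Lojban

def base_letter(word):
    """Hypothetical helper: 'by' -> 'b', '.abu' -> 'a'."""
    w = word.lstrip(".")
    if w.endswith("bu"):
        return w[:-2]
    if w.endswith("y"):
        return w[:-1]
    raise ValueError(f"not a simple lerfu word: {word!r}")

def apply_shifts(words):
    """Return (alphabet, letter) pairs for a sequence of lerfu words."""
    case, alphabet = DEFAULT_CASE, DEFAULT_ALPHABET
    out = []
    for w in words:
        if w in CASE_SHIFTS:
            case = CASE_SHIFTS[w]            # replaces only the case shift
        elif w in ALPHABET_SHIFTS:
            alphabet = ALPHABET_SHIFTS[w]    # replaces only the alphabet shift
        elif w == "na'a":
            case, alphabet = DEFAULT_CASE, DEFAULT_ALPHABET  # cancels all shifts
        else:
            letter = base_letter(w)
            out.append((alphabet, letter.upper() if case == "upper" else letter))
    return out
```

For the sequence ga'e by cy ge'o dy to'a .abu na'a fy, the uppercase shift survives the change of alphabet, and only na'a returns both slots to their defaults.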

(The sole correct usage of ga'e assumed that the shift would not end at the end of a lerfu string.)

It has not yet been specified where a se'e construct should end. I propose that na'a be able to terminate it, because na'a terminates other sorts of shifts.
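To see why an explicit terminator is useful, consider reading a se'e code from a stream of words. This is a minimal sketch under stated assumptions: the digits are read as a decimal number and the result is treated as a Unicode code point, neither of which the proposal fixes, and read_char_code is a hypothetical name.

```python
# Lojban digit cmavo and their values.
DIGITS = {"no": 0, "pa": 1, "re": 2, "ci": 3, "vo": 4,
          "mu": 5, "xa": 6, "ze": 7, "bi": 8, "so": 9}

def read_char_code(words, start):
    """Read a se'e construct beginning at words[start].

    Returns (code, next_index). The code ends at the next non-digit,
    at the end of the sequence, or at na'a, which is consumed.
    Assumption: digits form a decimal number.
    """
    if words[start] != "se'e":
        raise ValueError("expected se'e")
    i = start + 1
    value = 0
    seen_digit = False
    while i < len(words) and words[i] in DIGITS:
        value = value * 10 + DIGITS[words[i]]
        seen_digit = True
        i += 1
    if not seen_digit:
        raise ValueError("se'e must be followed by at least one digit")
    if i < len(words) and words[i] == "na'a":
        i += 1  # explicit terminator, so following digits stay separate
    return value, i
```

With this reading, se'e xa mu by yields the code 65 (the letter "A" if the agreed character set is ASCII or Unicode) followed by the letteral by, while se'e xa mu na'a pa keeps the trailing pa out of the code.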

One possible interpretive convention for these cmavo (apparently intended by the founders) is that a parenthetical shift or font-and-face change that is not followed by lerfu would be taken as applying to whole words, sort of like a mark-up language. For example: "to'i ga'e toi mi to'i to'a toi klama" would be "MI klama".
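The mark-up reading of the example can be sketched as follows. The function name render_with_markup is hypothetical, and the code handles only the case shifts and na'a inside a three-word to'i ... toi group, which is all the example requires; it is not a parser for the convention in general.

```python
def render_with_markup(text):
    """Apply parenthetical case shifts to the words that follow them.

    Assumption: a shift is written as exactly "to'i <shift> toi",
    where <shift> is ga'e, to'a or na'a.
    """
    words = text.split()
    upper = False
    out = []
    i = 0
    while i < len(words):
        is_markup = (
            words[i] == "to'i"
            and i + 2 < len(words)
            and words[i + 1] in ("ga'e", "to'a", "na'a")
            and words[i + 2] == "toi"
        )
        if is_markup:
            upper = words[i + 1] == "ga'e"  # to'a and na'a both end uppercase
            i += 3
            continue
        out.append(words[i].upper() if upper else words[i])
        i += 1
    return " ".join(out)
```

Applied to "to'i ga'e toi mi to'i to'a toi klama", this returns "MI klama", matching the example.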

Omission of unused cmavo


Given that Lojban does not seem to be intended for holding multilingual spelling bees, and that a dictionary containing many unused cmavo with bizarre functions could confuse learners of the language, the BPFK does not recommend including the alphabet shifts (ge'o, je'o, jo'o, lo'a, ru'o) in learning materials, even those intended for advanced learners. The cmavo should not be reassigned to have other meanings, however.

Impact


The clarifications made to the scope of lerfu shifts give a consistent model of how shifts should be applied, and do not invalidate any known usage.

I believe that my scope clarifications are consistent with those in BPFK Section: lerfu Forming cmavo, even though that page says otherwise.

Given the lack of usage, omitting the alphabet shifts from learning materials should not have any significant impact on the language.

History

Information Version
Sun 08 of Jun, 2014 19:30 GMT mukti from 216.194.27.154 30
Tue 19 of Oct, 2010 00:58 GMT lindarthebard from 32.172.136.135 29
Tue 19 of Oct, 2010 00:50 GMT lindarthebard from 32.172.136.135 28
Tue 19 of Oct, 2010 00:47 GMT lindarthebard from 32.172.136.135 27
Fri 15 of Oct, 2010 19:55 GMT lindarthebard from 32.174.46.157 26
Tue 25 of May, 2004 03:03 GMT admin from 64.81.49.171 Page unlocked 25
Tue 25 of May, 2004 03:03 GMT admin from 64.81.49.171 Page locked 24
Tue 25 of May, 2004 03:03 GMT admin from 64.81.49.171 Page unlocked 23
Tue 25 of May, 2004 03:03 GMT admin from 64.81.49.171 22
Sun 14 of Mar, 2004 08:51 GMT admin from 67.101.149.154 21
Fri 30 of Jan, 2004 18:59 GMT rab.spir from 24.128.38.52 20
Fri 30 of Jan, 2004 04:48 GMT rab.spir from 18.208.0.57 19
Fri 30 of Jan, 2004 04:47 GMT rab.spir from 18.208.0.57 18
Fri 30 of Jan, 2004 04:41 GMT rab.spir from 18.208.0.57 17
Sat 03 of Jan, 2004 22:00 GMT admin from 64.81.49.216 16
Sat 03 of Jan, 2004 22:00 GMT admin from 64.81.49.216 15
Thu 13 of Nov, 2003 22:14 GMT rab.spir from 18.208.0.57 14
Thu 13 of Nov, 2003 22:17 GMT arj from 129.241.210.192 12
Thu 13 of Nov, 2003 22:14 GMT rab.spir from 18.208.0.57 11
Thu 13 of Nov, 2003 22:02 GMT rab.spir from 18.208.0.57 10
Thu 13 of Nov, 2003 18:05 GMT rab.spir from 18.54.0.42 9
Thu 13 of Nov, 2003 04:27 GMT rab.spir from 18.208.0.57 8
Thu 13 of Nov, 2003 04:23 GMT rab.spir from 18.208.0.57 7
Thu 13 of Nov, 2003 00:49 GMT rab.spir from 18.208.0.57 6
Thu 13 of Nov, 2003 00:46 GMT rab.spir from 18.208.0.57 5
Wed 12 of Nov, 2003 20:37 GMT arj from 129.241.210.216 Fixed reversed order of URL/text in external link. My bad. 4