Jump to content

Wiktionary:Language families

From Wiktionary, the free dictionary
(Redirected from Wiktionary:Families)
For a list of all language family codes, see Wiktionary:List of families.
See Wiktionary:Languages and Wiktionary:Dialects for discussions of languages and of dialects, respectively.

Wiktionary sorts languages into families. Most families are related through descent from a common ancestor, but a few are merely categories, such as "creoles and pidgins". Wiktionary organizes language categories and derivations categories by families. Each family is represented on Wiktionary by a name and by a code.

Family codes

Wiktionary represents families by codes. Family codes are stored in Module:families along with related data. These codes are generally three letters, or three letters followed by a hyphen followed by another three letters. Exceptionally, they may be other strings. Family codes can reasonably be used in etymology templates like {{der}}, {{bor}} and {{inh}} and in the reconstruction namespace.

Genetic families

Genetic families are groups of languages which have a common ancestry. Wiktionary follows prevailing scholarship when grouping languages into genetic categories. Each genetic family has a code.

Many genetic families have an ISO 639-5 code. When one is available, it is used on Wiktionary as well. For example:

  • Austroasiatic languages (aav);
  • Celtic languages (cel);
  • Germanic languages (gem).

When a family lacks a standard ISO 639-5 code, but one of its superfamilies does have a code, Wiktionary assigns it a two part exceptional code. The first part is the ISO 639-5 code of its nearest superfamily, and the second part is a series of three lowercase letters which generally approximate the name of the family. For example:

  • Pama-Nyungan (aus-pam) — "aus" is the ISO 639-5 code for Australian languages, "pam" abbreviates "Pama-Nyungan";
  • Brythonic languages (cel-bry);
  • Jewish Aramaic (sem-jar);
  • South Bird's Head (ngf-sbh).

When neither a family nor any of its superfamilies has an ISO 639-5 code, the special code "qfa" is used as the first part. The qxx code range is allowed by the ISO for private use, the latter part fa standing for "family". For example:

  • Misumalpan languages (qfa-min)

Non-genetic categories and isolates

Not all language groupings are genetic. Some groups contain languages with other common properties, or languages spoken in a certain area. The following are recognised:

  • art — Artificial or constructed languages
  • crp — Creole or pidgin languages
  • sgn — Sign languages
  • qfa-mixMixed languages
  • qfa-isoLanguage isolates
  • qfa-not — Languages that do not belong to a family in principle, because they are not true languages. "Translingual" (mul), "Undetermined" (und) and "substrate" (qfa-sub) are not languages, and cannot be considered members of any family.
    • This code is also used for groupings of languages which are treated on Wiktionary as families, but which are not related. The preceding special family codes themselves also use this code.

Family names

  • All family names should be defined as English words in their respective Wiktionary entries.
  • Each family is consistently referred by only one name, to be used in etymologies and elsewhere. When there are two or more possible names, one of them is chosen.
  • For clarity, avoid using the same name for both a family and a language if possible; for instance, a certain family may be called Japanese or Japonic, but since there is already a Japanese language, Japonic is a better name for the family. (However, some homonymy cannot be avoided: for example, there is both a Kipchak language and a Kipchak language family.)
  • Multiple names of families should also eventually be listed in the sections synonyms or alternative forms at each entry.

Discussions