A dataset containing the original catalogue of the languages of the world, including genealogical affiliation, macro-area, countries, ISO code, and coordinates.

Usage

glottolog

Format

A data frame with 26879 rows and 10 variables:

glottocode

languoid code from Glottolog 5.0

language

name of the language

iso

code based on ISO 639-3 (https://iso639-3.sil.org/)

level

languoid type; possible values are dialect, language, family, bookkeeping, pseudo family, sign language, unclassifiable, pidgin, unattested, artificial language, speech register, mixed language

area

macro-area; one of six values: Africa, Australia, Eurasia, North America, Papunesia, South America

latitude

latitude

longitude

longitude

countries

list of countries where the language is spoken

affiliation

genealogical affiliation

subclassification

subclassification in Newick format
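
The columns described above lend themselves to ordinary data frame subsetting. The following is a minimal sketch in base R, not taken from the package itself. To stay runnable without any package installed, it builds a small stand-in data frame named `toy_glottolog` (a hypothetical name) whose rows are made-up placeholders rather than real Glottolog entries; only the column names and the `level` and `area` values follow the description above.

```r
# Toy stand-in for the `glottolog` data frame: column names follow the
# documented format, but every row is a made-up placeholder.
toy_glottolog <- data.frame(
  glottocode = c("aaaa0001", "aaaa0002", "bbbb0001", "cccc0001"),
  language   = c("Language A", "Dialect A1", "Language B", "Family C"),
  iso        = c("aaa", NA, "bbb", NA),
  level      = c("language", "dialect", "language", "family"),
  area       = c("Africa", "Africa", "Eurasia", "Papunesia"),
  latitude   = c(1.5, 1.7, 55.0, NA),
  longitude  = c(30.2, 30.4, 37.6, NA),
  stringsAsFactors = FALSE
)

# Keep only rows whose languoid type is "language".
languages_only <- toy_glottolog[toy_glottolog$level == "language", ]

# Languages in one macro-area that also have both coordinates.
african_languages <- subset(
  toy_glottolog,
  level == "language" & area == "Africa" &
    !is.na(latitude) & !is.na(longitude)
)

# Number of languoids per macro-area.
area_counts <- table(toy_glottolog$area)

print(african_languages$glottocode)
print(area_counts)
```

Assuming the package that provides this dataset is attached, the same expressions should apply with `glottolog` in place of `toy_glottolog`. Checking for missing coordinates before mapping is a precaution here, since higher-level languoids such as families may have no single location.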

Details

Hammarström, Harald and Forkel, Robert and Haspelmath, Martin and Bank, Sebastian. 2023. Glottolog 5.0. Leipzig: Max Planck Institute for Evolutionary Anthropology. https://doi.org/10.5281/zenodo.10804357 (Available online at http://glottolog.org, accessed on 2024-03-12.)