If you use these data please cite
- the original source
Ross, Pawley and Osmond. The lexicon of Proto Oceanic
- the derived dataset using the DOI of the particular released version you were using
This dataset is licensed under a CC-BY-4.0 license
Available online at http://hdl.handle.net/1885/106908
- Varieties: 755 (linked to 641 different Glottocodes)
- Concepts: 19,936 (linked to 0 different Concepticon concept sets)
- Lexemes: 43,026
- Sources: 584
- Synonymy: 1.02
- Cognacy: 40,925 cognates in 2,820 cognate sets (0 singletons)
- Cognate Diversity: -0.74
- Invalid lexemes: 0
- Tokens: 203,058
- Segments: 294 (119 BIPA errors, 119 CLTS sound class errors, 175 CLTS modified)
- Inventory size (avg): 16.15
- Entries missing sources: 40669/43026 (94.52%)
| Name | GitHub user | Role |
|---|---|---|
| Malcolm Ross | @malcolm42 | Author, Editor |
| Andrew Pawley | Author, Editor | |
| Meredith Osmond | Author, Editor | |
| Roger Green | Author | |
| Frantisek Lichtenberk | Author | |
| Medina Pawley | Author | |
| Ross Clark | Author | |
| Bethwyn Evans | Author | |
| Jeffrey C. Marck | Author | |
| Per Hage | Author | |
| Mae Carroll | DataCurator | |
| Robert Forkel | @xrotwang | maintainer, DataCurator |
The following CLDF datasets are available in cldf:
- CLDF Wordlist at cldf/cldf-metadata.json