Skip to content
Merged
Show file tree
Hide file tree
Changes from 1 commit
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
3 changes: 2 additions & 1 deletion concepticondata/conceptlists.tsv
Original file line number Diff line number Diff line change
Expand Up @@ -472,4 +472,5 @@ Soederholm-2013-420 Söderholm, Carina and Häyry, Emilia and Laine, Matti and K
Amsel-2012-559 Amsel, Ben D. and Urbach, Thomas P. and Kutas, Marta 2012 559 ratings English English https://doi.org/10.3758/s13428-012-0215-z Amsel2012 This list of 559 object concepts was rated for perceptual modalities (color, motion, auditory, olfactory, gustatory, graspability, pain) as well as familiarity. Ratings were given on an 8-point scale. 1028-1041
Eilola-2010-210 Eilola, Tiina M. and Havelka, Jelena 2010 210 ratings English, Finnish English, Finnish https://doi.org/10.3758/BRM.42.1.134 Eilola2010 This list of 210 words contains ratings of familiarity, valence, emotionality, offensiveness, and concreteness for Finnish and British English nouns, including 34 taboo words. Ratings were provided by native speakers of each language. For British English in particular, the aim of the study was to collect data comparable to the American English norms in the Affective Norms for English Words database [(Bradley & Lang, 1999)](:bib:Bradley1999). The present mappings were based on the English words. 134-140
Pache-2023-207 Pache, Matthias 2023 207 basic English Chibchan https://doi.org/10.1086/722240 Pache2023 This list was used for a comparative analysis of the Chibchan languages with the aim of revising their internal genealogical classification. The author claims that the list represents the Swadesh 207 list, however, it is unclear which list is meant exactly, since Swadesh never published a list containing 207 words. The list is likely very similar to [Comrie(1977)](:bib:Comrie1977) but uses slightly different glosses. The data for the Chibchan languages was gathered from existing sources on various Chibchan languages. 81-103
Guenther-2022-30 Günther, Fritz and Rinaldi, Luca 2022 30 norms English global https://doi.org/10.1038/s41598-022-12027-5 Guenther2022 This dataset combines cross-linguistic lexical frequency information for body-part terms with anatomical and neurobiological measures of body representation. For each concept, lexical forms are provided for a wide range of languages (Amharic, Arabic, Bengali, Chinese, Croatian, Czech, Dutch, English, German, Greek, Hebrew, Hindi, Hungarian, Italian, Japanese, Latin, Latvian, Malay, Polish, Portuguese, Russian, Somali, Spanish, Swahili, Tagalog, Tamil, Turkish, Urdu, and Yoruba). For each language, separate variables encode the base form, the plural form, overall corpus frequency, and frequency per million words, derived from language-specific corpora. Additionally, the list contains measures of cortical representational size and physical body surface area, which is encoded using anterior–posterior (AP) distinctions. Further, different definitions of "arm" were used: one unit spanning shoulder to wrist (with the hand treated separately), or upper arm (shoulder to elbow) and forearm (elbow to wrist) treated separetely. 1-13
Guenther-2022-30 Günther, Fritz and Rinaldi, Luca 2022 30 norms English global https://doi.org/10.1038/s41598-022-12027-5 Guenther2022 This dataset combines cross-linguistic lexical frequency information for body-part terms with anatomical and neurobiological measures of body representation. For each concept, lexical forms are provided for a wide range of languages (Amharic, Arabic, Bengali, Chinese, Croatian, Czech, Dutch, English, German, Greek, Hebrew, Hindi, Hungarian, Italian, Japanese, Latin, Latvian, Malay, Polish, Portuguese, Russian, Somali, Spanish, Swahili, Tagalog, Tamil, Turkish, Urdu, and Yoruba). For each language, separate variables encode the base form, the plural form, overall corpus frequency, and frequency per million words, derived from language-specific corpora. Additionally, the list contains measures of cortical representational size and physical body surface area, which is encoded using anterior–posterior (AP) distinctions. Further, different definitions of "arm" were used: one unit spanning shoulder to wrist (with the hand treated separately), or upper arm (shoulder to elbow) and forearm (elbow to wrist) treated separetely. 1-13
Oswalt-1971-100 Oswalt, Robert L. 1971 100 basic, relations English Athapaskan, Pomo, Eastern Austronesian, Japanese, Coast Salish https://www.jstor.org/stable/30029088 Oswalt1971 This list contains combined and individual Referential Similarity Index (RSI) values calculated across multiple linguistic families: Athapaskan, Pomo, Eastern Austronesian, Japanese and Coast Salish. Instead of averaging the family-level RSI scores, the combined value is calculated by first adding up the families’ underlying values and only then computing the final index. This means larger and more internally diverse families contribute more to the result than smaller ones. 421-434
2 changes: 1 addition & 1 deletion concepticondata/conceptlists/Dellert-2017-1016.tsv
Original file line number Diff line number Diff line change
@@ -1,4 +1,4 @@
ID NUMBER RANK STABILITY_SCORE GERMAN NELEX_ID ENGLISH RUSSIAN GERMAN_ANNOTATION ENGLISH_ANNOTATION RUSSIAN_ANNOTATION CONCEPTICON_GLOSS CONCEPTICON_ID NOTE
ID NUMBER RANK NORTH_EURASIAN_STABILITY_SCORE GERMAN NELEX_ID ENGLISH RUSSIAN GERMAN_ANNOTATION ENGLISH_ANNOTATION RUSSIAN_ANNOTATION CONCEPTICON_GLOSS CONCEPTICON_ID NOTE
Dellert-2017-1016-1 1 44 -2.239237 Auge Auge::N eye глаз Anatomie anatomy анатомия EYE 1248
Dellert-2017-1016-2 2 34 -2.249194 Ohr Ohr::N ear ухо Anatomie anatomy анатомия EAR 1247
Dellert-2017-1016-3 3 149 -1.195463 Nase Nase::N nose нос Anatomie anatomy анатомия NOSE 1221
Expand Down
Original file line number Diff line number Diff line change
Expand Up @@ -48,7 +48,7 @@
},
{
"datatype": "string",
"name": "STABILITY_SCORE"
"name": "NORTH_EURASIAN_STABILITY_SCORE"
},
{
"dc:description": "gloss in GERMAN",
Expand Down
101 changes: 101 additions & 0 deletions concepticondata/conceptlists/Oswalt-1971-100.tsv
Original file line number Diff line number Diff line change
@@ -0,0 +1,101 @@
ID NUMBER CONCEPTICON_ID CONCEPTICON_GLOSS ENGLISH COMBINED_STABILITY_SCORE ATHAPASKAN_STABILITY_SCORE POMO_STABILITY_SCORE EASTERN_AUSTRONESIAN_STABILITY_SCORE JAPANESE_STABILITY_SCORE COAST_SALISH_STABILITY_SCORE
Oswalt-1971-100-1 1 667 ROAD road 97.2 92.9 100 100 100 100
Oswalt-1971-100-2 2 1498 TWO two 95 100 100 100 100 49.9
Oswalt-1971-100-3 3 1247 EAR ear 92.8 100 100 64.2 100 100
Oswalt-1971-100-4 4 1248 EYE eye 92.6 100 100 100 100 25.5
Oswalt-1971-100-5 5 1392 LOUSE louse 92.2 100 100 61.2 100 100
Oswalt-1971-100-6 6 1393 HORN (ANATOMY) horn 90.7 100 100 NA 100 25.5
Oswalt-1971-100-7 7 1215 THOU thou 90.7 100 100 100 7.5 100
Oswalt-1971-100-8 8 1380 TOOTH tooth 87.9 100 100 39.3 100 100
Oswalt-1971-100-9 9 227 FISH fish 85.7 100 100 100 7.5 49.9
Oswalt-1971-100-10 10 1209 I I 83.8 100 100 65.5 7.5 100
Oswalt-1971-100-11 11 948 WATER water 82.1 86.6 100 37.6 100 100
Oswalt-1971-100-12 12 1394 BONE bone 81.1 86.6 100 69.7 100 25.5
Oswalt-1971-100-13 13 1405 NAME name 80.6 100 100 47.7 100 11
Oswalt-1971-100-14 14 778 SMOKE (EXHAUST) smoke 79.2 86.6 100 67.4 100 11
Oswalt-1971-100-15 15 1408 HEAR hear 75.2 100 65.3 35.6 100 49.9
Oswalt-1971-100-16 16 1221 NOSE nose 73.6 73.2 100 21.9 100 100
Oswalt-1971-100-17 17 1416 SIT sit 73.4 100 100 13.5 7.5 100
Oswalt-1971-100-18 18 1212 WE we 73.3 74.4 100 64.2 7.5 100
Oswalt-1971-100-19 19 1224 LIVER liver 73.2 63.5 100 64.2 100 49.9
Oswalt-1971-100-20 20 857 STONE stone 70.6 59.3 100 72 100 25.5
Oswalt-1971-100-21 21 906 TREE tree 70.6 33.8 100 100 100 NA
Oswalt-1971-100-22 22 1411 LIE (REST) lie 70.3 100 100 22.8 7.5 49.9
Oswalt-1971-100-23 23 221 FIRE fire 69.4 89.1 100 17 100 3.5
Oswalt-1971-100-24 24 1040 HAIR hair 68.1 100 68.2 17 100 11
Oswalt-1971-100-25 25 674 MOUTH mouth 67 67.4 100 0 100 100
Oswalt-1971-100-26 26 1493 ONE one 66.4 100 30.9 38.4 100 25.5
Oswalt-1971-100-27 27 1442 STAND stand 64.8 48.9 74.3 100 100 3.5
Oswalt-1971-100-28 28 744 EGG egg 62.8 90.6 100 0 2.9 NA
Oswalt-1971-100-29 29 628 LEAF leaf 62.5 84.4 20 61.2 100 25.5
Oswalt-1971-100-30 30 1235 WHO who 62.4 NA 42.4 69.7 100 49.9
Oswalt-1971-100-31 31 1277 HAND hand 61 80 31.4 13.5 100 100
Oswalt-1971-100-32 32 937 BIRD bird 60.5 11.1 100 100 100 NA
Oswalt-1971-100-33 33 1401 DRINK drink 60.5 68.7 66.6 18.1 100 NA
Oswalt-1971-100-34 34 1333 NECK neck 58.8 84.5 11.9 13.3 100 100
Oswalt-1971-100-35 35 1301 FOOT foot 58.5 86.6 100 3 7.5 25.5
Oswalt-1971-100-36 36 323 FAT (ORGANIC SUBSTANCE) fat 58.3 64.4 100 0 100 25.5
Oswalt-1971-100-37 37 1203 LONG long 56.8 81.2 44.5 14.4 100 25.5
Oswalt-1971-100-38 38 1409 SEE see 55.2 100 26.1 0 100 0
Oswalt-1971-100-39 39 1220 TAIL tail 54.9 91.3 72.2 3 7.5 25.5
Oswalt-1971-100-40 40 1233 NIGHT night 54.2 23.5 100 61.2 100 25.5
Oswalt-1971-100-41 41 683 PERSON person 53.5 48.1 66.6 28.1 100 NA
Oswalt-1971-100-42 42 1458 SAY say 52.6 100 0 0 100 25.5
Oswalt-1971-100-43 43 1228 EARTH (SOIL) earth 51.3 78.8 39.6 5.6 7.5 100
Oswalt-1971-100-44 44 1202 BIG big 50.9 100 42.2 6.7 7.5 3.5
Oswalt-1971-100-45 45 1205 TONGUE tongue 50.9 32.3 100 35.9 7.5 100
Oswalt-1971-100-46 46 1256 HEAD head 50.7 90.8 29.9 12.9 7.5 49.9
Oswalt-1971-100-47 47 1035 GOOD good 50.2 62.5 66.6 5.4 7.5 100
Oswalt-1971-100-48 48 1494 DIE die 49.7 34.3 17 100 100 25.5
Oswalt-1971-100-49 49 1214 THIS this 49.5 53.3 65.0 10.3 100 31.6
Oswalt-1971-100-50 50 1430 STAR star 49.5 69.1 7.1 39.3 100 25.5
Oswalt-1971-100-51 51 1198 MANY many 49.1 86.6 68.2 0 4.5 3.5
Oswalt-1971-100-52 52 1402 BREAST breast 48.7 NA 47.4 NA 100 0
Oswalt-1971-100-53 53 1489 CLOUD cloud 48.5 37.4 72.2 32.6 100 25.5
Oswalt-1971-100-54 54 646 ASH ashes 47.3 18.2 65.3 61.2 100 NA
Oswalt-1971-100-55 55 72 CLAW claw 45.7 NA 39.6 NA 100 3.5
Oswalt-1971-100-56 56 1335 WHITE white 44.6 52.5 17.9 0 100 100
Oswalt-1971-100-57 57 671 SAND sand 45.5 46.4 39.7 39.3 100 11
Oswalt-1971-100-58 58 1204 BARK bark 45.3 6.4 100 32.6 100 61.7
Oswalt-1971-100-59 59 1336 EAT eat 43.8 48.1 29.1 61.2 3.3 61.7
Oswalt-1971-100-60 60 946 BLOOD blood 43.3 47.1 65.3 14.4 100 25.5
Oswalt-1971-100-61 61 1371 KNEE knee 43 80.2 2.4 NA 7.5 11
Oswalt-1971-100-62 62 1253 RAIN (RAINING) rain 42 36 42.4 32.6 100 25.5
Oswalt-1971-100-63 63 163 BLACK black 40.9 58.2 25.4 0 100 25.5
Oswalt-1971-100-64 64 1201 FEATHER feather 40.9 40.7 47.4 13.1 100 25.5
Oswalt-1971-100-65 65 763 SKIN skin 40 41.2 22.1 32.6 100 25.5
Oswalt-1971-100-66 66 2009 DOG dog 39.9 54.9 0 14.9 100 49.9
Oswalt-1971-100-67 67 1231 NEW new 39.7 NA 66.6 37.6 4.5 25.5
Oswalt-1971-100-68 68 1443 WALK walk 39 NA 65.3 0 100 3.5
Oswalt-1971-100-69 69 1287 COLD cold 38.3 82.3 8.4 12.9 7.5 3.5
Oswalt-1971-100-70 70 78 THAT that 37.6 26.3 25.1 10.3 100 100
Oswalt-1971-100-71 71 962 WOMAN woman 36.2 14.5 15.7 36.2 100 100
Oswalt-1971-100-72 72 1417 KILL kill 36.1 34.4 2.4 41.1 100 NA
Oswalt-1971-100-73 73 1585 SLEEP sleep 35.8 21.1 66.6 32.6 49.9 25.5
Oswalt-1971-100-74 74 1223 HEART heart 34.3 62.1 26.5 5.9 4.5 25.5
Oswalt-1971-100-75 75 1395 ROUND round 34.3 NA 5.8 NA 100 25.5
Oswalt-1971-100-76 76 1236 WHAT what 33.7 NA 22.7 24.5 7.5 100
Oswalt-1971-100-77 77 634 MEAT meat 33.1 50.4 39.7 6.7 4.5 31.6
Oswalt-1971-100-78 78 1313 MOON moon 32.6 11.1 41.9 NA 100 NA
Oswalt-1971-100-79 79 1403 BITE bite 31.7 26.5 100 0 7.5 3.5
Oswalt-1971-100-80 80 670 ROOT root 30.7 NA 7.7 9.4 100 49.9
Oswalt-1971-100-81 81 1240 NOT not 30.4 NA 25.9 0 100 NA
Oswalt-1971-100-82 82 2102 BURN burn 29.6 33.6 17.9 0 100 25.5
Oswalt-1971-100-83 83 1439 SWIM swim 29.5 35.7 7.4 17.0 100 3.5
Oswalt-1971-100-84 84 1286 HOT hot 29.4 15.3 45.4 21.1 100 0
Oswalt-1971-100-85 85 1446 COME come 28.5 NA 35.6 0 100 0
Oswalt-1971-100-86 86 639 MOUNTAIN mountain 27.8 12.2 65.3 17.7 35 NA
Oswalt-1971-100-87 87 156 RED red 27.1 25.2 25.4 7.9 100 3.5
Oswalt-1971-100-88 88 1425 GREEN green 26.9 NA 17.9 0 100 25.5
Oswalt-1971-100-89 89 1343 SUN sun 26.7 27.3 72.2 3 7.5 0
Oswalt-1971-100-90 90 1398 DRY dry 26.3 46 4.6 6.4 7.5 49.9
Oswalt-1971-100-91 91 1441 FLY (MOVE THROUGH AIR) fly 26.1 NA 26.5 0 100 3.5
Oswalt-1971-100-92 92 1447 GIVE give 24.6 NA 65.3 3 7.5 3.5
Oswalt-1971-100-93 93 1251 BELLY belly 22.5 NA 24.6 24.6 4.5 31.6
Oswalt-1971-100-94 94 3626 KNOW know 22.5 26 3.6 6.7 100 0
Oswalt-1971-100-95 95 714 SEED seed 17.3 NA 0 0 100 3.5
Oswalt-1971-100-96 96 98 ALL all 16 NA 0 3 64.2 25.5
Oswalt-1971-100-97 97 1246 SMALL small 15.7 17.8 37.2 0 7.5 3.5
Oswalt-1971-100-98 98 1429 FULL full 15.4 32.3 0 NA 4.5 25.5
Oswalt-1971-100-99 99 1424 YELLOW yellow 15.2 NA 5.8 0 64.2 NA
Oswalt-1971-100-100 100 1554 MAN man 13.6 NA 24.2 0 7.5 25.5
96 changes: 96 additions & 0 deletions concepticondata/conceptlists/Oswalt-1971-100.tsv-metadata.json
Original file line number Diff line number Diff line change
@@ -0,0 +1,96 @@
{
"@context": [
"http://www.w3.org/ns/csvw",
{
"@language": "en"
}
],
"dialect": {
"encoding": "utf-8-sig",
"delimiter": "\t",
"skipBlankRows": true
},
"tables": [
{
"tableSchema": {
"columns": [
{
"datatype": {
"base": "string",
"format": "[a-zA-Z]+\\-[0-9]{4}\\-[0-9]+[a-z]?\\-[0-9]+[a-z]?$"
},
"name": "ID"
},
{
"datatype": {
"base": "string",
"format": "[0-9\\.]+([a-z\\\u2013]+)?$"
},
"name": "NUMBER"
},
{
"datatype": {
"base": "integer",
"minimum": "1"
},
"name": "CONCEPTICON_ID"
},
{
"datatype": "string",
"name": "CONCEPTICON_GLOSS"
},
{
"datatype": "string",
"name": "ENGLISH"
},
{
"name": "COMBINED_STABILITY_SCORE",
"null": "NA",
"datatype": {
"base": "float"
}
},
{
"name": "ATHAPASKAN_STABILITY_SCORE",
"null": "NA",
"datatype": {
"base": "float"
}
},
{
"name": "POMO_STABILITY_SCORE",
"null": "NA",
"datatype": {
"base": "float"
}
},
{
"name": "EASTERN_AUSTRONESIAN_STABILITY_SCORE",
"null": "NA",
"datatype": {
"base": "float"
}
},
{
"name": "JAPANESE_STABILITY_SCORE",
"null": "NA",
"datatype": {
"base": "float"
}
},
{
"name": "COAST_SALISH_STABILITY_SCORE",
"null": "NA",
"datatype": {
"base": "float"
}
}
],
"primaryKey": [
"ID"
]
},
"url": "Oswalt-1971-100.tsv"
}
]
}
2 changes: 1 addition & 1 deletion concepticondata/conceptlists/Petroni-2010-100.tsv
Original file line number Diff line number Diff line change
@@ -1,4 +1,4 @@
ID NUMBER ENGLISH CONCEPTICON_ID CONCEPTICON_GLOSS S_I RANK
ID NUMBER ENGLISH CONCEPTICON_ID CONCEPTICON_GLOSS INDO_EUROPEAN_STABILITY_SCORE RANK
Petroni-2010-100-1 1 You 1215 THOU 0.45395 1
Petroni-2010-100-2 2 Three 492 THREE 0.44102 2
Petroni-2010-100-3 3 Mother 1216 MOTHER 0.36627 3
Expand Down
Original file line number Diff line number Diff line change
Expand Up @@ -44,7 +44,7 @@
},
{
"datatype": "float",
"name": "S_I"
"name": "INDO_EUROPEAN_STABILITY_SCORE"
},
{
"datatype": "integer",
Expand Down
2 changes: 1 addition & 1 deletion concepticondata/conceptlists/Pozdniakov-2014-100b.tsv
Original file line number Diff line number Diff line change
@@ -1,4 +1,4 @@
ID NUMBER ENGLISH RANK STABILITY_SCORE CONCEPTICON_ID CONCEPTICON_GLOSS
ID NUMBER ENGLISH RANK ATLANTIC_STABILITY_SCORE CONCEPTICON_ID CONCEPTICON_GLOSS
Pozdniakov-2014-100b-1 1 eye 1 0.83 1248 EYE
Pozdniakov-2014-100b-2 2 head 2 0.76 1256 HEAD
Pozdniakov-2014-100b-3 3 ear 3 0.71 1247 EAR
Expand Down
Original file line number Diff line number Diff line change
Expand Up @@ -48,7 +48,7 @@
},
{
"datatype": {"base": "decimal", "minimum": 0, "maximum": 1},
"name": "STABILITY_SCORE"
"name": "ATLANTIC_STABILITY_SCORE"
}
],
"primaryKey": [
Expand Down
2 changes: 1 addition & 1 deletion concepticondata/conceptlists/Thomas-1960-168.tsv
Original file line number Diff line number Diff line change
@@ -1,4 +1,4 @@
ID NUMBER ENGLISH CATEGORY SWADESH_1955_STABILITY_SCORE STABILITY_SCORE CONCEPTICON_ID CONCEPTICON_GLOSS RANK
ID NUMBER ENGLISH CATEGORY SWADESH_1955_STABILITY_SCORE MON_KHMER_STABILITY_SCORE CONCEPTICON_ID CONCEPTICON_GLOSS RANK
Thomas-1960-168-1 1 and correlatives 57 86 1577 AND 39
Thomas-1960-168-2 2 because correlatives 25 43 1157 BECAUSE 139
Thomas-1960-168-3 3 far location 74 86 1406 FAR 40
Expand Down
Original file line number Diff line number Diff line change
Expand Up @@ -52,7 +52,7 @@
},
{
"datatype": {"base": "nonNegativeInteger", "maximum": 100},
"name": "STABILITY_SCORE"
"name": "MON_KHMER_STABILITY_SCORE"
},
{
"datatype": {"base": "positiveInteger", "minimum": 1, "maximum": 168},
Expand Down
11 changes: 10 additions & 1 deletion concepticondata/references/references.bib
Original file line number Diff line number Diff line change
Expand Up @@ -4964,4 +4964,13 @@ @article{Guenther2022
number = {8043},
pages = {1--13},
year = {2022}
}
}

@article{Oswalt1971,
title = {Towards the Construction of a Standard Lexicostatistic List},
author = {Oswalt, Robert L.},
journal = {Anthropological Linguistics},
volume = {13},
pages = {421--434},
year = {1971}
}