Name	Name	Last commit message	Last commit date
parent directory ..
.gitattributes	.gitattributes
README.md	README.md
StructureDataset-metadata.json	StructureDataset-metadata.json
codes.csv	codes.csv
data.csv	data.csv
requirements.txt	requirements.txt
societies.csv	societies.csv
sources.bib	sources.bib
variables.csv	variables.csv

StructureDataset D-PLACE dataset derived from Murdock and White 1969 'Standard Cross-Cultural Sample'

CLDF Metadata: StructureDataset-metadata.json

property	value
dc:bibliographicCitation	Murdock GP & White DR. 1969. Standard Cross-Cultural Sample. Ethnology. 9:329–369.
dc:conformsTo	CLDF StructureDataset
dc:identifier	https://escholarship.org/uc/item/62c5c02n
dc:license	https://creativecommons.org/licenses/by-nc/4.0/
dcat:accessURL	https://github.com/D-PLACE/dplace-dataset-sccs
prov:wasDerivedFrom	HRAF Cross Cultural Concordance 2020 D-PLACE/dplace-dataset-sccs v3.2 Glottolog v5.2
prov:wasGeneratedBy	python: 3.12.3 python-packages: requirements.txt
rdf:ID	dplace-dataset-sccs
rdf:type	http://www.w3.org/ns/dcat#Distribution

Table data.csv

Values are coded datapoints, i.e. measurements of a variable for a society.

Note: Missing data is signaled by an empty Value column.

property	value
dc:conformsTo	CLDF ValueTable
dc:extent	329449

Columns

Name/Property	Datatype	Description
ID	`string` Regex: `[a-zA-Z0-9_\-]+`	Primary key
Soc_ID	`string`	References societies.csv::ID
Var_ID	`string`	References variables.csv::ID
Value	`string`	Values for categorical and ordinal variables reference the corresponding code via the Code_ID column. Values for continuous variables have the measured number in the Value column and an empty Code_ID.
Code_ID	`string`	References codes.csv::ID
Comment	`string`
Source	list of `string` (separated by `;`)	References sources.bib::BibTeX-key
`sub_case`	`string`	More specific description of the population the data refer to in terms of society or area.
`year`	`string` Regex: `-?[0-9]{1,4}(-[0-9]{4})?`	Focal year, i.e. the time period to which the data refer.
`source_coded_data`	`string`	The source of the coded data, which was aggregated in this dataset.
`admin_comment`	`string`

We use the term “society” to refer to cultural groups. In most cases, a society can be understood to represent a group of people at a focal location with a shared language that differs from that of their neighbors. However, in some cases multiple societies share a language.

property	value
dc:conformsTo	CLDF LanguageTable
dc:extent	186

Columns

Name/Property	Datatype	Description
ID	`string` Regex: `[a-zA-Z0-9_\-]+`	Primary key
Name	`string`
Latitude	`decimal` ≥ -90 ≤ 90
Longitude	`decimal` ≥ -180 ≤ 180
Glottocode	`string` Regex: `[a-z0-9]{4}[1-9][0-9]{3}`
`Name_and_ID_in_source`	`string`	Society names identified as pejorative have been replaced with a preferred, English-language ethnonym. The name (and ID) as given in the source dataset is kept in this field.
`xd_id`	`string`	“cross-data-set” identifier, used to link societies present in different datasets, if they share a focal location. Note: If this field is empty, other fields such as Name, Glottocode, focal year and location may be used to identify societies across datasets if appropriate.
`alt_names_by_society`	list of `string` (separated by `;` )	A list of ‘alternate’ names for the society; includes, where available, one or more autonyms in the society’s own language, as well as other commonly encountered ethnonyms.
`main_focal_year`	`integer`	Focal year specifying the time period to which the data refer, given as number of years BCE - if negative - or CE.
`HRAF_name_ID`	`string`	Name(s) and ID(s) of the corresponding society in HRAF (the Human Relations Area Files)
`HRAF_ID`	`string`	ID of the corresponding society in HRAF
`origLat`	`decimal` ≥ -90 ≤ 90	Uncorrected latitude as given in the source.
`origLong`	`decimal` ≥ -270 ≤ 180	Uncorrected longitude as given in the source.
comment	`string`
`glottocode_comment`	`string`	Comment on the Glottocode assignment.
`region`	`string`	World Geographical Scheme for Recording Plant Distributions level2 region

Table variables.csv

Variables are cultural features or practices, or environmental descriptors.

property	value
dc:conformsTo	CLDF ParameterTable
dc:extent	1781

Columns

Name/Property	Datatype	Description
ID	`string` Regex: `[A-Za-z.0-9_]+([0-9]+)?`	Primary key
Name	`string`
Description	`string`
ColumnSpec	`json`
`category`	list of `string` (separated by `,` )
`type`	`string` Valid choices: `Continuous` `Categorical` `Ordinal`	Variables may be categorical (and then must be accompanied by a list of possible ‘codes’, i.e. rows in Codetable. Variables can also be continuous (e.g. Population size) or ordinal. Ordinal variables are accompanied by a list of codes (like categorical variables). The order of codes is encoded as `ord` column in CodeTable.
`unit`	`string`	The unit of measurement
`source_comment`	`string`	A note about the source of this variable.
`changes`	`string`	Notes about how a variable may have been derived from the source.
comment	`string`

Table codes.csv

property	value
dc:conformsTo	CLDF CodeTable
dc:extent	10140

Columns

Name/Property	Datatype	Description
ID	`string` Regex: `[a-zA-Z0-9_\-]+`	Primary key
Var_ID	`string`	The parameter or variable the code belongs to. References variables.csv::ID
Name	`string`
Description	`string`
`ord`	`integer`

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

StructureDataset D-PLACE dataset derived from Murdock and White 1969 'Standard Cross-Cultural Sample'

Table data.csv

Columns

Table societies.csv

Columns

Table variables.csv

Columns

Table codes.csv

Columns

FilesExpand file tree

cldf

Directory actions

More options

Directory actions

More options

Latest commit

History

cldf

Folders and files

parent directory

README.md

StructureDataset D-PLACE dataset derived from Murdock and White 1969 'Standard Cross-Cultural Sample'

Table data.csv

Columns

Table societies.csv

Columns

Table variables.csv

Columns

Table codes.csv

Columns