Standardize vocabulary #7

JG-QuarkNet · 2021-04-01T17:58:39Z

In 2019, the CMS data used by iSpy was reorganized from a "flat" 10,000 events into a set of "tranches" of data tailored to student groups of different sizes.

The alterations Joel made to CIMA to accommodate this did not keep a consistent vocabulary of labels and variable names in reference to this new data organization scheme. In addition, the original program contained several unclear or misleading variable names.

Labels and variable names should be standardized. This was Joel's working template for syntax at the time he made the changes:

datablock: 5,10,25,50,100
dataset: 5.1, 10.6, etc.
dataset id: dataset -> [1,190]
dataset number: 1, ..., 100
dataset index: 5.1-4, 10.6-55, etc.
unique id: (int)[(string)(dataset id) + (string,3)(dataset number)]
(replaces "flat" event_id: [1,10000])

He did not consistently apply these, though, and we've fallen into different usages since this change. Tom suggested "group" instead of "datablock", but "group" was already used by CIMA's previous data system and Joel wanted to avoid confusion. Ken has also started using "data file" for what's labeled here as "dataset," which seems fine.

The text was updated successfully, but these errors were encountered:

JG-QuarkNet mentioned this issue Apr 1, 2021

Review and revise Admin workflow #17

Open

JG-QuarkNet added the PHP label Apr 1, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Standardize vocabulary #7

Standardize vocabulary #7

JG-QuarkNet commented Apr 1, 2021 •

edited

Loading

Standardize vocabulary #7

Standardize vocabulary #7

Comments

JG-QuarkNet commented Apr 1, 2021 • edited Loading

JG-QuarkNet commented Apr 1, 2021 •

edited

Loading