---
title: "Advanced Tips"
subtitle: "Data4All"
author: "Ted Laderas, PhD"
format:
  live-html:
    scrollable: true
    toc-location: left
engine: knitr
webr:
  render-df: paged-table
  packages:
    - ggplot2
    - dplyr
    - DT
  resources:
    - data
pyodide:
  render-df: paged-table
  resources:
    - data
  packages:
    - openpyxl
    - pandas
---

## No Calculations in the Raw Data Files

- Keep the raw data raw, and read-only when possible
- Make backups (see below)

## Make Backups

- Avoid modifying raw data in place
- Version the file when you add more data
- Write-protect the files
- Make copies for analysis and for yourself
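
The backup habits above can also be scripted. The following is a minimal sketch using only the Python standard library; the function name `back_up_raw_file` and the dated file-name scheme are assumptions made for illustration, not part of any tool mentioned here.

```python
import shutil
import stat
from datetime import date
from pathlib import Path


def back_up_raw_file(raw_path, backup_dir, version_date=None):
    """Copy raw_path into backup_dir under a dated name and make the copy read-only."""
    raw_path = Path(raw_path)
    backup_dir = Path(backup_dir)
    backup_dir.mkdir(parents=True, exist_ok=True)

    # Dated file name such as survey_2024-01-02.csv (hypothetical naming scheme)
    stamp = (version_date or date.today()).isoformat()
    backup_path = backup_dir / f"{raw_path.stem}_{stamp}{raw_path.suffix}"

    # Refuse to overwrite an existing backup
    if backup_path.exists():
        raise FileExistsError(f"{backup_path} already exists")

    # copy2 copies the contents and keeps the file's timestamps
    shutil.copy2(raw_path, backup_path)

    # Write-protect the copy: read permission only for owner, group, and others
    backup_path.chmod(stat.S_IRUSR | stat.S_IRGRP | stat.S_IROTH)
    return backup_path
```

Calling `back_up_raw_file("data/survey.csv", "backups")` each time new data is added would leave one dated, write-protected copy per version, while the analysis works on a separate copy.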

## Use Data Validation to Avoid Errors

- Select a column, then **Data >> Validation**
- Think about what data type each column should be
    - A whole number in some range
    - A decimal number in some range
    - A list of possible values
    - Text, but with a limit on length
- Specify missing values consistently
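
The same kinds of rules can be checked after the fact on data that has already been entered. This is a rough sketch with pandas; the column names (`age`, `weight_kg`, `site`, `sample_id`), the ranges, and the allowed values are all hypothetical and would need to match your own sheet.

```python
import pandas as pd


def find_invalid_rows(df):
    """Return a boolean Series that is True for rows breaking any rule."""
    # Whole number in some range (missing or non-numeric entries are flagged too)
    age = pd.to_numeric(df["age"], errors="coerce")
    bad_age = age.isna() | (age % 1 != 0) | ~age.between(0, 120)

    # Decimal number in some range
    weight = pd.to_numeric(df["weight_kg"], errors="coerce")
    bad_weight = weight.isna() | ~weight.between(1.0, 300.0)

    # One of a list of possible values
    bad_site = ~df["site"].isin(["north", "south"])

    # Text, but with a limit on length
    bad_id = df["sample_id"].astype(str).str.len() > 8

    return bad_age | bad_weight | bad_site | bad_id
```

Filtering with `df[find_invalid_rows(df)]` lists the rows to review by hand. Note that this sketch treats a missing number as a problem; if your sheet uses a single agreed code for missing values, handle that code before running the checks.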

## Save the Data in Plain Text Files

- Excel: **File >> Save As >> CSV UTF-8**
- Google Sheets: **File >> Download >> CSV**
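
If you prefer to script the export instead of using the menus, pandas can do the conversion. This is a sketch under assumptions: the helper names `write_plain_csv` and `excel_to_csv` are made up for illustration, and reading `.xlsx` files requires the `openpyxl` package.

```python
import pandas as pd


def write_plain_csv(df, csv_path):
    """Write a data frame as comma-separated UTF-8 text without the row index."""
    df.to_csv(csv_path, index=False, encoding="utf-8")


def excel_to_csv(xlsx_path, csv_path, sheet_name=0):
    """Read one worksheet from an Excel workbook and save it as CSV."""
    # sheet_name=0 selects the first worksheet in the workbook
    df = pd.read_excel(xlsx_path, sheet_name=sheet_name)
    write_plain_csv(df, csv_path)
```

Only one worksheet is exported per call, since a CSV file holds a single table. Passing `encoding="utf-8-sig"` to `to_csv` instead adds a byte-order mark, which can help Excel recognise the file as UTF-8 when it is reopened.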