This is the GitHub repository for the Tools for Reproducible Workflows course. This course explores a variety of tools that can assist with reproducible data analysis from a broad range of fields. This course was funded as part of a series of courses in the Training Module for Reproducible Data Science Research project.
Except where otherwise indicated, the contents of this course are available for use under the Creative Commons Attribution 4.0 license. You are free to adapt and share the work, but you must give appropriate credit, provide a link to the license, and indicate if changes were made. Sample attribution: Tools for Reproducible Workflows in R by Fred Hutchinson Data Science Lab and University of Texas, Austin (CC-BY 4.0). You can download the illustrations by clicking here.
This course will demonstrate how to:
- Explain best practices for making analyses more reproducible and transparent
- Use special features in RStudio for efficiency and reproducibility
- Configure and organize projects for data analysis using the here package and the ProjectTemplate package
- Create reproducible reports using RMarkdown and Quarto
- Write custom functions for reuse of code
- Test functions with the testthat package
- Setup and use Git and GitHub to track changes over time 8. Share data and code publicly
This course was built with OTTR
All materials in this course are licensed under a Creative Commons Attribution 4.0 International License unless noted otherwise.