Sharpen your data science skills with this hands-on workshop on regression techniques in R.
This workshop will give you the practical skills and foundational knowledge to use several powerful regression models that data scientists apply to repeated-measures data. When data are collected on the same subjects repeatedly over time (for example, in clinical trials or cohort studies) or under different conditions (for example, in a designed experiment), measurements on the same individual are correlated, and the model must account for that correlation. At the workshop, we will consider several models for a normally distributed response variable: the random slope and intercept (mixed-effects) model, and generalized estimating equations (GEE) models with unstructured, autoregressive, compound symmetric (exchangeable), and independent working correlation matrices. All models will be run in R version 4.0.3.
The course will be structured as follows. For each part, we will first discuss the theory and then work through an example. After that, participants will work in small groups in breakout rooms on hands-on exercises that reinforce the material. All the files and RStudio will be made available to the participants.
The workshop will use RStudio Cloud. If you are not familiar with it, RStudio Cloud lets participants access RStudio through a web browser. The environment will be set up and loaded with the code and data that are needed, so participants can focus on building models.
The material covered by the workshop will be taken from my recently published book “Advanced Regression Models with SAS and R Applications”, CRC Press, 2018.
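The two model families named in the description above can be sketched in R roughly as follows. This is a minimal illustration and not the workshop's own code: it assumes the `Orthodont` data set that ships with `nlme` as a stand-in for the workshop data, and the object names (`ortho`, `fit_lme`, `gee_un`, and so on) are hypothetical.

```r
library(nlme)     # lme(): linear mixed-effects models
library(geepack)  # geeglm(): generalized estimating equations

# Stand-in data (an assumption, not the workshop data): Orthodont ships with
# nlme and holds a growth measurement (distance) taken on each child (Subject)
# at ages 8, 10, 12, and 14.
ortho <- as.data.frame(Orthodont)
ortho <- ortho[order(ortho$Subject), ]  # keep each subject's rows contiguous for geeglm()

# Random slope and intercept model: each child gets its own intercept and age slope.
fit_lme <- lme(distance ~ age, random = ~ 1 + age | Subject, data = ortho)

# GEE models for a normal response, one per working correlation structure.
gee_un   <- geeglm(distance ~ age, id = Subject, data = ortho,
                   family = gaussian, corstr = "unstructured")
gee_ar1  <- geeglm(distance ~ age, id = Subject, data = ortho,
                   family = gaussian, corstr = "ar1")
gee_exch <- geeglm(distance ~ age, id = Subject, data = ortho,
                   family = gaussian, corstr = "exchangeable")
gee_ind  <- geeglm(distance ~ age, id = Subject, data = ortho,
                   family = gaussian, corstr = "independence")

# Fixed-effect estimates from each fit, side by side.
estimates <- rbind(
  mixed_effects    = fixef(fit_lme),
  gee_unstructured = coef(gee_un),
  gee_ar1          = coef(gee_ar1),
  gee_exchangeable = coef(gee_exch),
  gee_independence = coef(gee_ind)
)
print(estimates)
```

Calling `summary()` on any of the fitted objects shows standard errors and the estimated correlation or variance parameters, which is where the working correlation structures differ most visibly.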
Dr. Olga Korosteleva is a professor of Statistics in the Department of Mathematics and Statistics at California State University, Long Beach (CSULB). She received her Bachelor’s degree in Mathematics from Wayne State University in Detroit in 1996 and a Ph.D. in Statistics from Purdue University in West Lafayette, Indiana, in 2002. Since then she has been teaching mostly Statistics courses in the Master’s program in Applied Statistics at CSULB, and loving it!
Dr. Olga is an undergraduate advisor for students majoring in Mathematics with an option in Statistics and the faculty supervisor for the Statistics Student Association. She is also the immediate past president of the Southern California Chapter of the American Statistical Association (SCASA), the editor-in-chief of SCASA’s monthly eNewsletter, and the author or co-author of four statistical books.
When: February 9, 2021
- Tuesday: 6:30 PM - 9:45 PM
Where:
This event will be held on Zoom. You will need a Zoom account in order to join. Before the event, the Zoom link will be emailed to you.
Registration
- Cost: $10
- Register through Eventbrite
- All participants must register for the event.
- All participants must abide by the OCRUG Code of Conduct, including the R Consortium and the R Community Code of Conduct.
You will need Zoom installed on your computer and a Zoom account. The Zoom connection information is:
- https://oracle.zoom.us/j/93822369651?pwd=ZE9HUGdTVk40MDhabnE4aFRHQlQ3UT09
- Meeting ID: 938 2236 9651
- Password: 63282977
- Dial by your location: +1 669 900 6833 US (San Jose)
- Find your local number: https://oracle.zoom.us/u/abCl13paS
You have two options for working with the code examples and exercises for the workshop:
- Option 1: work on your own computer
  - Download and install R and RStudio (if you haven't already).
  - Download the examples and exercises code from the workshop GitHub repository: https://github.com/ocrug/regression_models_2021-02-09
    - If you don't know how to use Git, download the course files by clicking the green "Code" button and selecting "Download ZIP".
    - If you do know how to use Git, clone the repo to your computer.
  - Unzip the files (or go to the directory where you cloned the repository), and double-click the file called `project.Rproj`. This will start RStudio, and you can see the examples and exercises code in the two folders called examples and exercises.
  - Install the following packages:
    - `reshape2`
    - `rcompanion`
    - `nlme`
    - `geepack`
    - `MuMIn`
- Option 2: work on RStudio Cloud
  - Create a free account on RStudio Cloud: https://rstudio.cloud
  - Go to the workshop project: https://rstudio.cloud/project/2051108
  - At the top of the project window, click "Save a Permanent Copy" (it's next to the flashing red "Temporary Project" sign).
  - The project and all its files will now be in your own Personal workspace. You have 15 free hours per month using RStudio Cloud.
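For the first option (working on your own computer), the five packages listed above can be installed from the R console. The snippet below is a minimal sketch: the variable names are hypothetical, and the CRAN cloud mirror is an assumed choice of repository, so substitute your preferred mirror if you have one.

```r
# Packages listed in the workshop setup instructions.
workshop_pkgs <- c("reshape2", "rcompanion", "nlme", "geepack", "MuMIn")

# Install only the ones that are not already present in your library.
missing_pkgs <- setdiff(workshop_pkgs, rownames(installed.packages()))
if (length(missing_pkgs) > 0) {
  # The repository URL is an assumption; any CRAN mirror works.
  install.packages(missing_pkgs, repos = "https://cloud.r-project.org")
}
```

Checking against `installed.packages()` first avoids reinstalling packages you already have, which keeps the setup quick if you run it more than once.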
OCRUG GitHub Repo: https://github.com/ocrug/
If you use RStudio Cloud, you do not need to download the GitHub repo. All files that you need will be provided on the RStudio Cloud instance.
Event Repo: https://github.com/ocrug/regression_models_2021-02-09
A Slack channel has been set up for the event. It will be used for general announcements, and it is also a good place to ask other participants questions.
If you have not created an account on our Slack group, create one using the following link:
Slack Group Sign-up: https://tinyurl.com/socalrug-slack-signup
Once you have an account, sign in (you can do it on a web browser or download an app on your phone or desktop).
Slack channel: https://tinyurl.com/socalrug-slack
The channel for the course is regression-2021
Please follow us on Twitter at oc_rug, and tweet about the event with the hashtag #OCRUG.
Start (PM) | End (PM) | Activity |
---|---|---|
06:30 | 06:40 | Introduction |
06:40 | 07:30 | Mixed-effects Model for Normal Response |
07:30 | 07:50 | Mixed-effects Model Exercise |
07:50 | 08:00 | Mixed-effects Model Solution |
08:00 | 08:10 | Break |
08:10 | 08:30 | Generalized Estimating Equations (GEE) Model for Normal Response |
08:30 | 08:50 | GEE Exercise |
08:50 | 09:00 | GEE Solution |
09:00 | 09:30 | Additional Exercise and Solution |
09:30 | 09:45 | Wrap up |
This event is being brought to you by the Orange County R Users Group (OCRUG).
This event is sponsored by the Paul Merage School of Business at the University of California, Irvine: https://merage.uci.edu/