Overview

This workshop will explore ways to incorporate data-scientific topics into existing statistics courses, especially at the introductory level. It will preview modules that allow for the inclusion of introductory tutorials and lab activities into courses for students with limited technical backgrounds. Participants will work through class-tested tutorials and lab activities. Key topics will include data wrangling and statistical graphics. The workshop will close with a discussion related to how these materials have been used in existing courses and an overview of the supporting materials. Basic knowledge of R will be valuable, but not required.

The tutorials and case studies were developed by Adam Loy, Shonda Kuiper, and Laura Chihara with support from the ACM FaCE program and The Teagle Foundation.

Special thanks for Ivan Ramler for helping with the workshop!


Preparation

Prior to the workshop, please do the following:

  1. Make sure that you have access to R and RStudio. I set up an RStudio server for today’s workshop: http://bit.ly/ds4stats-rstudio
  2. If using your own computer, update/install your R packages using the below code:

    update.packages()
    install.packages(c("mosaic", "tidyverse", "rmarkdown"))
  3. If you have never seen R Studio or R markdown before, then please read the following:

Outline

Welcome!

  1. Introductions
  2. Open communal notes – Add questions, comments, and notes throughout the workshop
  3. Brief overview of the project

Data visualization with ggplot2

Workflow
  1. Download the zip file containing the module. (Note: Mac users will need to use Chrome or Firefox, the download does not work with Safari.)
  2. Upload the zip file to the R Studio server OR unzip and store in a logical location on your laptop.
  3. Plot data with ggplot2!
Additional resources
If you get done early…

Check out another tutorial related to statistical graphics

Data wrangling with dplyr

Workflow
  1. Download the zip file containing the module. (Note: Mac users will need to use Chrome or Firefox, the download does not work with Safari.)
  2. Upload the zip file to the R Studio server OR unzip and store in a logical location on your laptop.
  3. Wrangle data with dplyr!
Additional resources
If you get done early…

Check out another tutorial/exercise related to data wrangling

Wrapping up

  1. Pull files off the RStudio server
  2. What topics/materials are missing?
  3. How might you incorporate these modules into your classes?

Additional Resources

  1. Check out the Tutorials and Case studies tabs of the website
  2. Full list of tutorials on Stat2Labs
  3. ds4stats GitHub organization
  4. My Fall 2016 intro stats materials
  5. rawgit for displaying .html versions of the tutorials directly from GitHub
  6. DownGit for downloading .zip files for specific folders on GitHub