The Basics

Lastly, we review the basics of statistics and R using this course from edX.

To access the course material, create an edX account (if you do not already have one) and enroll in the course. It's completely free.

Introduction to Exploratory Data Analysis

Exploratory Data Analysis (EDA) is a key part of what we do when we analyze data. To learn more about it, read this article from the course, and return here afterward.

A histogram is a graphical representation that groups data points into user-specified ranges (bins) and draws a bar for each bin whose height shows how many points fall in it.

To learn about plotting histograms in R:

  1. Watch this video from the course

  2. Complete these exercises.
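As a quick warm-up before the exercises, here is a minimal sketch of a histogram in base R. The data are simulated and the names (`x`, `bin_width`, `bins`) are illustrative assumptions, not taken from the course material.

```r
# Simulated data (an assumption for illustration): 500 heights in cm.
set.seed(1)
x <- rnorm(500, mean = 170, sd = 10)

# Build user-specified bins of width 5 that cover the whole range of x.
bin_width <- 5
lo <- bin_width * floor(min(x) / bin_width)
hi <- bin_width * ceiling(max(x) / bin_width)
bins <- seq(lo, hi, by = bin_width)

# hist() draws the plot and returns the bin counts it used.
h <- hist(x, breaks = bins,
          main = "Histogram of simulated heights",
          xlab = "Height (cm)")

# Number of data points that fell into each bin.
h$counts
```

Changing `bin_width` is a simple way to see how the choice of bins affects the shape of the plot.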

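A Q-Q (quantile-quantile) plot is a scatterplot created by plotting two sets of quantiles against one another. If both sets come from the same distribution, the points fall roughly along a straight line.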

  1. Watch this video from the course about plotting Q-Q plots in R
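To make the definition concrete, the sketch below builds a Q-Q plot by hand in base R and then draws the same kind of plot with the built-in `qqnorm()` and `qqline()`. The data are simulated, and the variable names (`ps`, `sample_q`, `normal_q`) are assumptions made for this example.

```r
# Simulated data (an assumption for illustration).
set.seed(1)
x <- rnorm(200, mean = 10, sd = 2)

# Percentiles 1% through 99%.
ps <- seq(0.01, 0.99, by = 0.01)

# First set of quantiles: taken from the sample.
sample_q <- quantile(x, ps)

# Second set: the same percentiles of a normal distribution
# with the sample's mean and standard deviation.
normal_q <- qnorm(ps, mean = mean(x), sd = sd(x))

# Plot one set against the other; points near the identity
# line suggest the sample is approximately normal.
plot(normal_q, sample_q,
     xlab = "Normal quantiles", ylab = "Sample quantiles")
abline(0, 1)

# Built-in shortcut for a normal Q-Q plot.
qqnorm(x)
qqline(x)
```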

Finally, we cover boxplots, a standardized way of displaying a dataset based on its five-number summary: the minimum, the first quartile, the sample median, the third quartile, and the maximum.

  1. Watch this video explaining how to plot boxplots in R
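The five-number summary and the matching boxplot can be sketched in base R as follows. The small dataset is made up for illustration. Note that, by default, `boxplot()` draws whiskers only to the most extreme points within 1.5 times the box height and marks anything beyond that as a separate outlier point, so the whisker ends can differ from the true minimum and maximum.

```r
# Made-up data (an assumption for illustration); 30 is an obvious outlier.
x <- c(2, 4, 4, 5, 6, 7, 8, 9, 30)

# Five-number summary: minimum, lower hinge, median, upper hinge, maximum.
five <- fivenum(x)
five

# boxplot() draws the plot and returns the statistics it used.
b <- boxplot(x, main = "Boxplot of example data")

# Rows of b$stats: lower whisker, lower hinge, median,
# upper hinge, upper whisker.
b$stats

# Points drawn individually as outliers.
b$out
```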

