The Basics

This page takes an estimated 1 hour to complete.

Lastly, we review the basics of statistics and R using this course from Edx.

In order to access course material, make sure to create an Edx account (if you do not have one already) and enroll in the course. It's completely free.

Introduction to Exploratory Data Analysis

Exploratory Data Analysis (EDA) is a key part of what we do when we analyze data. To learn more about it, read this article from the course, and return here afterward.

You can use the buttons on the course page to navigate through the various topics. Alternatively, you can always refer to the links here. Whichever way you feel more comfortable with.


A histogram is a graphical representation that organizes a group of data points into user-specified ranges.

To learn about plotting histograms in R:

  1. Watch this video from the course

  2. Complete these exercises.

Return here after you finish.


A Q-Q plot is a scatterplot created by plotting two sets of quantiles against one another.

  1. Watch this video from the course about plotting Q-Q plots in R

Return here after you finish.


Finally, we cover boxplots - a standardized way of displaying the dataset based on a five-number summary: the minimum, the maximum, the sample median, and the first and third quartiles.

  1. Watch this video explaining how to plot Boxplots in R

Return here after you finish.


If you made it here, then congratulations! You have successfully completed this section. Move to the next portion of the guide with the arrow buttons below.

Last updated