Location: Sainsbury Laboratory Cambridge University (map)

Dates:

Instructors: Sandra Cortijo, Hugo Tavares, Anna Gogleva

Requirements: participants must bring a laptop with a few specific software packages installed beforehand (details below).

Description

This is a series of short and modular workshops where you will learn how to analyse data using the statistical software R. Each module will be 1h30m - 2h long and we will cover topics such as:

We specifically focus on tabular data (like you would produce in a spreadsheet) and aim to very quickly teach you how to produce high-quality graphs to enable efficiently data exploration.

No prior knowledge of R is assumed. Even if you already know R, this might be useful if you want to refresh your memory or learn about specific packages that we will cover (ggplot2 for plotting and dplyr for manipulating tables).

As an example dataset we will be using real data published by Burghardt .. Schmitt (2015).

Course Materials

Module 1

File organisation

Data organisation

Intro to R and read your data into R

Module 2

Data visualisation (exercise solutions)

Module 3

Data manipulation

Module 4

Advanced data manipulation

Practice exercise

Dataset

Solutions!

Further resources

The University runs several introduction courses on R and other topics (bioinformatics, image analysis, etc.):

Software installation

You will need a laptop with the following software installed:

In each case, download the relevant installer for your operating system (Mac, Windows and Linux versions are available) and install the programs as you would do normally.

After installing both of those programs, we would ask you to install some R packages (these are like “extensions” to add extra functionality to R):

  1. Make sure you’re connected to the internet
  2. Open RStudio
  3. In the top menu choose Tools > Install Packages...
  4. Type the following in the “Packages” box: tidyverse,visdat,plotly
  5. Click “Install” and the installation should happen automatically

Please confirm that the packages are properly installed by doing the following:

  1. Open RStudio
  2. In the “Console” panel (bottom left) type the following commands, one at a time:
    • library(tidyverse)
    • library(visdat)
    • library(plotly)
  3. The output should be similar to what is shown in the screenshot below. If you get a message, such as: “Error in library(tidyverse): there is no package called ‘tidyverse’” please get in touch with one of the instructors before the course.

Screenshot of correct package installation