Bioinformatics for Biologists

Materials for the R sections of the "Bioinformatics for Biologists" course at Cambridge University

View the Project on GitHub tavareshugo/2018-12-03-bioinformatics_for_biologists

Introduction to data analysis with R

3-4 December 2018, Cambridge University Bioinformatics Training

Instructors: Hugo Tavares & Sandra Cortijo (Sainsbury Laboratory)


This is a general introduction to R for data analysis.

Our practicals will be very hands-on, focusing on the syntax you need to do data analysis in R, from data manipulation to visualisation. We will focus on tabular data, which is general enough to allow you to apply these skills to a wide range of problems.

Below, we provide links to detailed materials for your reference, many of which were developed by the Data Carpentry organisation.

If you have any queries, please post a new issue on our GitHub repository.


Setup

All necessary software and data will be available on the training machines in the Bioinformatics Training Room (Craik-Marshall Building).

However, you are welcome to use your own laptop, in which case you need to:


Data Organisation in Spreadsheets

Digital data recording often starts with spreadsheet software (e.g. Excel). For an effective data analysis, it’s crucial to start with a well-structured and consistently formatted dataset. Because of this, before diving into R, we will discuss common issues to consider when recording data in spreadsheets.

Further reading:

Introduction to R

This lesson will cover the very basics of using R with RStudio.
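
As a rough idea of what "the very basics" can look like, here is a minimal sketch in base R. The object names (`heights_cm`, `plants`) and all values are made up for illustration and are not taken from the course materials.

```r
# Store some (hypothetical) measurements in a numeric vector
heights_cm <- c(152, 171, 165, 180)

# Apply functions to the vector
mean_height <- mean(heights_cm)     # arithmetic mean
n_tall <- sum(heights_cm > 160)     # count values above a threshold

# Combine vectors into a data frame, R's basic structure for tabular data
plants <- data.frame(
  genotype  = c("wt", "wt", "mut", "mut"),
  height_cm = heights_cm
)

# Inspect the structure of the table and the number of rows
str(plants)
nrow(plants)
```

Running these lines one at a time in the RStudio console is a good way to see how objects are created and then appear in the Environment pane.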

Detailed reference materials:

exercises

Data manipulation and visualisation in R

This lesson will cover functions for manipulating and summarising tabular data with the dplyr package. We will also start learning how to visualise data with the ggplot2 package.
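
The following is a small sketch of this kind of workflow, assuming the dplyr and ggplot2 packages are installed. The `plants` table and its column names are hypothetical and only serve as an example; the course itself may use different data and functions.

```r
library(dplyr)
library(ggplot2)

# A small hypothetical table: one row per plant
plants <- data.frame(
  genotype  = c("wt", "wt", "wt", "mut", "mut", "mut"),
  height_cm = c(10, 12, 14, 6, 8, 10)
)

# Summarise: mean height and number of plants per genotype
height_summary <- plants %>%
  group_by(genotype) %>%
  summarise(mean_height = mean(height_cm), n = n())

# Visualise: distribution of heights for each genotype
p <- ggplot(plants, aes(x = genotype, y = height_cm)) +
  geom_boxplot()

print(height_summary)
print(p)
```

The pipe (`%>%`) passes the result of one step to the next, so the summary reads top to bottom in the order the operations are applied.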

Detailed reference materials:

Exploratory RNAseq data analysis in R

In this session we will apply the concepts learned so far to a worked example of an exploratory data analysis of transcriptomic data.
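
To give a flavour of what an exploratory analysis of expression data might involve, here is a hedged sketch using simulated counts and a principal component analysis with base R's `prcomp()`. The count matrix, gene and sample names, and the choice of a log2 transformation are all assumptions made for illustration; they are not the dataset or the exact steps used in the session.

```r
# Simulate a toy count matrix: 100 genes (rows) x 6 samples (columns)
set.seed(1)
counts <- matrix(
  rpois(100 * 6, lambda = 50),
  nrow = 100, ncol = 6,
  dimnames = list(paste0("gene", 1:100), paste0("sample", 1:6))
)

# Log-transform to reduce the influence of highly expressed genes
log_counts <- log2(counts + 1)

# PCA expects samples in rows, so transpose the matrix first
pca <- prcomp(t(log_counts))

# Sample coordinates on the first two principal components
pc_scores <- as.data.frame(pca$x[, 1:2])

# Proportion of variance explained by each component
var_explained <- pca$sdev^2 / sum(pca$sdev^2)

head(pc_scores)
```

With real data, plotting `PC1` against `PC2` and colouring points by experimental condition is a common first check of whether samples group as expected.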

During the lesson, we will also learn a few more tricks in R, including:

Further reading:

Further resources

Extra materials/books: