View the Project on GitHub tavareshugo/2019-07-01-bioinformatics_for_biologists
1, 2 & 4 July 2019, Cambridge University Bioinformatics Training
Instructors: Hugo Tavares & Sandra Cortijo
Helper: Martin van Rongen
This is a general introduction to R for exploratory data analysis.
Our practicals will be very hands-on, focusing on learning the necessary sintax to allow you to do exploratory data analysis in R, from data manipulation to visualisation. We will focus on tabular data, which is general enough to allow you to apply these skills to a wide range of problems. On the third day we will go through a more complex example using transcriptomic data.
Below, we provide links to detailed materials for your reference, many of which were developed by the Data Carpentry organisation.
If you have any questions please post a new issue on our GitHub repository.
All necessary software and data will be available on the training machines at the Bioinformatics Training Room (Craik-Marshall Building).
However, you are welcome to use your own laptop, in which case you need to:
tidyverse
, corrplot
, cowplot
and ggfortify
(open RStudio and go to Tools > Install Packages
)ComplexHeatmap
(instructions here)This lesson will cover the basics of using R with RStudio and how to produce a wide range of graphs for data visualisation.
ggplot2
(part I)
This lesson will cover some functions to effectively manipulate and summarise tabular data and we will learn more about data visualisation.
Digital data recording often starts with a spreadsheet software (e.g. Excel). For an effective data analysis, it’s crucial to start with a well structured and formatted dataset. Because of this, we will have a brief discussion about common issues that should be considered when recording data.
Further reading:
In this session we will apply the concepts learned so far to a worked example of an exploratory data analysis of transcriptomic data.
Further reading:
Reference books:
tidyverse
packages.Other courses at Cambridge: