Load data

  • Load the 2 .Rdata files that you can find in the data/ directory. They contain gene expression data and metadata about the experimental design and the 3 mouse strains used (B6, D2, AK)
  • Use ls() to find the variables loaded from these files
  • What type of variables are they?
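
As a minimal sketch of the load-and-list workflow, the example below writes a small toy .Rdata file to a temporary directory and loads it back. The object and file names (toy_expression, toy_metadata, toy.Rdata) are hypothetical stand-ins, not the names used in the files in data/.

```r
# Toy stand-in for one of the course files (hypothetical object names).
toy_expression <- matrix(1:6, nrow = 2)
toy_metadata <- data.frame(sample = c("s1", "s2", "s3"),
                           strain = c("B6", "D2", "AK"),
                           stringsAsFactors = FALSE)
toy_file <- file.path(tempdir(), "toy.Rdata")
save(toy_expression, toy_metadata, file = toy_file)
rm(toy_expression, toy_metadata)

# load() restores the saved objects and invisibly returns their names
loaded_names <- load(toy_file)
print(loaded_names)

# ls() lists the objects now present in the current environment
print(ls())

# class() and str() report what kind of object each variable is
print(class(toy_expression))
print(class(toy_metadata))
str(toy_metadata)
```

Capturing the return value of load() is a convenient way to see exactly which objects a file added, especially when the workspace already contains other variables.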

Inspect data

  • Have a look at the data
  • Describe them
  • Can you spot any problems using the summary() function?
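
The kind of inspection meant here could look like the sketch below. It uses an invented toy data frame (the column names and the text-encoded age column are assumptions for illustration) to show how summary() can reveal a column stored with the wrong type.

```r
# Toy metadata (hypothetical columns); "age" holds numbers stored as text.
toy <- data.frame(
  sample = c("s1", "s2", "s3", "s4"),
  strain = c("B6", "D2", "AK", "B6"),
  age    = c("10", "12", "11", "unknown"),
  stringsAsFactors = FALSE
)

print(dim(toy))      # number of rows and columns
print(head(toy, 2))  # first rows
str(toy)             # type of every column

# For character columns summary() only reports length, class and mode,
# which hints that "age" is not being treated as a number.
print(summary(toy))

# Class of each column, collected in one named vector
col_classes <- sapply(toy, class)
print(col_classes)
```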

Reformat

  • Reformat the metadata and expression data where needed
  • Do you see any new problems with the summary() function?
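
One possible way to reformat is sketched below on toy data. It assumes the problems are text-encoded numbers and an expression table stored as a data frame with a gene column; the real files may need different fixes, and all names here are hypothetical.

```r
# Toy metadata with a text-encoded numeric column (hypothetical).
toy <- data.frame(
  sample = c("s1", "s2", "s3", "s4"),
  strain = c("B6", "D2", "AK", "B6"),
  age    = c("10", "12", "11", "unknown"),
  stringsAsFactors = FALSE
)
toy$strain <- factor(toy$strain)
# Entries that cannot be parsed as numbers become NA (with a warning,
# silenced here on purpose).
toy$age <- suppressWarnings(as.numeric(toy$age))
print(summary(toy))  # summary() now reports the NA in "age"

# Toy expression table: data frame with a gene column and text values.
toy_expr_df <- data.frame(
  gene = c("geneA", "geneB"),
  s1   = c("1.5", "2.0"),
  s2   = c("3.1", "4.2"),
  stringsAsFactors = FALSE
)
toy_expr <- as.matrix(toy_expr_df[, -1])  # character matrix
storage.mode(toy_expr) <- "numeric"       # convert the values to numbers
rownames(toy_expr) <- toy_expr_df$gene    # genes as row names
print(toy_expr)
```

After the conversion, summary() counts the NA values explicitly, which is one way a new problem can surface once the types are correct.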

Filter metadata

  • Keep only the B6 and D2 strains in your metadata
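
A minimal sketch of this filter, assuming the metadata is a data frame with a strain column (the column names and toy values are hypothetical):

```r
# Toy metadata (hypothetical column names).
toy_metadata <- data.frame(
  sample = c("s1", "s2", "s3", "s4"),
  strain = factor(c("B6", "D2", "AK", "B6")),
  stringsAsFactors = FALSE
)

# Keep the rows whose strain is B6 or D2
kept <- toy_metadata[toy_metadata$strain %in% c("B6", "D2"), ]

# Drop the now-unused "AK" level so it no longer shows up in summaries
kept$strain <- droplevels(kept$strain)
print(kept)
print(levels(kept$strain))
```

Without droplevels() the factor would still list AK as a level with zero samples, which can be confusing in later tables and plots.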

Reorder

  • Filter the expression data accordingly.
  • The sample IDs in the metadata and the samples in the expression data should be in the same order
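
The sketch below shows one way to subset and reorder in a single step. It assumes the expression data is a matrix with genes in rows and samples in columns, and that the column names match the sample IDs in the metadata; both are assumptions about the real data.

```r
# Toy expression matrix: genes in rows, samples in columns (assumed layout).
toy_expression <- matrix(
  1:8, nrow = 2,
  dimnames = list(c("geneA", "geneB"), c("s3", "s1", "s4", "s2"))
)
# Toy metadata after strain filtering (sample s3 was removed).
toy_metadata <- data.frame(
  sample = c("s1", "s2", "s4"),
  strain = c("B6", "D2", "B6"),
  stringsAsFactors = FALSE
)

# Indexing columns by name both drops s3 and puts the remaining samples
# in metadata order. as.character() guards against factor IDs, which
# would otherwise be used as integer positions.
sample_ids <- as.character(toy_metadata$sample)
toy_expression <- toy_expression[, sample_ids, drop = FALSE]

# Check that both objects now agree on sample order
stopifnot(identical(colnames(toy_expression), sample_ids))
print(toy_expression)
```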

Filter

  • Filter out lowly expressed genes, i.e. genes with a mean expression below 5
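
As an illustration, assuming genes are in rows so that rowMeans() gives one mean per gene, the filter could be sketched like this on toy values:

```r
# Toy expression matrix (hypothetical values): genes in rows.
toy_expression <- matrix(
  c( 1,  2, 3,
    10, 12, 8,
     4,  6, 5),
  nrow = 3, byrow = TRUE,
  dimnames = list(c("geneA", "geneB", "geneC"), c("s1", "s2", "s3"))
)

# One mean per gene
gene_means <- rowMeans(toy_expression)
print(gene_means)

# Genes with a mean below 5 are removed, so keep those with mean >= 5
filtered <- toy_expression[gene_means >= 5, , drop = FALSE]
print(filtered)
```

Note that rowMeans() returns NA for any gene with a missing value unless na.rm = TRUE is given, so the order of this step and the NA removal matters.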

Remove NAs

  • Remove all genes with missing (NA) expression values
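
A possible sketch, assuming "empty" means NA and that genes are in rows of a numeric matrix (toy values, hypothetical names):

```r
# Toy expression matrix with one missing value.
toy_expression <- matrix(
  c( 1, NA, 3,
    10, 12, 8),
  nrow = 2, byrow = TRUE,
  dimnames = list(c("geneA", "geneB"), c("s1", "s2", "s3"))
)

# Flag genes (rows) that contain at least one NA
has_na <- rowSums(is.na(toy_expression)) > 0
print(has_na)

# Keep only the complete genes
clean <- toy_expression[!has_na, , drop = FALSE]
print(clean)
```

complete.cases(toy_expression) would give the same selection as !has_na here.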

Plot

  • Plot the mean expression over all genes using plot()
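
One reading of this task is a plot of the per-gene mean, one point per gene; the sketch below follows that interpretation on randomly generated toy data. The pdf() device and the temporary file name are only there so the example also runs outside an interactive session.

```r
# Toy expression matrix with random values (for illustration only).
set.seed(1)
toy_expression <- matrix(
  rexp(60, rate = 0.2), nrow = 20,
  dimnames = list(paste0("gene", 1:20), paste0("s", 1:3))
)

# Mean expression of each gene across samples
gene_means <- rowMeans(toy_expression)

# Write the plot to a temporary PDF; in an interactive session the
# pdf() and dev.off() calls can be dropped to draw on screen instead.
plot_file <- file.path(tempdir(), "toy_gene_means.pdf")
pdf(plot_file)
plot(gene_means,
     xlab = "Gene index",
     ylab = "Mean expression",
     main = "Mean expression per gene (toy data)")
dev.off()
```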

Save data

  • Save metadata and expression data using the following command:
save(metadata,expression,file="data/Ex2.Results.Rdata")
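
To check that a saved file contains what was intended, it can be loaded back into a separate environment and compared. The sketch below does this with toy objects and a temporary file, so it does not touch data/Ex2.Results.Rdata; the toy values are hypothetical.

```r
# Toy objects standing in for the final metadata and expression data.
metadata <- data.frame(
  sample = c("s1", "s2"),
  strain = c("B6", "D2"),
  stringsAsFactors = FALSE
)
expression <- matrix(
  c(10, 12, 6, 7), nrow = 2,
  dimnames = list(c("geneA", "geneB"), c("s1", "s2"))
)

# Save to a temporary file rather than the exercise's output path
out_file <- file.path(tempdir(), "Ex2.Results.Rdata")
save(metadata, expression, file = out_file)

# Load into a fresh environment so the originals are not overwritten
check_env <- new.env()
restored <- load(out_file, envir = check_env)
print(restored)

# Both objects should come back unchanged
print(identical(check_env$metadata, metadata))
print(identical(check_env$expression, expression))
```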