Question

Differential Gene Expression

0

Entering edit mode

fromhj304 ▴ 10

@fromhj304-11588

Last seen 8.1 years ago

Hi all, I'm right now working with a RNA-seq raw count data file (in .txt format). It's a matrix of ~50,000 genes (row) and 8 samples (column).

My assignment is to get differential gene expression analysis. But I'm not really sure what it means by that. I'm using DESeq2 package in R for this task, and since I'm really new to R and bioinformatics, I'm not really sure what they want as the result.

So my questions are:

1) What does one expect from differential expression analysis? I've got the list of genes that have p-values less than 0.05. Would that be sufficient for differential expression analysis? What I have right now is just the list of genes.

2) What is MA plot and what does it have to do with differential expression analysis? I've read some papers about RNA-Seq and R packages that help analysis, and most of them mentioned MA plot. What is its significance in DE analysis? And what can we see from it?

Here is part of my source code that I wrote to get the list of genes that have p-values less than 0.05:

#get differential expression results
res<-results(dds)

#sort by p-value
res<-res[order(res$pvalue),]
head(res)

#write a .csv file that contains information of genes
#that have p-value smaller than 0.05
write.csv(res[which(res$pvalue<0.05),], file="DEoutput.csv")

Thanks!

deseq2 bioinformatics rna-seq Tutorial • 1.7k views

ADD COMMENT • link updated 8.1 years ago by Michael Love 43k • written 8.1 years ago by fromhj304 ▴ 10

score 1 · Answer 1 · 2016-10-03

MA plots were originally an invention of Terry Speed's group at UC Berkeley for two-color microarrays. MA plots first appeared in the sma and affy packages back in 2002, and were a way to display the raw data from a two-color microarray or for a pair of Affymetrix microarrays. The idea of using MA plots to display the results of a differential expression analysis appeared in the limma package in 2004.

A short review of MA plots can be found in the section on "mean-difference plots" in this article:

http://nar.oxfordjournals.org/content/43/7/e47.full

score 1 · Answer 2 · 2016-10-03

1

Entering edit mode

Michael Love 43k

@mikelove

Last seen 1 day ago

United States

"1) What does one expect from differential expression analysis? I've got the list of genes that have p-values less than 0.05. Would that be sufficient for differential expression analysis? What I have right now is just the list of genes."

This is one possible "end point" for DE analysis. The list of genes with adjusted p-value less than X should satisfy the property that in expectation the false discovery rate in this set is less than X.

"2) What is MA plot and what does it have to do with differential expression analysis? I've read some papers about RNA-Seq and R packages that help analysis, and most of them mentioned MA plot. What is its significance in DE analysis? And what can we see from it?"

Check Gordon Smyth's pointer to a paper describing the plot.

You can also read about the MA plot in DESeq2 by reading Section 1.5.1 MA-plot, in the DESeq2 vignette:

vignette("DESeq2")

Or the help page for DESeq2's plotMA function:

?plotMA

Or the description of DESeq2's MA plot in our RNA-seq workflow:

http://www.bioconductor.org/help/workflows/rnaseqGene/#plotting-results

ADD COMMENT • link 8.1 years ago Michael Love 43k

0

Entering edit mode

Hi thank you so much for the answer!

But why did you use adjusted p-values instead of just p-values? What are their differences?

Thanks again!

ADD REPLY • link 8.1 years ago fromhj304 ▴ 10

0

Entering edit mode

See "Multiple testing" in our workflow.

ADD REPLY • link 8.1 years ago Michael Love 43k

0

Entering edit mode

Thank you so much!!

ADD REPLY • link 8.1 years ago fromhj304 ▴ 10