Weird p-values using DESeq2?
1
0
Entering edit mode
ingrid.h • 0
@a9651a34
Last seen 14 hours ago
Norway

Hi! I have used DESeq2 to analyze my RNAseq data. I am studying and comparing two types of tissues.

I have made some boxplots showing different gene markers and the gene counts, comparing the two tissues. When looking at the significantly affected genes in the plot, often the boxes is not that far apart as you would expect. Thereafter, when looking into the counts data, I got a bit surprised.

Examples:
Gene1:
Counts tissue 1: 342, 263, 486
Counts tissue 2: 347, 323, 543
p-adj: 0.006

Gene2:
Counts tissue 1: 19005, 14575, 24455
Counts tissue 2: 8840, 25618, 24298
p-adj: 0.019

Gene3:
Counts tissue 1: 10100, 8286, 14597
Counts tissue 2: 7334, 6179, 14408
p-adj: 4.2e-05

How does these count values and their p-values make sense?
I do not understand why they are significant when the mean values are so close.

DESeq2 RNASeq • 214 views
ADD COMMENT
1
Entering edit mode

This does not look like normalized counts but raw counts. Consider using counts(dds, normalized = TRUE) or the output of normTransform() or vst/rlog(). Generally, MA-plots and volcanos are a good start for diagnosis how the DE profile is across the entire dataset, this in combination with plotPCA() is routine diagnosis.

ADD REPLY
0
Entering edit mode
@wolfgang-huber-3550
Last seen 12 hours ago
EMBL European Molecular Biology Laborat…

Dear Ingrid,

thank you for asking. Can you provide the input count matrix (you can "pseudonymize" the biological conditions and the gene names) and your R code?

Maybe the first thing to look at is the size factors for each replicate, and the normalized counts.

Thanks and best wishes

Wolfgang

ADD COMMENT

Login before adding your answer.

Traffic: 830 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6