Question

Weird p-values using DESeq2?

0

Entering edit mode

ingrid.h • 0

@a9651a34

Last seen 6 weeks ago

Norway

Hi! I have used DESeq2 to analyze my RNAseq data. I am studying and comparing two types of tissues.

I have made some boxplots showing different gene markers and the gene counts, comparing the two tissues. When looking at the significantly affected genes in the plot, often the boxes is not that far apart as you would expect. Thereafter, when looking into the counts data, I got a bit surprised.

Examples:
Gene1:
Counts tissue 1: 342, 263, 486
Counts tissue 2: 347, 323, 543
p-adj: 0.006

Gene2:
Counts tissue 1: 19005, 14575, 24455
Counts tissue 2: 8840, 25618, 24298
p-adj: 0.019

Gene3:
Counts tissue 1: 10100, 8286, 14597
Counts tissue 2: 7334, 6179, 14408
p-adj: 4.2e-05

How does these count values and their p-values make sense?
I do not understand why they are significant when the mean values are so close.

DESeq2 RNASeq • 1.1k views

ADD COMMENT • link updated 7 weeks ago by ATpoint ★ 4.8k • written 7 weeks ago by ingrid.h • 0

1

Entering edit mode

This does not look like normalized counts but raw counts. Consider using counts(dds, normalized = TRUE) or the output of normTransform() or vst/rlog(). Generally, MA-plots and volcanos are a good start for diagnosis how the DE profile is across the entire dataset, this in combination with plotPCA() is routine diagnosis.

ADD REPLY • link 7 weeks ago ATpoint ★ 4.8k

score 0 · Answer 1 · 2025-02-14

0

Entering edit mode

Wolfgang Huber ★ 13k

@wolfgang-huber-3550

Last seen 6 weeks ago

EMBL European Molecular Biology Laborat…

Dear Ingrid,

thank you for asking. Can you provide the input count matrix (you can "pseudonymize" the biological conditions and the gene names) and your R code?

Maybe the first thing to look at is the size factors for each replicate, and the normalized counts.

Thanks and best wishes

Wolfgang

ADD COMMENT • link 7 weeks ago Wolfgang Huber ★ 13k