Question

Dispersion increasing for extremely high mean counts

XTR5 ▴ 10

@p1000

United States

I am noticing an interesting trend on the dispersion plot where dispersion estimates are increasing for genes with extremely high counts:

[Image: dispersion plot]

These are selection experiments where we expect a high degree of variance for highly-expressed genes (a given gene may be highly expressed in one condition but not another). Still, I think the model fit is generally performing fairly well across the range of counts in this example. Would you be wary of these dispersion estimates leading to unpredictable p-value estimates from the NB GLM?

DESeq2

score 2 · Accepted Answer · 2021-06-23

Michael Love 43k

@mikelove

United States

"we expect a high degree of variance for highly-expressed genes (a given gene may be highly expressed in one condition but not another)"

The dispersion doesn't increase because of changes across conditions; it only reflects variation within a condition.
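To make the within-condition point concrete, here is a minimal sketch using a simple method-of-moments dispersion, (variance - mean) / mean^2. This is not how DESeq2 estimates dispersion (it uses a likelihood-based fit against the GLM's fitted means), and the counts and the helper name `moment_dispersion` are made up for illustration. It only shows that a large difference between conditions does not inflate the dispersion as long as each condition is compared against its own mean.

```python
from statistics import mean, variance


def moment_dispersion(counts):
    """Rough method-of-moments NB dispersion: (var - mean) / mean^2, floored at 0.

    Illustrative only; this is not DESeq2's estimator.
    """
    m = mean(counts)
    v = variance(counts)  # sample variance (n - 1 denominator)
    return max((v - m) / m**2, 0.0)


# Hypothetical counts for one gene, three replicates per condition.
# The gene is about 10x higher in condition A than in condition B.
cond_a = [1000, 1100, 900]
cond_b = [100, 130, 70]

# Dispersion measured around each condition's own mean
# (analogous to fitting a per-condition mean with a design of ~ condition).
within_a = moment_dispersion(cond_a)
within_b = moment_dispersion(cond_b)

# Dispersion measured around a single shared mean, ignoring condition.
pooled = moment_dispersion(cond_a + cond_b)

print(f"within A: {within_a:.3f}")
print(f"within B: {within_b:.3f}")
print(f"pooled, ignoring condition: {pooled:.3f}")
```

The pooled value is far larger only because the condition effect is being counted as noise. Once condition is in the design, that difference is absorbed by the fitted means and does not enter the dispersion.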

The red line in the dispersion plot seems like a good reflection of the trend of the MLE estimates.

"Would you be wary of these dispersion estimates leading to unpredictable p-value estimates from the NB GLM?"

The posterior dispersion estimates seem to track the trend closely, so I don't see any issue.