Question

DESeq2 subexperiments vs. Whole experiment

0

Entering edit mode

jshouse ▴ 10

@jshouse-10956

Last seen 23 months ago

United States

I have an experiment with 45 samples. These 45 samples consist of 3 different surgeries, two treatments and two time periods for a total of 10 different experimental conditions with 4-5 experimental replicates for each.

I'm using DESeq2 to analyze for DE of genes from RNAcounts.

There are really multiple comparisons we are interested in

Surgeries versus Sham at two time points
Surgery 1 over sham vs. Surgery2 vs. Sham
Treatment vs control (treatment is different from surgery)
Surgery differences in comparison to treatment vs. control

My question revolves around best practices. I originally ran DESeq2 on (treatment) which is a concatenation of two treatments, 3 surgeries and two time periods (this is 10 groups of 4-5 replicates each, because time period 2 is missing one surgery). design ~ treatment

Then I have been using contrast statements to examine individual comparisons such as the following:

res.d1.ozone <-results(dds,contrast=c("treatment","Day1OzoneSHAM","Day1AirSHAM"),parallel = TRUE)

This compares time period one, sham surgery, Ozone to Air. etc...

My question: Should I be running a separate dds (and normalization etc...) followed by:

dds <- DESeq(dds)
res <- results(dds)

on subsets of data (in this case 4reps of each) or am I better off normalizing all day in a single dds as I have done on all 45 samples and using contrast statements as above to evaluate?

deseq2 • 970 views

ADD COMMENT • link updated 8.5 years ago by Ryan C. Thompson ★ 7.9k • written 8.5 years ago by jshouse ▴ 10

0

Entering edit mode

Also, the second time point is way different than the first in terms of outcome, so I won't be using that to create a multi-factor design. I will use a multi-factor design within a time period to exam both treatment and surgery factors at the same time.

ADD REPLY • link 8.5 years ago jshouse ▴ 10

2

Entering edit mode

Ryan C. Thompson ★ 7.9k

@ryan-c-thompson-5618

Last seen 8 weeks ago

Icahn School of Medicine at Mount Sinai…

In general, the best approach is to include all the samples in a single model and then pick out specific comparisons from the full model using contrasts. The only reason you might consider fitting multiple separate models on subsets of the data would be if you believed that there were substantial differences in the dispersions of each treatment group.. Although in that case I would rather recommend using voomWithQualityWeights from the limma package, which can accommodate samples/groups with different variances in a single model.

ADD COMMENT • link 8.5 years ago Ryan C. Thompson ★ 7.9k

score 3 · Accepted Answer · 2016-06-22

3

Entering edit mode

Michael Love 43k

@mikelove

Last seen 1 hour ago

United States

There is a FAQ in the vignette about this question:

vignette("DESeq2")

Let me know if that helps answer your question.

ADD COMMENT • link 8.5 years ago Michael Love 43k

0

Entering edit mode

Thank you. It did. I should have read the FAQs first.

ADD REPLY • link 8.5 years ago jshouse ▴ 10