Hi,
I have a quick question on DEseq2 replicates.
A good number of genes were marked as outliers (912, 5.7%) in my DEseq2 summary(res) output. I am wondering what that means. I found the following sentence in the vignettes:
"The results function automatically flags genes which contain a Cook's distance above a cutoff for samples which have 3 or more replicates....When there are 7 or more replicates for a given sample, the DESeq function will automatically replace counts with large Cook’s distance with the trimmed mean over all samples, scaled up by the size factor or normalization factor for that sample."
Do you mean technical or biological replicates here? In my case, I only have biological replicates, no technical replicates. Does it matter?
Also, when you mean replicate, it means the replicate regarding to the contrast, right? For example, I have 80 samples (from 10 case and 10 control subjects, each subject having 4 different cell types, no tech replicate). If my design is like
design(dds) = ~ cell + condition
How will DESeq2 calculate Cook's distance? Will it consider the 40 control samples as 40 replicates when calling result(dds) ?
Thanks,
Xianjun