Question

timing when to use collapseReplicates in DESeq2

0

Entering edit mode

wiscoyogi • 0

@wiscoyogi-21673

Last seen 2.2 years ago

United States

I am using DESeq2 and have a genes x counts matrix where some of the columns are technical replicates. I want to use collapseReplicates, but I'm confused by the documentation about when to do it (e.g. after calling DESeqDataSetFromMatrix or on the raw counts table)

the documentation( http://bioconductor.org/packages/release/bioc/vignettes/DESeq2/inst/doc/DESeq2.html) says "DESeq2 provides a function collapseReplicates which can assist in combining the counts from technical replicates into single columns of the count matrix", but this sounds like the raw counts matrix. However, the R link makes it sound like it's on a DESeqDataSet (https://www.rdocumentation.org/packages/DESeq2/versions/1.12.3/topics/collapseReplicates). Previous posts on this haven't really clarified this.

I've unsuccessfully transformed the gens x cts matrix, but successfully have done it on the DESeqDataSetFromMatrix derived dds matrix (code below), so I'm sure it's that; I get the expected number of columns reduced based on the number of replicates I have.

However: (1) Why does this has to be done on a DESeq Data Set? (2) Exactly what is happening on the back end? (3) Is it proper to continue with DESeq and the other downstream analysis (like I've started below) as is?

relevant code:

dds <- DESeqDataSetFromMatrix(countData = cts,
                             colData = coldata,
                              design = ~ condition)
ddsColl <- collapseReplicates(dds, dds$sample, renameCols = TRUE)
#perform differential expression analysis
dds <- DESeq(ddsColl)
res <- DESeq(ddsColl)

Posting because (1) I don't really know much about the backend of DESeq2 and don't want to blindly slap functions on my data (2) I'm genuinely curious.

deseq2 • 1.5k views

ADD COMMENT • link updated 5.7 years ago by Michael Love 43k • written 5.7 years ago by wiscoyogi • 0

score 0 · Answer 1 · 2019-08-19

0

Entering edit mode

Michael Love 43k

@mikelove

Last seen 15 hours ago

United States

The function man page makes it clear, the input should be a DESeqDataSet (or SummarizedExperiment, either is accepted).

ADD COMMENT • link 5.7 years ago Michael Love 43k