Hi,
I have a question regarding the running time of DESeq function in DESeq2.
Does the running time of DESeq function is much more longer for design factor containing two variable compare to design factor containing only one variable?
For example does the running time for the condition 1 is much more longer compare to condition 2?
Condition 1:
dds_1 <- DESeqDataSetFromMatrix(countData = bigdf_t, colData = sample_info, design = ~ subject + condition)
dds_1 <- DESeq(dds_1, parallel = TRUE)
Condition 2:
dds_1 <- DESeqDataSetFromMatrix(countData = bigdf_t, colData = sample_info, design = ~ condition)
dds_1 <- DESeq(dds_1, parallel = TRUE) ```
I would like to add that in both condition the number of samples and genes are the same.
Thanks a lot for the answer in advance.
Cannot come up with a precise answer other than 'not much'. For normal sized analysis with tens to hundreds of samples that will take a few seconds unless you pump it with many covariates. It's really not much if an issue? Do you experience any problems?
The data comprises 100 samples and around 3 million covariates.
When I ran the code for the condition 2 ( DESeqDataSetFromMatrix(countData = bigdf_t, colData = sample_info, design = ~ condition)) it took around 5 6 hours to gave me the result, however the code is now running for around 23 hours for the condition 1 (DESeqDataSetFromMatrix(countData = bigdf_t, colData = sample_info, design = ~ subject + condition) and still I do not get any result. it is in gene-wise dispersion estimates status with out giving me any error or so...
Do you think it is ok or there is sth wrong there?