Hi all
I have a set of 24 samples from 8 different experimental mice groups. Each group has 3 samples, (2 from batch 1 and 1 from batch 2).
I am confused about whether I should use combat_seq to obtain a batch correction before I do DESeq2 analysis or whether I should (as is often advised) include the batch
variable in the model I am going to use to perform DE analysis.
One problem I am facing with Combat_seq is that there are several genes with negative count values after the batch correction is performed and so DESeq errors out and won't create the dds
object
However, if I do DESeq by including the batch variable in the model, when I go on to generate heat maps etc. the clusters still are separated by batch and I can't seem to find any way to "combine" the batches.
Any suggestions will be greatly appreciated.
Can you show some plots and the colData? You should not have negative counts after Combat-Seq, can you show code (plots and code should always be posted rather than just text).
Including batch in the model isn't going to affect PCA much. It will just modify the fold changes and p-values. If you have a strong batch effect, there is no magic that will remove it completely.