Dear all,
I'm trying to perform a particular kind of differential expression analysis. I'm using the DESeq2 package, because I think that its fold-change moderation is very important for the analysis, but I could also switch to other packages if needed.
I have three different biological conditions, say A,B,C, each with replicates. What I would like to perform is an analysis that could return genes whose expression is different between B and both A and C (in the same direction). The kind of fold-change and significance I would like to get in the end is something like an average of B vs A, B vs C.
But I wouldn't like to perform two pairwise comparisons: B vs A, B vs C, because I consider it important to rank the results and I wouldn't know how to weight the significance and fold-changes from the two analyses.
I also wouldn't like to simply treat A and C as the same condition. Because I think that would be a problem for the dispersion estimation, because A and C are quite different from each other.
I read DESeq2 and edgeR documentation, but I couldn't come up with a suitable method to get the results I'm interested in. Do you have any suggestion?
Thanks!
So ... trying to summarize what you are asking for more succinctly: are you interested in finding genes that are differentially expressed between B and the average of A and C, ie. B / mean(A,C)?
sorry I was not really synthetic. I don't exactly want to get the mean of A and C. I can give you a few examples:
gene A B C
gene1 100 200 150
gene2 150 200 150
gene3 100 200 200
From this example, I would like to get gene1 as the top significant, followed by gene2 and last gene3. Notice that the fold change between B and the average of A and C is the same for gene2 and gene3, but gene2 should be much more significant. I was thinking that possibly the best way I could get this, would be by doing a geometric mean of the two fold changes (BvsA, BvsC). What do you think?
I basically want to get DEGs that are significant both between BvsA and BvsC, allowing different fold-changes. The magnitude of these two fold-changes should help me rank all the genes, even when one is close to 0, like for gene3.
oops, answer below...