Hello! I have a question about correcting for batch effects prior to differential gene expression analysis with limma. I've read that batch effect correction functions such as ComBat should not be used prior to differential expression analysis in limma, and that batch effects should be accounted for in linear modeling instead. However, in my case a batch effect and disease effect are one in the same, so if I account for the batch effect in the linear model the differential expression analysis will not include disease influences on differential gene expression. Therefore I'd like to re-run several healthy and disease samples, use those to calculate healthy and disease gene-wise normalization factors, and multiply out by those factors to eliminate the batch effect while maintaining disease effects. Is it acceptable to do the normalization using read per million data, back calculate to raw data using library sizes for each sample, and then do differential expression using limma? Thanks, all the best!
Adam