Hi,
I am unsure as to whether the EdgeR normalization factors (calculated with calcNormFactors) take into account library size. From my reading, they only deal with RNA composition and library size is dealt with elsewhere but I unclear on this. Could someone please clarify the steps of normalization in EdgeR?
Many thanks,
Lucy
Thank you Aaron and James. I am actually trying to decide what to normalize by when I generate BigWig files using deepTools - should I use the edgeR normalization factor or the effective library size (normalization factor x library size) - the latter seems to make most sense?
Yes, it makes more sense to scale (i.e., divide) your BigWig coverage by the effective library size. I assume you are dealing with RNA-seq data? If you are dealing with other forms of sequencing data related to genomic coverage, there may be other biases involved. These require more care when you compute normalization factors - for example, see the csaw user's guide for some details about computing these factors for ChIP-seq data.
Great thank you. Yes, I am dealing with RNA-seq data, although I also have ATAC-seq data. What would you recommend for scaling this?
I haven't personally dealt with ATAC-seq data, but a few of my colleagues have used csaw (or at least its normalization) for it, and they seemed fairly satisfied, so...