Hi,
I would like to use Salmon count data and DEseq2 to identify differentially expressed genes with pre-defined size factors for the different samples (as total transcripts are biased between samples) :
sizeFactors(dds) <- size_factors
dds <- DESeq(dds)
#using pre-existing size factors
#estimating dispersions
#gene-wise dispersion estimates
#mean-dispersion relationship
#final dispersion estimates
#fitting model and testing
Do I understand correctly that using pre-existing size factors ignores normalisation factors derived from using 'avgTxLength' from assays(dds), correcting for library size
(i.e. when no pre-existing size factors were defined)?
How can I combine pre-existing size factors with average transcript length normalisation to derive normalisationFactors
?
Thanks!
That's exactly what I was looking for!
Thanks a lot, Michael