Hello,
I'm following the vignette on how to import expression estimates from Salmon with tximport and create an offset matrix. I also want to pool technical replicates with sumTechReps(). The technical replicates were analyzed separately with Salmon and will have different offsets.
I believe edgeR doesn't modify the raw counts and instead uses the offsets in the GLM. From inspecting the sumTechReps code, it seems that when you pass a DGEList object, the function sums the counts and computes average normalization factors. Is that the correct way to pool technical replicates in this situation?
EDIT: When using sumTechReps(ID = sample_ids), the column names of y$counts and the row names of y$samples are converted to the corresponding sample_ids, but the column names of y$offset still refer to the original replicate IDs (although the matrix is reduced to the same dimensions as the pooled count matrix). Could that be a problem?
Thank you!