Bioconductors,
tl;dr: Does bitseq differeniate between biological and technical variation, if it does, how do I tell BitSeq which files are belong to which biological replicate
I am looking to use BitSeq to analyze my single end illumina data. I have a combinatorial design (3x2x2) where each condition has 3 biological replicates, and each biological replicate was split between 3 lanes to have 3 technical replicates. So I can use BitSeq::getExpression for every technical replicate, to estimate the expression of each transcript and produce a rpkm file.
I'll write each file name to describe the sample as c<condition>b<bioRep>t<techrep> (c1b1t1, c1b1t2, c1b1t3, c1b2t1, ..., c2b3t3).
If I had no technical replicates I would simple say
getDE(list("A" = c(c1b1,c1b2, c1b3), "B"=1(c2b1,c2b2,c2b3))
But if I include my technical replicates as
getDE(list("A" = c(c1b1t1, c1b1t2, c1b1t3, c1b2t1, ..., c1b3t1, ...), "B"=1(c2b1t1, c2b1t2, c2b1t3, c2b2t1, ...,c2b3t1, ...))
But this does not inform BitSeq of the relationship of the variance between each sample. (technical and biological variation). I have read the Bioinformatics paper (vol 28 no 13. 2012, pages 1721-1728) with some level of understanding, but I am by no means fluent/good with bayesian concepts. But I didn't see an explicit term for biological and technical variance (although I am used to dealing with tech reps by comparing a full model vs a reduced model, DESeq2 style). In section 3.4 DE analysis, second paragraph, the authors talk about combining the posterior probabilities, but I believe that is a direct reference to making Figure 5 b and d, not to how to feed in the data prior to DE calls.
Thanks for any wisdom!
Sam
Hi Sam,
Currently, BitSeq does not support technical replicates. I would proceed exactly as you mentioned at the second getDE command and then I would check DE consistency by comparing with the results arising when combining all technical replicates into a single sample (e.g: c1b1 = c1b1t1 + c1b1t2 + c1b1t3, c1b2 = c1b2t1 + c1b2t2 + c1b2t3 etc...) as Ryan suggested.