Entering edit mode
Hi
I want to variance filter my RNA-seq data to increase statistical power similar to what I have seen done in microarray studies where the data groups are pretty similar. However I cannot use limma/deseq etc after the variance filter because it screws up the background variance, is it OK to do a t-test? I think there are problems with the normal distribution not fitting RNA-seq data, in which case, what to do? Any ideas?
Cheers.
I' new to rna seq analysis. So pardon my ignorance.
Why is variance filtering hardly ever a good idea, and almost certainly not with RNA-seq data?
What if your downstream analysis is a clustering algorithm instead of differential expression?
Best
For clustering that is fine, it makes sense. For differential expression many of the packages do variance sharing to increase power so you cannot remove the low variance ones. Actually I compared variance filtering + t test with limma for finding DE genes, and limma came out on top because it does this information sharing.
Oh cool. Thanks for the info!