Hi,
I have pairs of transcripts that overlap on the genome, but I am not sure if they actually belong to the same gene or not. I have expression data for many different samples and I was thinking to perform a correlation test between each overlapping pair.
I can do cor.test() with method="spearman", but I guess maybe this is not the best for RNA-seq data? And also I need to correct for multiple testing.
Any suggestions on how to best perform this test? I want to remove all the gene pairs that are significantly correlated.
Thanks,
Jon
You are correct to be puzzled :)
I should have mentioned that I do not know from which strand the reads transcripts originate from. And so therefore the overlapping transcripts could be on separate strands.
About the expression measurements, they were obtained by mapping RNA-seq reads to the genome, and then counting based on gff-files. And you are right that some of the same reads will have been used to quantify if the transcripts overlap. I need to think about that... Good point!