Hello, I am working with RNA Seq samples from an experiment similar to that described in section 9.7 of the limma user guide (multi-level experiments). The difference is that we have multiple fastq files per subject per condition per tissue. For example:
File01a - Subject1 - diseased - tissueA File01b - Subject1 - diseased - tissueA File02a - Subject1 - diseased - tissueB File02b - Subject1 - diseased - tissueB ... Is there a recommendation on how best to analyze this set to compare diseased vs healthy? Can duplcateCorrelation be applied with block=subject tissue? Should one of the duplicate pairs be removed based on some QC measure? Is there some way to combine the data from the duplicate pairs of fastq files?
Thanks in advance, richard
Are the duplicate fastq files technical replicates or biological replicates? If you extracted RNA just once for each subject-condition but sequenced twice to get more reads then that's a technical replicate. If you extracted RNA twice from separate tissue or cell samples then that's a biological replicate.