Entering edit mode
jonas
•
0
@jonas-14423
Last seen 7.2 years ago
I have a RNA-Seq dataset and want to test differential gene expression with DESeq2 for just some of the genes but for many different phenotypes. Now I have 3 questions:
- I have to give DESeq all the genes and then only subset the results to the genes of interest, correct?
- The most time-consuming step seems to be the dispersion estimation. Is this dependent on the design matrix, i.e. the phenotype? Or is there a way to correct for dispersion once and then perform the differential expression test for multiple phenotypes on the same normalized dataset?
- If 2 is possible: Can I do the testing for only some of the genes after I normalised the dataset with all the genes?
2) no I mean actually different phenotypes, i.e. different variables each of them with 2 groups. Could I normalise like with this:
And then do the tests for each phenotype with
3) yes, I know, I actually have only few genes of interest. But if I subset the genes after the normalisation and before the differential expression test like this?
No you can't change the design after calculating dispersion. The dispersion only make sense relative to a certain design.
Imagine, if you set ~1, and you have a gene with DE across two groups of samples. You will get a high dispersion estimate, because the design doesn't allow for different mean counts for the two groups. Then you switch to ~condition, you would get a lower dispersion estimate.
Yes, you can subset before differential expression testing.
ok, thanks!