I want to use DESeq2 to look for differentially abundant genes (DAGs) between my patient groups. I have 6 patients, each sampled at 3 timepoints (baseline, infection, resolution). I want to look for DAGs with the following contrasts: baseline vs infection, and infection vs resolution.
This is what I have done so far:
dds <- DESeqDataSetFromMatrix(countData = counts, colData = metadata, design = ~ Pathology)
keep <- rowSums(counts(dds)) >= 10
dds <- dds[keep, ]
dds <- DESeq(dds)
basevsinf <- results(dds, contrast = c("Pathology", "Baseline", "Infection"))
I am finding several DAGs; however, when I look at their individual abundances, I see that they are really only present in 1 or 2 samples. I thought this might be an issue with my DESeq2 design, as it does not take into account that the time points are consecutive samples from the same patient. (For example, I am testing differences between the baselines of patients 1 to 6 and the infection points of patients 1 to 6.) How do I account for this within-patient sampling in the design?
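For repeated samples from the same patients, the usual DESeq2 approach is a paired design that adds the patient as a blocking factor ahead of the variable of interest. Below is a minimal sketch, assuming the metadata has a `Patient` column (a hypothetical name, not shown in the post) alongside `Pathology`. The metadata and counts are simulated purely so the example runs on its own, and the DESeq2 part is skipped if the package is not installed.

```r
set.seed(1)

# Simulated metadata: 6 patients x 3 timepoints (the Patient column is an assumption).
stages <- c("Baseline", "Infection", "Resolution")
metadata <- data.frame(
  Patient   = factor(rep(paste0("P", 1:6), each = 3)),
  Pathology = factor(rep(stages, times = 6), levels = stages)
)
rownames(metadata) <- paste(metadata$Patient, metadata$Pathology, sep = "_")

# Check that the paired design is estimable (full rank) before fitting.
mm <- model.matrix(~ Patient + Pathology, data = metadata)
cat("design columns:", ncol(mm), " rank:", qr(mm)$rank, "\n")

if (requireNamespace("DESeq2", quietly = TRUE)) {
  # Simulated counts with gene-specific means; replace with the real count matrix.
  n_genes <- 200
  counts <- matrix(
    rnbinom(n_genes * nrow(metadata), mu = 2^runif(n_genes, 4, 10), size = 5),
    nrow = n_genes,
    dimnames = list(paste0("gene", seq_len(n_genes)), rownames(metadata))
  )

  # Patient comes first so that Pathology is tested within patient.
  dds <- DESeq2::DESeqDataSetFromMatrix(
    countData = counts, colData = metadata, design = ~ Patient + Pathology
  )
  dds <- DESeq2::DESeq(dds)

  # The two contrasts from the question (numerator level first, denominator second).
  basevsinf <- DESeq2::results(dds, contrast = c("Pathology", "Baseline", "Infection"))
  infvsres  <- DESeq2::results(dds, contrast = c("Pathology", "Infection", "Resolution"))
}
```

With this design, each patient's own baseline level is absorbed by the `Patient` term, so the `Pathology` contrasts compare timepoints within patients rather than across them.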
Thanks for the help. I did try changing the design to:
But overall, most of the DAGs still tend to be present in only 2 of the 6 samples. Is this just a shortfall of having only 6 patients with 3 samples per patient?
Are there any issues with running:
I just tried that and I have fewer DAGs, but they are definitely present in more samples.
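For reference, one commonly used pre-filter with this kind of effect keeps a gene only if it reaches a minimum count in a minimum number of samples, instead of thresholding on the row total. The sketch below uses a toy matrix and is not necessarily the command that was run in this thread; the threshold of 10 counts and the minimum of 3 samples are assumptions to adjust.

```r
# Toy count matrix: 4 genes x 6 samples (made-up values for illustration).
counts_toy <- matrix(
  c(  0,   0,   0,   0,   0, 500,   # high total, but present in only 1 sample
     12,  15,  11,  20,  14,  18,   # present in all 6 samples
      0,  30,   0,  25,   0,   0,   # present in 2 samples
      1,   2,   0,   1,   3,   1),  # low everywhere
  nrow = 4, byrow = TRUE,
  dimnames = list(paste0("gene", 1:4), paste0("S", 1:6))
)

min_count   <- 10  # assumed per-sample count threshold
min_samples <- 3   # assumed minimum number of samples passing the threshold

# Row-total filter (as in the original post): passes genes driven by one sample.
keep_total <- rowSums(counts_toy) >= 10

# Prevalence filter: requires the count threshold in at least min_samples samples.
keep_prev <- rowSums(counts_toy >= min_count) >= min_samples

print(cbind(keep_total, keep_prev))

# With a DESeqDataSet the same idea would read:
#   keep <- rowSums(counts(dds) >= min_count) >= min_samples
#   dds  <- dds[keep, ]
```

In the toy data the row-total filter keeps gene1 and gene3 even though they are detected in only one or two samples, while the prevalence filter keeps only gene2.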
Again, sometimes you see few DE genes because there really aren't any detectable in your samples. Do your samples group nicely in a PCA? If they don't, having few DE genes would be consistent with that.
No, overall they don't group nicely, so I'm not expecting many DAGs. The data look more convincing using:
So I might stick with that. Thanks for your help.