Hi all,
Sorry for this naive question, but I haven't been able to find a response anywhere.
I'm using DESeq2 to find DE genes between two conditions on RNAseq public data, with 19 "non responders" samples, and 22 "responders" samples.
After the DE analysis I've checked the samples read counts of the genes with the best adjusted pvalues.
I often see genes with counts that are low for most samples, but there are few samples with very high counts in one condition, which drives the gene to be given DE I guess.
I'm questionning the biological relevance of those results and I'm wondering if I maybe missed or misunderstood something that would avoid those kind of results.
here is my code :
dds <- DESeqDataSetFromMatrix(countData = count_table, colData = coldata, design = ~ condition)
keep <- rowSums(counts(dds)) >= 10
dds <- dds[keep,]
dds$condition <- relevel(dds$condition, ref = "R")
dds <- DESeq(dds)
res05 <- results(dds, alpha=0.05)
Thanks in advance for any input !
Ok thank you very much for this information, I will dig into lfcShrink !