Hi all,
I am looking to identify the differentially expressed genes within a cell line upon treatment, and have noticed that for some genes, despite there being significance in the log2FC at my chosen threshold of >1.5 (adj.p <0.05), when cross-checking against their corresponding normalized counts, the two groups show no significant difference. Since the log2FC is derived from the normalized counts, I'm having difficulty understanding why this would be the case, even if there was variability in my replicates/small sample size etc.
Would greatly appreciate if someone could clear this up for mem and confirm whether I would need validation from the normalized counts to confirm what I see on the log2FC level.
Cheers
Thank you so much for your reply.
Below I've only attached the code I used to normalize the raw counts, "vsd" was used to perform the t-test.
As an example, Exosc6 had a logFC >1.5 and adj. p.value <0.05 when comparing groups 2.5uM and DMSO. However, when performing a t.test (two-tail, paired) between these two groups on the normalized counts (vsd), the p.value is 0.079.
Is this a common occurrence and is it something to worry about?
To clarify, if I'd like to do validation of the genes by qPCR, do I have to ensure the normalized counts between the groups show significanceto confirm the logFC and pvalue observed?
Another question is I noticed that the normalized counts were 7.557 for a lot of the samples, and I think this may be the default value for those that show no expression. How is this value calculated?
Thank you again, please let me know if more information is required :)