I was running DESeq2 and I noticed a message/warning that I don't think I've seen before:
the design formula contains one or more numeric variables that have mean or standard deviation larger than 5 (an arbitrary threshold to trigger this message). it is generally a good idea to center and scale numeric variables in the design to improve GLM convergence.
I just want to make sure I understand correctly what it wants, because it's not mentioned in the vignette. So if I had a variable like age, which ranges from, say, 1 to 100, I should transform it like this:
age.rescaled <- ( age - mean(age) ) / sd(age)
Is this correct?
Sometimes I get a very different p-value when I do a simple linear transform of an independent variable.
That . . . doesn't seem right...