Hi,
I am seeking advise on model specification for a gene expression data set (24 genes) with the following covariates: age (n=5) and phenotype (n=2, control and disease). The samples are independent (one organ per animal). I would like to evaluate time-dependent (age-dependent) between class (between phenotype) gene expression differences. From the vignette, it appears that the following model specification is appropriate:
cov <- data.frame(tme=age, grp=phenotype)
null_model <- ~grp + ns(tme, df=4, intercept=F)
full_model <- ~grp + ns(tme, df=4, intercept=F) + (grp):ns(tme, df=4, intercept=F)
However, I recognize that this is the model specification for longitudinal sampling, as contrasted to independent sampling.
Could someone please advise?
Thanks, Warren
Hi Andy,
Could you provide a little more information about the basis for choosing df? I looked through the 2005 PNAS paper and its SI and didn't see a justification for using 2 df for the endotoxin analysis.For what it's worth, I'm doing a longitudinal time course qPCR experiment.
Thanks for any help.
Joe