Hi all
I have a data set of 27 sample from 9 different time-points with triplicates of control and knockouts.
sampleNames condition treatment TP batch A CTRL0 CTRL 0 1 B CTRL0 CTRL 0 1 C CTRL0 CTRL 0 1 D KO0 KO 0 1 E KO0 KO 0 1 F KO0 KO 0 1 G CTRL3 CTRL 3 1 H CTRL3 CTRL 3 1 I CTRL3 CTRL 3 1 J KO3 KO 3 1 K KO3 KO 3 1 L KO3 KO 3 1 M CTRL6 CTRL 6 1 N CTRL6 CTRL 6 1 O CTRL6 CTRL 6 1 P KO6 KO 6 1 P2 KO6 KO 6 2 Q KO6 KO 6 1 Q2 KO6 KO 6 2 R KO6 KO 6 1 R2 KO6 KO 6 2 S CTRL9 CTRL 9 1 T CTRL9 CTRL 9 1 U CTRL9 CTRL 9 1 V KO9 KO 9 1 W KO9 KO 9 1 X KO9 KO 9 1
I have already done a pair0wise analysis within each time-point, comparing the KO against the control. I would like now to test for genes differentially expressed in over time.
from previous experiment I know I can use the LRT test for that. This time it is different though, as I have control samples for each of the time points. in the previous analyses I have had a 0 time-point and have therefore used the reduced=~1
reduced model.
In the case above I would think the correct model will be
full_model = formula(~ treatment + TP + treatment:TP)
the reduced model I would like to use would be in this case
reduced_model = formula(~ treatment + TP)
And than run :
dds = DESeq(dds, test="LRT", full=full_model, reduced=reduced_model)
Am I correct in my assumption, that this will give me the genes differentially changed in a condition-specific manner over time?
For that I have two more questions -
1. should I use the condition
column rather than the treatment
column, where the each control and treated samples are time-point specific?
2. What should be changed, fig i would also like to include possible batch effects? Some samples were sequenced separately, due to technical difficulties.
would it be enough just add the batch effect to the two models like that?
full_model = formula(~ treatment + TP + batch + treatment:TP)
the reduced model I would like to use would be in this case
reduced_model = formula(~ treatment + TP + batch)
thanks
Assa
I see 4 time points not 9. Can you explain?
You only have 3 samples from batch 2 correct? Do these samples separate in the PCA plot?
1. sorry, yes these are only four time-points. I mixed it with the time-point 9.
2. Yes. the sample from the different batches fit perfectly well together in a PCA
BTW, does it matter what I put in the design parameter, when creating the dds object?. I was thinking of doing it like that:
dds<-DESeqDataSetFromMatrix(countData=countTable, colData=phenotype, design= ~ treatment + TP + batch + treatment:TP