Hi Everyone!!
I am analyzing 4 tissue types obtained from the 4 different layers of a plant, sampled at 4 different time points ( days after flowering (daf). Each tissue has 2-3 biological replicate. The samples are grouped as follows
sample_id | group |
---|---|
cellArep116daf A16daf | |
cellArep216daf | A16daf |
cellArep316daf | A16daf |
cellBrep116daf | B16daf |
cellBrep216daf | B16daf |
cellBrep14daf | B4daf |
cellBrep24daf | B4daf |
cellBrep18daf | B8daf |
cellNrep116daf | N16daf |
cellNrep216daf | N16daf |
cellNrep316daf | N16daf |
cellNrep14daf | N4daf |
cellNrep24daf | N4daf |
cellNrep34daf | N4daf |
cellNrep18daf | N8daf |
cellNrep28daf | N8daf |
cellNrep38daf | N8daf |
cellOrep116daf | O16daf |
cellOrep14daf | O4daf |
cellOrep24daf | O4daf |
cellOrep34daf | O4daf |
cellOrep18daf | O8daf |
cellOrep28daf | O8daf |
I wish to perform differential transcript usage analysis to understand how they differ between different time points for same cell type (say differences between B4daf vs B8daf and so on) and then for same time point but different cell types (B16daf vs A16daf). I am using DRIMSeq for that purpose.
I came across this post wherein the section Create the design matrix without intercept due to no clear 'ground state is somewhat unclear to me (the phrase itself is unclear to me). For a scenario like mine where there is no control case condition, how removing the intercept from the design matrix will affect my analysis, and is it desirable to do so because here it is mentioned that we should never remove the intercept term.
I am confused as to how should I go about it. Also setting one_way = FALSE
would be appropriate in my case and how should I set contrast if I am comparing 4 tissues at a time harvested on the same day (say 16daf i.e A16daf vs B16daf vs N16daf vs O16daf)?
So I myself tried comparing the results obtained with/without intercept on my data and I found more genes with differential transcript usage while using intercept. Also, I observed more genes having significant p-values when using intercept. I used the following code:
And without intercept
Will it be worthwhile to go with the intercept approach simply because it gives us more genes with differential transcript usage? In case yes, how do I explain this discrepancy?