For my data set 9 control and 7 cases are there
The following matrix is the design matrix generated by command
design1=model.matrix(~mm)
where mm
is
mm=pData(eset)$treatment
and eset
has been got by following command
m<-as.matrix(read.csv("Alzheimer.csv", sep=",", row.names=1))
tmp <- read.csv("pdata.txt", row.names = 1,sep=",")
pdata <- AnnotatedDataFrame(tmp)
eset <- new("ExpressionSet", exprs = m, phenoData = pdata)
mm=pData(eset)$treatment
design1
(Intercept) mm
1 1 0
2 1 0
3 1 0
4 1 0
5 1 0
6 1 0
7 1 0
8 1 0
9 1 0
10 1 1
11 1 1
12 1 1
13 1 1
14 1 1
15 1 1
16 1 1
attr(,"assign")
[1] 0 1
I am very new to the area for microarray data analysis I am making contrast matrix by the following commands and getting some warnnings Please help me in clarifying the points. I am confused whether I am doing correctly or not.
> contrastm <- makeContrasts(mm, levels = design1)
Warning message:
In makeContrasts(mm, levels = design1) : Renaming (Intercept) to Intercept
>contrastm
Contrasts
Levels mm
Intercept 0
mm 1
> fit2 <- contrasts.fit(fit, contrastm)
Warning message:
In contrasts.fit(fit, contrastm) :
row names of contrasts don't match col names of coefficients
> fit2b <- eBayes(fit2)
> topTable(fit2b,coef=1)
logFC AveExpr t P.Value adj.P.Val B
215306_at 431.0302 376.9312 8.366248 5.120111e-07 0.01140914 -4.457608
218801_at -281.7603 491.2188 -7.091080 3.782164e-06 0.02504624 -4.467288
203894_at -413.7508 862.6063 -7.063322 3.959712e-06 0.02504624 -4.467541
219746_at 283.7810 439.5875 6.986785 4.496026e-06 0.02504624 -4.468248
213666_at -419.2190 702.4250 -6.299508 1.457800e-05 0.05814260 -4.475374
203461_at 222.0429 227.0438 6.255007 1.576693e-05 0.05814260 -4.475888
213929_at 151.5079 209.1625 6.171945 1.826497e-05 0.05814260 -4.476866
209129_at 218.1206 436.3500 6.015035 2.417706e-05 0.06734219 -4.478782
218456_at -700.8302 1294.4313 -5.822691 3.425263e-05 0.08330774 -4.481258
200980_s_at -334.3381 1216.2938 -5.774792 3.738623e-05 0.08330774 -4.481897
why logFC is value is so high when thresold is only 1.5
-ve values are down regulated gene and prositive values are uprelulated genes?
The data and the code I have pasted here
That's just a part of your code. And looking at that GSE I don't see where there's any treatment? There is also no need to do it the way you have. An alternative is to do
So the only changes are between the severe Alzheimer disease and the controls, and there are about 411 genes with an FDR of 5%.