Should I skip the eBayes step when using Limma for Affymetrix miRNA v1 chip?
1
0
Entering edit mode
Guest User ★ 13k
@guest-user-4897
Last seen 10.2 years ago
Dear List, I am working with Affymetrix's miRNA V1 chip, which uses very different probe sets for different molecule types, e.g. 4 identical probes for one miR, or 11 different probes for a snoRNA. I have read that the eBayes step assumes equal error variance between probe sets so it is not suitable for this kind of mixed set of probe set designs. To further complicate matters I am thinking about generating a custom CDF where the miR probe sets would have varied number of probes. http://pomelo2.bioinfo.cnio.es/help/pomelo2-help.html#toc10 Should I look at everything through Limma without the eBayes step (making it equivelant to a normal t-test?), or separate into several different analyses for different molecule types and only drop the eBayes step for the miRs (which will have varying sizes of probe sets)? Many thanks, Scott -- output of sessionInfo(): > sessionInfo() R version 3.0.2 (2013-09-25) Platform: x86_64-w64-mingw32/x64 (64-bit) locale: [1] LC_COLLATE=English_United Kingdom.1252 [2] LC_CTYPE=English_United Kingdom.1252 [3] LC_MONETARY=English_United Kingdom.1252 [4] LC_NUMERIC=C [5] LC_TIME=English_United Kingdom.1252 attached base packages: [1] stats graphics grDevices utils datasets methods base -- Sent via the guest posting facility at bioconductor.org.
miRNA cdf probe limma miRNA cdf probe limma • 2.0k views
ADD COMMENT
5
Entering edit mode
@gordon-smyth
Last seen 3 hours ago
WEHI, Melbourne, Australia

Dear Scott,

You are misinterpreting the empirical Bayes assumption of eBayes. eBayes doesn't assume equal variances, it only assumes that the variances can be considered as a random sample from the same distribution.

Omitting the eBayes step would would throw the baby out with the bath water.

I doubt that the error variance depends quite as directly on the number of probes in a probe-set as you might think.  When we have analysed the miRNA Affymetrix chip, we have found that it has major problems from the point of view of normalization, while the issue that you raise is relatively minor.

I could suggest ways to take into account the number of probes per probe-set in the eBayes calculations, but I don't think this will be important.

Best wishes
Gordon

PS. If you have the choice, RNA-seq is cheaper and better.

[This answer was originally posted to the Bioconductor mailing list on 22 August 2014.]

ADD COMMENT
1
Entering edit mode

Just for the record, here is how you can incorporate probe-set size into the eBayes step to see whether it is important.  I will simulate a little toy example where we know there should be a trend:

  library(limma)
  n <- 1:100 # number of probes per probe-set
  ID <- rep(1:100,n) # probe-set IDs
  nprobes <- length(ID)
  x <- matrix(rnorm(nprobes*3),nprobes,3)
  y <- avereps(x,ID=ID)

# y has 100 rows, each row is an average of n probes

  design <- matrix(1,3,1)
  fit <- lmFit(y,design)
  fit$Amean <- log(n)
  fit <- eBayes(fit,trend=TRUE)
  plotSA(fit)

Limma will estimate a decreasing trend of variance vs n as well as doing empirical Bayes squeezing around the trend.  The x-axis label of the plot will say "average expression" but it is actually log(n).

Best wishes
Gordon

[This answer was originally posted to the Bioconductor mailing list on 24 August 2014.]

ADD REPLY

Login before adding your answer.

Traffic: 718 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6