Question - Merged Arrays and Averaging Method

Marcos Pinho ▴ 30

@marcos-pinho-2600

An embedded and charset-unspecified text was scrubbed...
Name: not available
Url: https://stat.ethz.ch/pipermail/bioconductor/attachments/20080123/2376d66c/attachment.ksh

updated 17.3 years ago by Oosting, J. PATH ▴ 550 • written 17.3 years ago by Marcos Pinho ▴ 30

score 0 · Answer 1 · 2008-01-23

> I am a new user of the Bioconductor suite and have a question for the
> list. When performing an analysis, since I have duplicate arrays, I am
> merging my duplicate arrays before normalizing my data. I have an option
> to use the mean or median values and also to log2 transform my data before
> averaging. Would anybody with experience care to comment about the
> advantages or disadvantages regarding merging, averaging and log
> transforming your data during the analysis.
>

Hi Marcos,

If you have duplicates of all arrays in your experiment, it is worthwhile to use this data. The limma package, for instance, has functionality to use technical replication in the analysis (see the duplicateCorrelation() function).

If you do not have duplicates for all samples, there are several things to consider.

- If you do average the arrays, then the samples that have a replicate will show lower variability of overall gene expression. Unequal variability between samples is a problem for the statistics, since most tests assume comparable variance across samples.
- Averaging should be performed on transformed (log or vsn) values.
- For a duplicate, the mean and median do not differ; only for 3 or more replicates will medians be more robust than means.

Personally, when I do not have duplicates for all samples, I discard one of the duplicates, usually after checking which of the two comes out better in QC.

Jan Oosting
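The points about averaging on the log scale and about mean versus median can be checked with a little arithmetic. The following is only a minimal sketch in plain Python rather than R, and the intensity values and the helper name `log2_average` are made up for illustration; they do not come from the original post or from any Bioconductor package.

```python
import math
import statistics

# Hypothetical raw intensities for one probe (made-up values, not real data).
duplicate = [200.0, 3200.0]          # the same sample hybridised twice
triplicate = [200.0, 220.0, 3200.0]  # third value acts as an outlier


def log2_average(values, average=statistics.mean):
    """Log2-transform each value first, then average (mean by default)."""
    return average([math.log2(v) for v in values])


# With exactly two values, the mean and the median are the same number.
dup_mean = log2_average(duplicate, statistics.mean)
dup_median = log2_average(duplicate, statistics.median)

# With three values, the median stays at the middle value while the mean
# is pulled towards the outlier.
tri_mean = log2_average(triplicate, statistics.mean)
tri_median = log2_average(triplicate, statistics.median)

# Averaging the raw values and transforming afterwards gives a larger
# number, dominated by the brighter of the two arrays.
raw_then_log = math.log2(statistics.mean(duplicate))

print("duplicate  mean / median:", dup_mean, dup_median)
print("triplicate mean / median:", tri_mean, tri_median)
print("log2 of raw mean:", raw_then_log)
```

For the duplicate, both averages coincide, so the choice between mean and median only matters once there are three or more replicates. The last line shows why the order of transforming and averaging is not interchangeable: the log of the raw mean is never smaller than the mean of the logs.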