Analyzing mulitple tissues

0

Entering edit mode

Uri David Akavia ▴ 80

@uri-david-akavia-1277

Last seen 10.6 years ago

Hello. I had a run of chips, that included 3 kinds of tissues in different stages of differentiation. Only one repeat, due to budget and biological rarity reasons. The tissues are Cardiac1, Cardiac2, Cardiac3 Skeletal1, Skeletal2 MSC My method of analysis is MAS 5.0, filtering (removing absent and lower than 20) and quantile normalization of the present genes. I want to analyze the differentiation of Cardiac tissues, and the difference between MSC and Cardiac. My question is - how should I filter and normalize the tissues? Should I filter and normalize just the tissues I want to analyze (which will give me different lists of genes for Cardiac/MSC and Skeletal/MSC) or should I filter and analyze all 6 samples, then only comparing the tissues I want (I'm afraid this will give me a lot of noise, with genes expressed at low levels)? My problem with the first method is that it will be a different method for each set of tissues, even though I think it might be more accurate. Thank you very much. Yours, Uri David Akavia

Normalization Normalization • 1.6k views

ADD COMMENT • link updated 19.8 years ago by Gordon Smyth 52k • written 19.8 years ago by Uri David Akavia ▴ 80

0

Entering edit mode

Uri David Akavia ▴ 80

@uri-david-akavia-1277

Last seen 10.6 years ago

David Kipling wrote: > Hi > > a) Method 1. Use limma() on rma-processed data. [It doesn't like MAS5 for > reasons to do with the variance v expression relationship.]. You should then > be able to get a set of moderated t-statistic p-values for each of your pairwise > comparions, plus an overall moderated F statistic (which will pull out genes > changed between any state). Moderated stats are the way to go when you have so > few replicates....it circumvents a nasty false positive effect with such > granular data. Read the limma users guide (there is a command in the package > to bring this up). > I didn't say it, but my arrays are Affymetrix arrays - no dye swaps, no repeats. Is it actually possible to use limma (or any t-statistic) when you have 1 (and only one) value for each sample? The limma guide states that three repeats are prefered. This is strengthened by the examples they give in http://bioinf.wehi.edu.au/limma/usersguide.pdf, all of which have at least a dye swap. So, how can t-statistic work? > b) Method 2. Stick with MAS5, and select potentially differentially regulated > genes based on having a high covariance (sd/mean). You'll need to stabilise > the variance first; I have a script for this which I can send. [Don't use > vsn() on MAS5 data, it isn't designed for it....my script is.] > Indeed, but how do I do the basics? Filter on all 6 samples, normalize all 6, and then select the variant genes using 3 samples, then 4 samples and other 3 samples as criteria? Yours, Uri David Akavia

ADD COMMENT • link 19.8 years ago Uri David Akavia ▴ 80

0

Entering edit mode

Biological inference implies that the "signal" can be observed above the biological variation. If you have no biological replicates, you cannot determine if your signal is higher than the biological variation. So, there is no statistically valid means of analyzing your data that improves on an arbitrary choice of "fold difference", such as 2-fold difference. Naomi S. Altman 814-865-3791 (voice) Associate Professor Bioinformatics Consulting Center Dept. of Statistics 814-863-7114 (fax) Penn State University 814-865-1348 (Statistics) University Park, PA 16802-2111 At 07:07 AM 6/6/2005, Uri David Akavia wrote: >David Kipling wrote: > >>Hi >>a) Method 1. Use limma() on rma-processed data. [It doesn't like MAS5 for >>reasons to do with the variance v expressio n relationship.]. You >>should then >>be able to get a set of moderated t-statistic p-values for each of your >>pairwise >>comparions, plus an overall moderated F statistic (which will pull out genes >>changed between any state). Moderated stats are the way to go when you >>have so >>few replicates....it circumvents a nasty false positive effect with such >>granular data. Read the limma users guide (there is a command in the >>package >>to bring this up). >I didn't say it, but my arrays are Affymetrix arrays - no dye swaps, no >repeats. >Is it actually possible to use limma (or any t-statistic) when you have 1 >(and only one) value for each sample? The limma guide states that three >repeats are prefered. This is strengthened by the examples they give in >http://bioinf.wehi.edu.au/limma/usersguide.pdf, all of which have at least >a dye swap. So, how can t-statistic work? > >>b) Method 2. Stick with MAS5, and select potentially differentially >>regulated >>genes based on having a high covariance (sd/mean). You'll need to stabilise >>the variance first; I have a script for this which I can send. [Don't use >>vsn() on MAS5 data, it isn't designed for it....my script is.] >Indeed, but how do I do the basics? Filter on all 6 samples, normalize all >6, and then select the variant genes using 3 samples, then 4 samples and >other 3 samples as criteria? > >Yours, > >Uri David Akavia > >_______________________________________________ >Bioconductor mailing list >Bioconductor@stat.math.ethz.ch >https://stat.ethz.ch/mailman/listinfo/bioconductor Naomi S. Altman 814-865-3791 (voice) Associate Professor Bioinformatics Consulting Center Dept. of Statistics 814-863-7114 (fax) Penn State University 814-865-1348 (Statistics) University Park, PA 16802-2111

ADD REPLY • link 19.8 years ago Naomi Altman ★ 6.0k

0

Entering edit mode

Hi Naomi, I am glad you raised this issue of lack of replication. I fully take on board your point. Something is niggling me, however, so hopefully someone with a better stats background than me can help. What puzzles me is that limma will allow you to run experiments with no replication *and* will return t-statistics for something like a 3x1 comparison. I don't generally run experiments without replication (he says quickly, defending himself!) but I've just tried a mock experiment that is made up of 3 states with 4, 3, and 1 degrees of replication (see snippet below). Ordering by absolute(M) gives a ranking that is related to, but clearly distinct, from ranking by absolute(t statistic) or p-value. Out of curiosity, what is limma doing here and how should one interpret these t stats/p-values (if indeed one should!)? Are they any use over simple M values? Regards David ###################### # Based on an example in the limmaUsersGuide data1 <- ReadAffy() eset <- rma(data1) # Note the single chip in group 2 design <- model.matrix(~ -1+factor(c(1,1,1,1,3,3,3,2))) colnames(design) <- c("group1", "group2", "group3") fit <- lmFit(eset, design) # Contrast 2 is a 1 v 3-chip comparison contrast.matrix <- makeContrasts(group2-group1, group3-group2, group3-group1, levels=design) fit2 <- contrasts.fit(fit, contrast.matrix) fit2 <- eBayes(fit2) topTable(fit2, coef=2, adjust="none") ######################## Prof David Kipling Department of Pathology School of Medicine Cardiff University Heath Park Cardiff CF14 4XN Tel: 029 2074 4847 Email: KiplingD@cardiff.ac.uk On 6 Jun 2005, at 15:22, Naomi Altman wrote: > Biological inference implies that the "signal" can be observed above > the biological variation. If you have no biological replicates, you > cannot determine if your signal is higher than the biological > variation. > > So, there is no statistically valid means of analyzing your data that > improves on an arbitrary choice of "fold difference", such as 2-fold > difference. > > > Naomi S. Altman 814-865-3791 (voice) > Associate Professor > Bioinformatics Consulting Center > Dept. of Statistics 814-863-7114 (fax) > Penn State University 814-865-1348 (Statistics) > University Park, PA 16802-2111

ADD REPLY • link 19.8 years ago David Kipling ▴ 110

0

Entering edit mode

Gordon Smyth 52k

@gordon-smyth

Last seen 16 hours ago

WEHI, Melbourne, Australia

> Date: Tue, 7 Jun 2005 08:50:46 +0100 > From: David Kipling <kiplingd@cardiff.ac.uk> > Subject: Re: [BioC] Analyzing mulitple tissues > To: Naomi Altman <naomi@stat.psu.edu> > Cc: bioconductor@stat.math.ethz.ch > > Hi Naomi, > > I am glad you raised this issue of lack of replication. I fully take > on board your point. Something is niggling me, however, so hopefully > someone with a better stats background than me can help. > > What puzzles me is that limma will allow you to run experiments with no > replication limma does not compute t-statistics for p-values when there is no replication. In the absence of replication, limma simply computes fold changes. What is puzzling about that? > *and* will return t-statistics for something like a 3x1 > comparison. I don't generally run experiments without replication (he > says quickly, defending himself!) but I've just tried a mock experiment > that is made up of 3 states with 4, 3, and 1 degrees of replication > (see snippet below). Ordering by absolute(M) gives a ranking that is > related to, but clearly distinct, from ranking by absolute(t statistic) > or p-value. > > Out of curiosity, what is limma doing here and how should one interpret > these t stats/p-values (if indeed one should!)? Are they any use over > simple M values? Yes, they are almost always better than simply using fold changes. Using M-values alone would make no use of replication while the t-statistics make use of whatever replication is available. Put very simply, some replication is better than none. You seem to be concerned in your mock experiment that one of the states has no replication. The limma analysis estimates the variance for each gene from the replicates available for states 1 and 2 and applies that estimate to state 3 as well. This analysis is perfectly valid provided that the variability of the expression values is similar in state 3 to that in states 1 and 2. Even when the variability is different in state 3, the limma analysis still gives a better ranking than fold change, even for comparisons involving state 3, in most cases. The basic assumption is that, across genes, the variance in state 3 is positively associated with the variance in states 1 and 2. This is a very weak assumption which is almost always true in practice, as genewise differences in variability tend to dominate state-wise differences. Gordon > Regards > > David > > > > ###################### > > # Based on an example in the limmaUsersGuide > data1 <- ReadAffy() > eset <- rma(data1) > > # Note the single chip in group 2 > design <- model.matrix(~ -1+factor(c(1,1,1,1,3,3,3,2))) > colnames(design) <- c("group1", "group2", "group3") > > fit <- lmFit(eset, design) > > # Contrast 2 is a 1 v 3-chip comparison > contrast.matrix <- makeContrasts(group2-group1, group3-group2, > group3-group1, levels=design) > fit2 <- contrasts.fit(fit, contrast.matrix) > fit2 <- eBayes(fit2) > topTable(fit2, coef=2, adjust="none") > > ######################## > > > Prof David Kipling > Department of Pathology > School of Medicine > Cardiff University > Heath Park > Cardiff CF14 4XN > > Tel: 029 2074 4847 > Email: KiplingD@cardiff.ac.uk > On 6 Jun 2005, at 15:22, Naomi Altman wrote:

ADD COMMENT • link 19.8 years ago Gordon Smyth 52k

0

Entering edit mode

Dear Naomi, Gordon and Uri, If I might try to bring together Naomi's comments with those of Gordon and see if I have followed this correctly: Uri's original design is: Cardiac1, Cardiac2, Cardiac3 Skeletal1, Skeletal2 MSC That is, a 3x2x1 6-chip experiment. Naomi commented that with no replication (i.e. the single MSC chip) one cannot judge biological variation and the best thing to do is a simple fold-change: "...there is no statistically valid means of analyzing your data that improves on an arbitrary choice of 'fold difference', such as 2-fold difference." {Naomi} Then Gordon replied: >> Out of curiosity, what is limma doing here and how should one interpret >> these t stats/p-values (if indeed one should!)? Are they any use over >> simple M values? > > Yes, they are almost always better than simply using fold changes. Using > M-values alone would > make no use of replication while the t-statistics make use of whatever > replication is available. > Put very simply, some replication is better than none. > > You seem to be concerned in your mock experiment that one of the states has no > replication. The > limma analysis estimates the variance for each gene from the replicates > available for states 1 and > 2 and applies that estimate to state 3 as well. This analysis is perfectly > valid provided that > the variability of the expression values is similar in state 3 to that in > states 1 and 2. > > Even when the variability is different in state 3, the limma analysis still > gives a better ranking > than fold change, even for comparisons involving state 3, in most cases. The > basic assumption is > that, across genes, the variance in state 3 is positively associated with the > variance in states 1 > and 2. This is a very weak assumption which is almost always true in > practice, as genewise > differences in variability tend to dominate state-wise differences. If I follow Gordon correctly, his argument is that in an experimental design like this you can make an estimate for the variance of a probeset based on its behaviour in the other samples (with some opportunity for discussion as to how valid an assumption this is!). This results in a situation where not all fold changes are equal, and this will actually work better than a simple FC estimate for ranking the genes for further exploration. In other words, in the 3x2x1 design such as this you could get two probesets that had identical M values (calculated between the triplicate and single chips) *but* limma would rank the probeset with the higher overall variability across the six chips lower down the list (seen as a different p-value/t statistic). So Uri could use limma to study this 3x2x1 design and be able to extract potentially differentially regulated genes between the (single) MSC sample and either/both of the other two sample classes using the limma p-values returned, and this would be a more powerful approach than simple fold-changes - yes? This is a very interesting point for those of us in core facilities having to help users who insist - for reasons of finances, scarce samples, or the fact they the experiments are of a preliminary grant-generating nature - on doing small-scale experiments where some samples have no replication at all. Me telling them to go away and come back with 15-fold replication isn't particularly helpful(!), and instead suggestions as to how to wring the maximum information from such narrow datasets are what they need. Thanks everyone, David Professor David Kipling Department of Pathology School of Medicine Cardiff University Heath Park Cardiff CF14 4XN Tel: +44 29 2074 4847 Fax: +44 29 2074 4276 Email: KiplingD@cardiff.ac.uk

ADD REPLY • link 19.8 years ago David Kipling ▴ 110

0

Entering edit mode

Gordon Smyth 52k

@gordon-smyth

Last seen 16 hours ago

WEHI, Melbourne, Australia

>David Kipling KiplingD at cardiff.ac.uk >Tue Jun 7 15:09:17 CEST 2005 > >Dear Naomi, Gordon and Uri, > >If I might try to bring together Naomi's comments with those of Gordon and >see if I have followed this correctly: > >Uri's original design is: > >Cardiac1, Cardiac2, Cardiac3 >Skeletal1, Skeletal2 >MSC > >That is, a 3x2x1 6-chip experiment. > >Naomi commented that with no replication (i.e. the single MSC chip) one >cannot judge biological variation and the best thing to do is a simple >fold-change: "...there is no statistically valid means of analyzing your >data that improves on an arbitrary choice of 'fold difference', such as >2-fold difference." {Naomi} There isn't any conflict between Naomi's comments and my own. Naomi actually refered to "biological replication" rather than to replication per se. She was reacting to Uri's original post which made it very unclear whether there is any biological replication in his experiment at all, i.e., it may be that Cardiac1, Cardiac2 etc are not in fact biological replicates. Replication is a subtle business, and Uri would need to describe his process and population in much more detail than he done for more to be said. I may be wrong, but I doubt that Naomi was especially concerned about the single MSC chip. On the other hand, my comments were addressed at your mock experiment and were made on the basis that all replication for states 1 and 2 is true biological replication. >Then Gordon replied: > > >> Out of curiosity, what is limma doing here and how should one interpret > >> these t stats/p-values (if indeed one should!)? Are they any use over > >> simple M values? > > > > Yes, they are almost always better than simply using fold changes. Using > > M-values alone would > > make no use of replication while the t-statistics make use of whatever > > replication is available. > > Put very simply, some replication is better than none. > > > > You seem to be concerned in your mock experiment that one of the states > has no > > replication. The > > limma analysis estimates the variance for each gene from the replicates > > available for states 1 and > > 2 and applies that estimate to state 3 as well. This analysis is perfectly > > valid provided that > > the variability of the expression values is similar in state 3 to that in > > states 1 and 2. > > > > Even when the variability is different in state 3, the limma analysis still > > gives a better ranking > > than fold change, even for comparisons involving state 3, in most > cases. The > > basic assumption is > > that, across genes, the variance in state 3 is positively associated > with the > > variance in states 1 > > and 2. This is a very weak assumption which is almost always true in > > practice, as genewise > > differences in variability tend to dominate state-wise differences. All my comments below are made on the basis that all replication for states 1 and 2 is biological replication. >If I follow Gordon correctly, his argument is that in an experimental design >like this you can make an estimate for the variance of a probeset based on >its behaviour in the other samples (with some opportunity for discussion as >to how valid an assumption this is!). This results in a situation where >not all fold changes are equal, and this will actually work better than a >simple FC estimate for ranking the genes for further exploration. Yes. >In other words, in the 3x2x1 design such as this you could get two probesets >that had identical M values (calculated between the triplicate and single >chips) *but* limma would rank the probeset with the higher overall >variability across the six chips lower down the list (seen as a different >p-value/t statistic). Exactly. >So Uri could use limma to study this 3x2x1 design and be able to extract >potentially differentially regulated genes between the (single) MSC sample >and either/both of the other two sample classes using the limma p-values >returned, and this would be a more powerful approach than simple >fold-changes - yes? His design is actually 3+2+1 rather than 3x2x1. If his Cardiac and Skeletal samples are biological replicates, then yes. If not, see Naomi's comments. >This is a very interesting point for those of us in core facilities having >to help users who insist - for reasons of finances, scarce samples, or the >fact they the experiments are of a preliminary grant-generating nature - on >doing small-scale experiments where some samples have no replication at all. >Me telling them to go away and come back with 15-fold replication isn't >particularly helpful(!), and instead suggestions as to how to wring the >maximum information from such narrow datasets are what they need. Making the best use of small-scale experiments is the primary purpose of the limma software. In general, you can still do an analysis with only 1 chip for one of the groups, unless you have a strong reason to think that the variability of expression will be quite different in that group to the others. Generally speaking, the process will work best when the different groups (e.g., tissue types) are as similar as possible. Gordon >Thanks everyone, > >David > > > > >Professor David Kipling >Department of Pathology >School of Medicine >Cardiff University >Heath Park >Cardiff CF14 4XN > >Tel: +44 29 2074 4847 >Fax: +44 29 2074 4276 >Email: KiplingD at cardiff.ac.uk

ADD COMMENT • link 19.8 years ago Gordon Smyth 52k

0

Entering edit mode

Gordon Smyth wrote: > There isn't any conflict between Naomi's comments and my own. Naomi > actually refered to "biological replication" rather than to replication > per se. She was reacting to Uri's original post which made it very > unclear whether there is any biological replication in his experiment at > all, i.e., it may be that Cardiac1, Cardiac2 etc are not in fact > biological replicates. Replication is a subtle business, and Uri would > need to describe his process and population in much more detail than he > done for more to be said. I may be wrong, but I doubt that Naomi was > especially concerned about the single MSC chip. Actually, you're correct. Cardiac1, Cardiac2 are different stages of the same tissue. They are NOT biological replicates. However, I think my question was misunderstood. I use quantile normalization and select genes by fold change. I wish to see the differentiation of Cardiac, and to compare Cardiac to MSC. My question is: Should I normalize on all tissues, or simply on the tissues analyzed at the time? If I normalize all 6 samples, I'm afraid that the smaller differences between Cardiac will be masked by the larger differences between Cardiac and MSC. If I normalize only the tissues analyzed, it means that I will normalize multiple times, each time differently for different tissue (leading to different values). What is preferable? What are additional pros and cons for each method? Thank you, Uri David Akavia

ADD REPLY • link 19.8 years ago Uri David Akavia ▴ 80

0

Entering edit mode

The importance of using RMA or GCRMA rather than MAS is far greater than the impact of the factors you're discussing here. Furthermore, quantile normalisation is not designed to be used in conjunction with MAS and probe-set filtering. RMA will work best on all the chips at once, so I would personally do that. Simplest and best. Gordon At 04:52 PM 8/06/2005, Uri David Akavia wrote: >Gordon Smyth wrote: >>There isn't any conflict between Naomi's comments and my own. Naomi >>actually refered to "biological replication" rather than to replication >>per se. She was reacting to Uri's original post which made it very >>unclear whether there is any biological replication in his experiment at >>all, i.e., it may be that Cardiac1, Cardiac2 etc are not in fact >>biological replicates. Replication is a subtle business, and Uri would >>need to describe his process and population in much more detail than he >>done for more to be said. I may be wrong, but I doubt that Naomi was >>especially concerned about the single MSC chip. > >Actually, you're correct. >Cardiac1, Cardiac2 are different stages of the same tissue. >They are NOT biological replicates. > >However, I think my question was misunderstood. >I use quantile normalization and select genes by fold change. > >I wish to see the differentiation of Cardiac, and to compare Cardiac to MSC. >My question is: Should I normalize on all tissues, or simply on the >tissues analyzed at the time? >If I normalize all 6 samples, I'm afraid that the smaller differences >between Cardiac will be masked by the larger differences between Cardiac >and MSC. >If I normalize only the tissues analyzed, it means that I will normalize >multiple times, each time differently for different tissue (leading to >different values). > >What is preferable? What are additional pros and cons for each method? > >Thank you, > >Uri David Akavia

ADD REPLY • link 19.8 years ago Gordon Smyth 52k

0

Entering edit mode

I agree completely with Gordon's comments. The less replication you have, the more you need to rely on the statistical model. The basic model says that the variance does not change with the experimental condition. If you take this seriously, you could replicate in the "least expensive" condition. But it is a very strong assumption. An even stronger assumption is that ALL the genes have the same variance for the same condition - this is equivalent to selecting a "fold change" criterion. e.g. Do you really believe that the variability of the differentially expressing genes is the same in "normal" and "cancerous" tissue, or in "developing" and "mature" organs? What I tell people is that if you don't plan to replicate all your conditions, you should plan to have a statistician as a collaborator. Then at least you have someone who knows how to check the sensitivity of the analysis to the assumptions made, and possibly check the validity of the assumptions. As well, statistical optimal design can help in designing experiments with higher statistical power for the same cost (although sometimes the obvious design has the highest power), or cut your cost. I just reduced the array budget of 1 experiment by 25% by a very modest change in the design. Besides the reduction in the number of arrays, the number of animals was similarly reduced. --Naomi At 10:46 PM 6/7/2005, Gordon Smyth wrote: >>David Kipling KiplingD at cardiff.ac.uk >>Tue Jun 7 15:09:17 CEST 2005 >> >>Dear Naomi, Gordon and Uri, >> >>If I might try to bring together Naomi's comments with those of Gordon and >>see if I have followed this correctly: >> >>Uri's original design is: >> >>Cardiac1, Cardiac2, Cardiac3 >>Skeletal1, Skeletal2 >>MSC >> >>That is, a 3x2x1 6-chip experiment. >> >>Naomi commented that with no replication (i.e. the single MSC chip) one >>cannot judge biological variation and the best thing to do is a simple >>fold-change: "...there is no statistically valid means of analyzing your >>data that improves on an arbitrary choice of 'fold difference', such as >>2-fold difference." {Naomi} > >There isn't any conflict between Naomi's comments and my own. Naomi >actually refered to "biological replication" rather than to replication >per se. She was reacting to Uri's original post which made it very unclear >whether there is any biological replication in his experiment at all, >i.e., it may be that Cardiac1, Cardiac2 etc are not in fact biological >replicates. Replication is a subtle business, and Uri would need to >describe his process and population in much more detail than he done for >more to be said. I may be wrong, but I doubt that Naomi was especially >concerned about the single MSC chip. > >On the other hand, my comments were addressed at your mock experiment and >were made on the basis that all replication for states 1 and 2 is true >biological replication. > >>Then Gordon replied: >> >> >> Out of curiosity, what is limma doing here and how should one interpret >> >> these t stats/p-values (if indeed one should!)? Are they any use over >> >> simple M values? >> > >> > Yes, they are almost always better than simply using fold changes. Using >> > M-values alone would >> > make no use of replication while the t-statistics make use of whatever >> > replication is available. >> > Put very simply, some replication is better than none. >> > >> > You seem to be concerned in your mock experiment that one of the >> states has no >> > replication. The >> > limma analysis estimates the variance for each gene from the replicates >> > available for states 1 and >> > 2 and applies that estimate to state 3 as well. This analysis is >> perfectly >> > valid provided that >> > the variability of the expression values is similar in state 3 to that in >> > states 1 and 2. >> > >> > Even when the variability is different in state 3, the limma analysis >> still >> > gives a better ranking >> > than fold change, even for comparisons involving state 3, in most >> cases. The >> > basic assumption is >> > that, across genes, the variance in state 3 is positively associated >> with the >> > variance in states 1 >> > and 2. This is a very weak assumption which is almost always true in >> > practice, as genewise >> > differences in variability tend to dominate state-wise differences. > >All my comments below are made on the basis that all replication for >states 1 and 2 is biological replication. > >>If I follow Gordon correctly, his argument is that in an experimental design >>like this you can make an estimate for the variance of a probeset based on >>its behaviour in the other samples (with some opportunity for discussion as >>to how valid an assumption this is!). This results in a situation where >>not all fold changes are equal, and this will actually work better than a >>simple FC estimate for ranking the genes for further exploration. > >Yes. > >>In other words, in the 3x2x1 design such as this you could get two probesets >>that had identical M values (calculated between the triplicate and single >>chips) *but* limma would rank the probeset with the higher overall >>variability across the six chips lower down the list (seen as a different >>p-value/t statistic). > >Exactly. > >>So Uri could use limma to study this 3x2x1 design and be able to extract >>potentially differentially regulated genes between the (single) MSC sample >>and either/both of the other two sample classes using the limma p-values >>returned, and this would be a more powerful approach than simple >>fold-changes - yes? > >His design is actually 3+2+1 rather than 3x2x1. If his Cardiac and >Skeletal samples are biological replicates, then yes. If not, see Naomi's >comments. > >>This is a very interesting point for those of us in core facilities having >>to help users who insist - for reasons of finances, scarce samples, or the >>fact they the experiments are of a preliminary grant-generating nature - on >>doing small-scale experiments where some samples have no replication at all. >>Me telling them to go away and come back with 15-fold replication isn't >>particularly helpful(!), and instead suggestions as to how to wring the >>maximum information from such narrow datasets are what they need. > >Making the best use of small-scale experiments is the primary purpose of >the limma software. > >In general, you can still do an analysis with only 1 chip for one of the >groups, unless you have a strong reason to think that the variability of >expression will be quite different in that group to the others. Generally >speaking, the process will work best when the different groups (e.g., >tissue types) are as similar as possible. > >Gordon > >>Thanks everyone, >> >>David >> >> >> >> >>Professor David Kipling >>Department of Pathology >>School of Medicine >>Cardiff University >>Heath Park >>Cardiff CF14 4XN >> >>Tel: +44 29 2074 4847 >>Fax: +44 29 2074 4276 >>Email: KiplingD at cardiff.ac.uk > >_______________________________________________ >Bioconductor mailing list >Bioconductor@stat.math.ethz.ch >https://stat.ethz.ch/mailman/listinfo/bioconductor Naomi S. Altman 814-865-3791 (voice) Associate Professor Bioinformatics Consulting Center Dept. of Statistics 814-863-7114 (fax) Penn State University 814-865-1348 (Statistics) University Park, PA 16802-2111

ADD REPLY • link 19.8 years ago Naomi Altman ★ 6.0k

Login before adding your answer.