sub-setting arrays
2
0
Entering edit mode
Assa Yeroslaviz ★ 1.5k
@assa-yeroslaviz-1597
Last seen 5 weeks ago
Germany
Hi everybody I would like to know if there is a way for sub-setting big data.frame of different chromosomes into separate data.frames for the specific chromosomes. I have the results of a microarray experiment in one huge excel sheet. one of the columns is df$Chromosomes I know I can sub-set it using for example: data_9=subset(data,Chromosome=="9") But I would like to know (probably for the next analysis) if there is a way of automating it. Can I, with one command separate my big file into 24 separate data frames (chromosomes 1-22, X,Y)? Thanks Assa [[alternative HTML version deleted]]
Microarray Microarray • 997 views
ADD COMMENT
0
Entering edit mode
@vincent-j-carey-jr-4
Last seen 12 weeks ago
United States
The standard practice in Bioconductor with expression data is to manage the information on expression plus information on samples in a unified way in an ExpressionSet. This has many benefits. Were you to do this, you could get chromosome-specific limitation of expression data in various ways. If the data are from a well-annotated chip family, with a standard .db annotation package, say chp.db, you could do, e.g., X9 = X[ get("9", revmap(chpCHR)), ] # use an appropriate chromosome identifier for the package and all the sample information would propagate. If you were not so lucky, but still used expression set structure, you could put the chromosome information in the featureData component, and use X9 = X[ fData(X)$chrom == "9", ] On Mon, Nov 28, 2011 at 9:30 AM, Assa Yeroslaviz <frymor@gmail.com> wrote: > Hi everybody > > I would like to know if there is a way for sub-setting big data.frame of > different chromosomes into separate data.frames for the specific > chromosomes. > > I have the results of a microarray experiment in one huge excel sheet. one > of the columns is df$Chromosomes > I know I can sub-set it using > for example: > data_9=subset(data,Chromosome=="9") > But I would like to know (probably for the next analysis) if there is a way > of automating it. Can I, with one command separate my big file into 24 > separate data frames (chromosomes 1-22, X,Y)? > > Thanks > Assa > > [[alternative HTML version deleted]] > > _______________________________________________ > Bioconductor mailing list > Bioconductor@r-project.org > https://stat.ethz.ch/mailman/listinfo/bioconductor > Search the archives: > http://news.gmane.org/gmane.science.biology.informatics.conductor > [[alternative HTML version deleted]]
ADD COMMENT
0
Entering edit mode
Heidi Dvinge ★ 2.0k
@heidi-dvinge-2195
Last seen 10.3 years ago
Hi Assa, > Hi everybody > > I would like to know if there is a way for sub-setting big data.frame of > different chromosomes into separate data.frames for the specific > chromosomes. > > I have the results of a microarray experiment in one huge excel sheet. one > of the columns is df$Chromosomes > I know I can sub-set it using > for example: > data_9=subset(data,Chromosome=="9") > But I would like to know (probably for the next analysis) if there is a > way > of automating it. Can I, with one command separate my big file into 24 > separate data frames (chromosomes 1-22, X,Y)? > Try something like split(data, data$Chromosome) HTH \Heidi > Thanks > Assa > > [[alternative HTML version deleted]] > > _______________________________________________ > Bioconductor mailing list > Bioconductor at r-project.org > https://stat.ethz.ch/mailman/listinfo/bioconductor > Search the archives: > http://news.gmane.org/gmane.science.biology.informatics.conductor >
ADD COMMENT

Login before adding your answer.

Traffic: 616 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6