Ken Termiso
Hi Sean,
Maybe from now on I'll just email you directly instead of the mailing list.
Thanks for your reply...I did something similar to get what I wanted and I
think it's pretty simple (provided you're using the affy annotation file).
Say I wanted to get a list of all genes related to cell death:
# "ann" is a data frame containing the affy annotation file
# HG-U133A_2_annot_csv.zip off their website
i <- grep("death", as.vector(ann$Gene.Ontology.Biological.Process))
j <- grep("apoptosis", as.vector(ann$Gene.Ontology.Biological.Process))
k <- union(i,j)
k <- sort(k)  # optional, but union() returns its result unsorted
te <- ann[k, ]
Now "te" is a data frame that contains the genes that have either "death" or
"apoptosis" mentioned in GO Biol. Proc.
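The two grep() calls and the union() above can also be collapsed into a single pattern. A minimal sketch, assuming a toy data frame standing in for "ann": the probe IDs and GO strings are invented for illustration, and only the column name Gene.Ontology.Biological.Process comes from the affy file.

```r
# Toy stand-in for the affy annotation data frame; the rows are invented.
ann <- data.frame(
  Probe.Set.ID = c("p1", "p2", "p3", "p4"),
  Gene.Ontology.Biological.Process = c(
    "induction of apoptosis",
    "cell cycle",
    "programmed cell death // apoptosis",
    "transport"
  ),
  stringsAsFactors = FALSE
)

# One pattern with alternation matches either keyword. grep() returns
# row indices in increasing order, so no union() or sort() is needed.
k <- grep("death|apoptosis", ann$Gene.Ontology.Biological.Process,
          ignore.case = TRUE)
te <- ann[k, ]
print(te$Probe.Set.ID)
```

A row that mentions both keywords is matched only once this way, which is also what union() guarantees in the two-step version.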
I see there in your reply that you are using the annotate library -- the
only annotation I've used is the affy file, which contains a lot of
information -- do you recommend using the annotate library over this? I've
been using the affy file b/c it's a simple .csv file and thus is pretty easy
to work with in R.
Thanks again,
>From: Sean Davis <sdavis2@mail.nih.gov>
>To: "Ken Termiso" <jerk_alert@hotmail.com>
>CC: bioconductor@stat.math.ethz.ch
>Subject: Re: [BioC] Anyone have a GO slim list for Affy HG-U133A or
>HG-U133Av2?
>Date: Thu, 24 Feb 2005 12:47:57 -0500
>You could certainly produce such a list by repeating what they have done on
>that website. For example, for GO_slim, biologic process 3 (cell cycle and
>proliferation):
>genes <-
>genes <- genes[!duplicated(genes)]
>This will contain the genes in cell cycle and proliferation. It
>be hard to automate this process for each category. For those
>that include EXCLUDE descriptions, you can use R set commands like %in% to
>get the sets you want.
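
The set-based handling of EXCLUDE descriptions suggested above might look roughly like this. It is only a sketch: the gene symbols are placeholders, the vector names include_genes and exclude_genes are hypothetical, and it does not use the annotate package.

```r
# Hypothetical gene lists; the symbols are placeholders for illustration.
include_genes <- c("geneA", "geneB", "geneC", "geneB")
exclude_genes <- c("geneC", "geneD")

# Drop duplicates, then keep only genes that are not in the excluded set.
include_genes <- include_genes[!duplicated(include_genes)]
slim_genes <- include_genes[!(include_genes %in% exclude_genes)]

# setdiff() gives the same result in one step (it also removes duplicates).
stopifnot(identical(slim_genes, setdiff(include_genes, exclude_genes)))
print(slim_genes)
```

Using %in% keeps the original order of the included genes, which can be handy when the list is later used to index back into an annotation table.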
>On Feb 24, 2005, at 11:44 AM, Ken Termiso wrote:
>>I'm using the affy HG-U133A_2_annot_csv.zip annotation file to annotate my
>>data (which may be a bad idea to begin with..?), and would like to be able
>>to use the GO slim categories to annotate my data (see
>>http://www.spatial.maine.edu/~mdolan/MGI_GO_Slim.html), instead of the
>>extremely detailed GO categories already present in the affymetrix file -
>>Gene.Ontology.Biological.Process, Gene.Ontology.Cellular.Component,
>>Basically, the issue is that I don't want to have the 2,000 - 5,000
>>different annotation groups in my file. I want to be able to run
>>on very general groups, like "development" or "death".
>>Thanks in advance,