[devteam-bioc] Why does a GOAllFrame contain more rows than its argument GoFrame?
0
0
Entering edit mode
Marc Carlson ★ 7.2k
@marc-carlson-2264
Last seen 8.4 years ago
United States
Hi Asis, So to get the most thorough answers to questions about bioconductor, please post them here at the official bioconductor list. The core team is a small group and we can't possibly presume to know as much as the 3000 other bright and talented people who subscribe to our list. ;) Answers to your questions: 1) GOALLFrame has many annotations because it is an expansion of GOFrame based on the Gene Ontology. The ontology is a DAG, and so it's graph like structure means that there are many ancestor terms for any given term which are (implicitly) related whenever a specific term is chosen. The GOALLFrame adds back those terms. 2) The extra GO terms come only from GO.db. The gene ids are simply repeated as appropriate when ancestor GO terms are added. 3) The current query used to extract parent terms does not discriminate based on the type of relationship. It just gets all the GO terms that are ancestors to the current set of terms and adds them to the list as also mapping to the same gene ids. The other code that was intended to be used downstream of this step should be doing something reasonable for you (with respect to relationship type) already. And since this is basically intended to be an annotation resource, it does not really make sense to filter results at this stage in the game. The GOAllFrame is meant to basically replace what you would find in a GO2ALL mapping from an annotation package. So it's not intended to be filtered in any way. I hope this explains things for you. Please let me know if you have other questions, Marc On 10/02/2013 09:26 AM, Maintainer wrote: > Dear Bioconductor Maintainers, > > thank you for providing such a useful tool-set. > > I have a question regarding the package AnnotationDbi, specifically > the classes GOFrame and GOALLFrame. > > During a GO Enrichment Analysis I create a data frame with Arabidopsis > thaliana GO annotations and from that first a GOFrame and than from > this GOFrame a GOALLFrame. Checking the result with > > nrow( getGOFrameData( athal.go.all.frame ) ) # The GOAllFrame > > and comparing it with > > nrow( athal.go.frame ) # The > GoFrame > > I realize that the GOALLFrame has more than 5 times more rows than my > original GO annotation table. If I provide > organism='Arabidopsis thaliana' > to the constructor of GOFrame this ratio increases even further. > > Unfortunately I could not find any documentation on this, so I feel > forced to bother you with my questions: > > 1) Why does GOALLFrame so many more annotations? > 2) Why and from where does it retrieve the organism specific ones that > are added when a model organism like 'Arabidopsis thaliana' is provided? > 3) I suspected that all ancestors of annotated terms are added, but > when I did so myself, I still got less GO term annotations? So do you > add ancestors of the "is_a" type and possibly other relationship types > like "part_of" etc. ? > > Please let me know your answers soon. > > Kind regards! > Asis Hallab > > > --- > Asis Hallab > Department Tsiantis > Max Planck Institute for Plant Breeding Research > Carl-von-Linné-Weg 10 > 50829 Köln (Cologne) > Germany > Phone: +49-221-5062-157 > > > ____________________________________________________________________ ____ > devteam-bioc mailing list > To unsubscribe from this mailing list send a blank email to > devteam-bioc-leave@lists.fhcrc.org > You can also unsubscribe or change your personal options at > https://lists.fhcrc.org/mailman/listinfo/devteam-bioc [[alternative HTML version deleted]]
Annotation GO Organism AnnotationDbi Annotation GO Organism AnnotationDbi • 1.3k views
ADD COMMENT

Login before adding your answer.

Traffic: 702 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6