Question

GOseq at a specific level ?

0

Entering edit mode

@amandinefournierchu-lyonfr-5921

Last seen 10.6 years ago

Dear all, I am using GOseq on a list of diffentially expressed genes and I would like to do an analysis at a specific GO level (level 4 for example) instead of the complete GO. Is there a tool in GOseq that already does it ? Or do I have to prepare the file for the gene2cat option by myself ? A somewhat similar question is : how to use the GOslim ontologies (a subset of the terms in the whole GO) in GOseq ? Thank you in advance. Best regards, Amandine ----- Amandine Fournier Lyon Neuroscience Research Center & Lyon Civil Hospital (France)

GO goseq GO goseq • 1.9k views

ADD COMMENT • link updated 11.0 years ago by Nadia Davidson ▴ 320 • written 11.1 years ago by amandine.fournier@chu-lyon.fr ▴ 80

score 0 · Answer 1 · 2014-03-25

<amandine.fournier at="" ...=""> writes: > > Dear all, > > I am using GOseq on a list of diffentially expressed genes and I > would like to do an analysis at a specific GO > level (level 4 for example) instead of the complete GO. > Is there a tool in GOseq that already does it ? Or do I have to > prepare the file for the gene2cat option by myself ? > A somewhat similar question is : how to use the GOslim ontologies > (a subset of the terms in the whole GO) in > GOseq ? > > Thank you in advance. > Best regards, > Amandine > > ----- > Amandine Fournier > Lyon Neuroscience Research Center > & Lyon Civil Hospital (France) Dear Amandine, Goseq doesn't let you select a specific GO level, but like you suggested, it's possible to do this manually with the gene2cat option. This would require the level of each GO term to be know. Unfortunately I can't point you towards a database with this information, but perhaps someone else on the bioconductor list knows. For GOslim, the situation is similar, in that you need to provide the gene2cat list yourself. Below is an example of how to create this using biomart. # First get the GOslim terms from biomart library("biomaRt") ensembl = useMart("ensembl",dataset="hsapiens_gene_ensembl") go_slim=getBM(attributes="goslim_goa_accession",mart=ensembl)[,1] # Now get the mapping between genes and all GO terms go=getgo(names(genes),"hg19","ensGene") # Filter the list for only GOslim terms go_slim2cat=lapply(go,function(x){ x[x %in% go_slim] }) # Run goseq with the slimmed list GO.wall=goseq(pwf,"hg19","ensGene",gene2cat=go_slim2cat) By the way, it came to our attention recently that goseq will sometimes rank the root GO terms like, GO:0003674 - Molecular Function, very high. This is to do with the way goseq handles genes with no GO annotation. It you find this is swamping your results you might like to try the developmental version of goseq, which gives an alternative way of handling genes without GO annotation. Cheers, Nadia.