using the hyperGTest function for E. coli genes
@coghlan-avril-3810
Last seen 10.2 years ago
Dear all,

I am trying to use the hyperGTest function in the "GOstats" library to test whether a particular set of E. coli genes has any GO categories over-represented. I am having a little trouble figuring out how to do this, and would very much appreciate advice.

I have two vectors containing lists of E. coli genes: "allgenes", containing all E. coli genes, and "genes", containing a particular subset of E. coli genes that I am interested in (and which I suspect may be enriched for certain GO categories).

I understand that I can use hyperGTest by typing something like this:

> params = new("GOHyperGParams", geneIds=genes, universeGeneIds=allgenes, annotation="ECOLI", ontolog="BP", pvalueCutoff=0.001, conditional=FALSE, testDirection="over")
> mfhyper = hyperGTest(params)

However, I'm afraid this doesn't seem to work for me. I think that I have the wrong thing in annotation="ECOLI", but I am not sure how to get E. coli GO annotation. Most of the examples that I can find seem to use human GO annotation data, which is already in Bioconductor.

Do you know if I can get the E. coli GO annotation from Bioconductor somehow, and if so, will I then be able to use the hyperGTest function above?

I will be very grateful for help, I have been going around in circles a little bit!

Kind regards,
Avril

Avril Coghlan
University College Cork, Ireland
Marc Carlson ★ 7.2k
@marc-carlson-2264
Last seen 8.3 years ago
United States
Hi Avril,

We presently have two E. coli packages, but only one of them has any GO data in it (this has to do with what was available when we made these packages). If your E. coli is a different strain than the common one, you will have to find annotations that connect the gene IDs to the GO terms and use the AnnotationDbi package to create a custom package.

The E. coli package that has GO data is for the common lab strain (K12) of E. coli. This package is called "org.EcK12.eg.db". If that is the correct kind of E. coli, then you can proceed and use that as your annotation parameter.

However, you will also have to pay attention to the fact that this package uses Entrez Gene IDs as the primary gene identifiers for its mappings (that is why it says "eg" in the name). So you will also have to use Entrez Gene IDs as your identifiers when you set up the params object. Specifically, "allgenes" and "genes" will both have to be lists of Entrez Gene IDs to use this annotation package in this way.

Please let me know if you have any more questions,

Marc
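For anyone following along, the advice above might be sketched roughly as follows. This is a minimal sketch, not a tested recipe: it assumes the GOstats and org.EcK12.eg.db packages are installed, that your genes are currently identified by gene symbol, and that the package exposes a "SYMBOL" key type. The three lac operon symbols are only an illustrative placeholder for your own gene list, and the universe here is simply every Entrez Gene ID in the package. Note also that the slot is spelled "ontology", not "ontolog" as in the original call.

```r
# Sketch only: assumes GOstats and org.EcK12.eg.db are installed from Bioconductor.
library(GOstats)
library(org.EcK12.eg.db)

# Placeholder gene list (gene symbols); replace with your own genes of interest.
gene_symbols <- c("lacZ", "lacY", "lacA")

# Map symbols to Entrez Gene IDs, the primary identifier used by this package.
# Assumption: "SYMBOL" is a valid keytype here; check keytypes(org.EcK12.eg.db).
genes <- mapIds(org.EcK12.eg.db,
                keys = gene_symbols,
                column = "ENTREZID",
                keytype = "SYMBOL")
genes <- unique(na.omit(unname(genes)))  # drop symbols that did not map

# Universe: all Entrez Gene IDs known to the annotation package.
allgenes <- keys(org.EcK12.eg.db, keytype = "ENTREZID")

# Same parameters as in the question, with the K12 annotation package
# and the slot name spelled "ontology".
params <- new("GOHyperGParams",
              geneIds = genes,
              universeGeneIds = allgenes,
              annotation = "org.EcK12.eg.db",
              ontology = "BP",
              pvalueCutoff = 0.001,
              conditional = FALSE,
              testDirection = "over")

result <- hyperGTest(params)

# Table of over-represented GO terms (may be empty for a very small gene set).
summary(result)
```

If your identifiers are something other than gene symbols, the same mapIds call should work with a different keytype, provided the package carries that ID type. If it does not, you would need to map to Entrez Gene IDs by some other route before building the params object.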