annaffy annotation problem (hgu133plus2)
4
0
Entering edit mode
@dipl-ing-johannes-rainer-846
Last seen 10.3 years ago
hi, i am wondering how the annotation of the Affymetrix chips are done... i know how the AnnBuilder package works and i also used it, but as the starting point for any annotation is the GenBank id for which the probe set codes i would like to know from where this mapping information comes. are those packages built using the probe set to genbank mapping that can be downloaded from affymetrix, or the (sometimes) newer annotation that is available through Affymetrix NetAffx or some other source? I am really interested in this, because we got a little annotation problem with the annotation for the hgu133plus2 chip (version 1.7.0). one of our interesting genes in our study was the probe set 208078_s_at which is mapped using the hgu133plus2 package (and the annotation from Affymetrix) to NM_030751 (TCF8). we made real time PCR to confirm its regulation but this TCF8 was not regulated at all. so i went back to NetAffx tool and saw that Affymetrix gave a new annotation for this probe set, all 11 probes perfectly match with the gene NM_173354 (SNF1LK) (i wonder how Affymetrix could annotate this gene to the TCF8 gene). if the annotation packages are built using the downloadable probe set to genbank mapping files from Affymetrix (which may never updated and corrected for errors by Affymetrix) i believe that there may be more annotation errors in this package like the one we found. cheers, jo
Annotation hgu133plus2 probe annotate AnnBuilder Annotation hgu133plus2 probe annotate • 2.0k views
ADD COMMENT
0
Entering edit mode
John Zhang ★ 2.9k
@john-zhang-6
Last seen 10.3 years ago
>Date: Mon, 4 Apr 2005 09:08:13 -0400 (EDT) >From: John Zhang <jzhang@jimmy.harvard.edu> >Subject: Re: [BioC] annaffy annotation problem (hgu133plus2) >To: johannes.rainer@tugraz.at >Cc: bioconductor@stat.math.echz.ch >MIME-Version: 1.0 >Content-MD5: Fxd6ky8TXHI0BkXVe+JUCw== > > >>i am wondering how the annotation of the Affymetrix chips are done... i >>know how the AnnBuilder package works and i also used it, but as the >>starting point for any annotation is the GenBank id for which the probe >>set codes i would like to know from where this mapping information >>comes. are those packages built using the probe set to genbank mapping >>that can be downloaded from affymetrix, or the (sometimes) newer >>annotation that is available through Affymetrix NetAffx or some other >>source? > The mappings were downloaded from the Affymetrix's "SUPPORT" link. It would be very useful if you could also inform Affymetrix about this error (and any other errors). Thank you for reporting. >> >>I am really interested in this, because we got a little annotation >>problem with the annotation for the hgu133plus2 chip (version 1.7.0). >>one of our interesting genes in our study was the probe set 208078_s_at >>which is mapped using the hgu133plus2 package (and the annotation from >>Affymetrix) to NM_030751 (TCF8). we made real time PCR to confirm its >>regulation but this TCF8 was not regulated at all. so i went back to >>NetAffx tool and saw that Affymetrix gave a new annotation for this >>probe set, all 11 probes perfectly match with the gene NM_173354 >>(SNF1LK) (i wonder how Affymetrix could annotate this gene to the TCF8 >>gene). >> >>if the annotation packages are built using the downloadable probe set >>to genbank mapping files from Affymetrix (which may never updated and >>corrected for errors by Affymetrix) i believe that there may be more >>annotation errors in this package like the one we found. >> >>cheers, jo >> >>_______________________________________________ >>Bioconductor mailing list >>Bioconductor@stat.math.ethz.ch >>https://stat.ethz.ch/mailman/listinfo/bioconductor > >Jianhua Zhang >Department of Medical Oncology >Dana-Farber Cancer Institute >44 Binney Street >Boston, MA 02115-6084 Jianhua Zhang Department of Medical Oncology Dana-Farber Cancer Institute 44 Binney Street Boston, MA 02115-6084
ADD COMMENT
0
Entering edit mode
hi john, i saw that they (=Affymetrix) updated their annotation files in the "SUPPORT" link on 1/04/05, so perhaps in this new version the gene is mapped correctly (i hope so). cheers, jo Quoting John Zhang <jzhang@jimmy.harvard.edu>: > >> Date: Mon, 4 Apr 2005 09:08:13 -0400 (EDT) >> From: John Zhang <jzhang@jimmy.harvard.edu> >> Subject: Re: [BioC] annaffy annotation problem (hgu133plus2) >> To: johannes.rainer@tugraz.at >> Cc: bioconductor@stat.math.echz.ch >> MIME-Version: 1.0 >> Content-MD5: Fxd6ky8TXHI0BkXVe+JUCw== >> >> >>> i am wondering how the annotation of the Affymetrix chips are done... i >>> know how the AnnBuilder package works and i also used it, but as the >>> starting point for any annotation is the GenBank id for which the probe >>> set codes i would like to know from where this mapping information >>> comes. are those packages built using the probe set to genbank mapping >>> that can be downloaded from affymetrix, or the (sometimes) newer >>> annotation that is available through Affymetrix NetAffx or some other >>> source? >> > > The mappings were downloaded from the Affymetrix's "SUPPORT" link. It > would be > very useful if you could also inform Affymetrix about this error (and > any other > errors). > > Thank you for reporting. > >>> >>> I am really interested in this, because we got a little annotation >>> problem with the annotation for the hgu133plus2 chip (version 1.7.0). >>> one of our interesting genes in our study was the probe set 208078_s_at >>> which is mapped using the hgu133plus2 package (and the annotation from >>> Affymetrix) to NM_030751 (TCF8). we made real time PCR to confirm its >>> regulation but this TCF8 was not regulated at all. so i went back to >>> NetAffx tool and saw that Affymetrix gave a new annotation for this >>> probe set, all 11 probes perfectly match with the gene NM_173354 >>> (SNF1LK) (i wonder how Affymetrix could annotate this gene to the TCF8 >>> gene). >>> >>> if the annotation packages are built using the downloadable probe set >>> to genbank mapping files from Affymetrix (which may never updated and >>> corrected for errors by Affymetrix) i believe that there may be more >>> annotation errors in this package like the one we found. >>> >>> cheers, jo >>> >>> _______________________________________________ >>> Bioconductor mailing list >>> Bioconductor@stat.math.ethz.ch >>> https://stat.ethz.ch/mailman/listinfo/bioconductor >> >> Jianhua Zhang >> Department of Medical Oncology >> Dana-Farber Cancer Institute >> 44 Binney Street >> Boston, MA 02115-6084 > > Jianhua Zhang > Department of Medical Oncology > Dana-Farber Cancer Institute > 44 Binney Street > Boston, MA 02115-6084 > > _______________________________________________ > Bioconductor mailing list > Bioconductor@stat.math.ethz.ch > https://stat.ethz.ch/mailman/listinfo/bioconductor >
ADD REPLY
0
Entering edit mode
@dipl-ing-johannes-rainer-846
Last seen 10.3 years ago
the biomaRt package is very nice, i also downloaded it and tested it, but as i am behind a (****) http proxy i am not able to connect to the according servers at ensembl. i assume the package connects directly through a SQL connection to the database... would it be possible to use a web service? cheers, jo Quoting Steffen Durinck <sdurinck@ebi.ac.uk>: > Hi, > > You could also try using the biomaRt package (see developmental > packages). biomaRt uses direct queries to the latest release of > Ensembl to get annotation information. Ensembl does an independent > mapping of the affy probe sequences to the genome sequences. This is > what I got for your affy id: > > > library(biomaRt) > > mart<-martConnect() > - Connected to ensembl_mart_29, ensembl_mart_29, ensembl_mart_29 and > ensembl_mart_29 - > > getGene(id="208078_s_at",array="hgu133plus2",mart=mart) > An object of class "martTable" > Slot "id": > [1] "208078_s_at" > > Slot "table": > $symbol > [1] "SNF1LK" > > $description > [1] "Probable serine/threonine-protein kinase SNF1LK (EC 2.7.1.37). > [Source:Uniprot/SWISSPROT;Acc:P57059]" > > $band > [1] "q22.3" > > $chromosome > [1] "21" > > $start > [1] 43658828 > > $end > [1] 43671436 > > $martID > [1] "ENSG00000142178" > > > Cheers, > Steffen > > Dipl.-Ing. Johannes Rainer wrote: > >> hi, >> >> i am wondering how the annotation of the Affymetrix chips are >> done... i know how the AnnBuilder package works and i also used it, >> but as the starting point for any annotation is the GenBank id for >> which the probe set codes i would like to know from where this >> mapping information comes. are those packages built using the probe >> set to genbank mapping that can be downloaded from affymetrix, or >> the (sometimes) newer annotation that is available through >> Affymetrix NetAffx or some other source? >> >> I am really interested in this, because we got a little annotation >> problem with the annotation for the hgu133plus2 chip (version >> 1.7.0). one of our interesting genes in our study was the probe set >> 208078_s_at which is mapped using the hgu133plus2 package (and the >> annotation from Affymetrix) to NM_030751 (TCF8). we made real time >> PCR to confirm its regulation but this TCF8 was not regulated at >> all. so i went back to NetAffx tool and saw that Affymetrix gave a >> new annotation for this probe set, all 11 probes perfectly match >> with the gene NM_173354 (SNF1LK) (i wonder how Affymetrix could >> annotate this gene to the TCF8 gene). >> >> if the annotation packages are built using the downloadable probe >> set to genbank mapping files from Affymetrix (which may never >> updated and corrected for errors by Affymetrix) i believe that there >> may be more annotation errors in this package like the one we found. >> >> cheers, jo >> >> _______________________________________________ >> Bioconductor mailing list >> Bioconductor@stat.math.ethz.ch >> https://stat.ethz.ch/mailman/listinfo/bioconductor >> > >
ADD COMMENT
0
Entering edit mode
Hi Jo, You could do a local install of the Ensembl biomart database. Normally the latest Ensembl release is downloadable from http://www.ebi.ac.uk/BioMart/install.html but I am not able to access this right now. If you then change in biomaRt the host to your local host you should be able to use it. You need to do this in the package R code but I'll update the package later this week so you can do this from within R. best, Steffen > the biomaRt package is very nice, > i also downloaded it and tested it, but as i am behind a (****) http > proxy i am not able to connect to the according servers at ensembl. i > assume the package connects directly through a SQL connection to the > database... would it be possible to use a web service? > > cheers, jo > > Quoting Steffen Durinck <sdurinck@ebi.ac.uk>: > >> Hi, >> >> You could also try using the biomaRt package (see developmental >> packages). biomaRt uses direct queries to the latest release of >> Ensembl to get annotation information. Ensembl does an independent >> mapping of the affy probe sequences to the genome sequences. This is >> what I got for your affy id: >> >> > library(biomaRt) >> > mart<-martConnect() >> - Connected to ensembl_mart_29, ensembl_mart_29, ensembl_mart_29 and >> ensembl_mart_29 - >> > getGene(id="208078_s_at",array="hgu133plus2",mart=mart) >> An object of class "martTable" >> Slot "id": >> [1] "208078_s_at" >> >> Slot "table": >> $symbol >> [1] "SNF1LK" >> >> $description >> [1] "Probable serine/threonine-protein kinase SNF1LK (EC 2.7.1.37). >> [Source:Uniprot/SWISSPROT;Acc:P57059]" >> >> $band >> [1] "q22.3" >> >> $chromosome >> [1] "21" >> >> $start >> [1] 43658828 >> >> $end >> [1] 43671436 >> >> $martID >> [1] "ENSG00000142178" >> >> >> Cheers, >> Steffen >> >> Dipl.-Ing. Johannes Rainer wrote: >> >>> hi, >>> >>> i am wondering how the annotation of the Affymetrix chips are >>> done... i know how the AnnBuilder package works and i also used it, >>> but as the starting point for any annotation is the GenBank id for >>> which the probe set codes i would like to know from where this >>> mapping information comes. are those packages built using the probe >>> set to genbank mapping that can be downloaded from affymetrix, or >>> the (sometimes) newer annotation that is available through >>> Affymetrix NetAffx or some other source? >>> >>> I am really interested in this, because we got a little annotation >>> problem with the annotation for the hgu133plus2 chip (version >>> 1.7.0). one of our interesting genes in our study was the probe set >>> 208078_s_at which is mapped using the hgu133plus2 package (and the >>> annotation from Affymetrix) to NM_030751 (TCF8). we made real time >>> PCR to confirm its regulation but this TCF8 was not regulated at >>> all. so i went back to NetAffx tool and saw that Affymetrix gave a >>> new annotation for this probe set, all 11 probes perfectly match >>> with the gene NM_173354 (SNF1LK) (i wonder how Affymetrix could >>> annotate this gene to the TCF8 gene). >>> >>> if the annotation packages are built using the downloadable probe >>> set to genbank mapping files from Affymetrix (which may never >>> updated and corrected for errors by Affymetrix) i believe that there >>> may be more annotation errors in this package like the one we found. >>> >>> cheers, jo >>> >>> _______________________________________________ >>> Bioconductor mailing list >>> Bioconductor@stat.math.ethz.ch >>> https://stat.ethz.ch/mailman/listinfo/bioconductor >>> >> >> > > > >
ADD REPLY
0
Entering edit mode
great!!! this sounds nice, the only thing that i will have to update the local BioMart database frequently :( . Quoting sdurinck@ebi.ac.uk: > Hi Jo, > > You could do a local install of the Ensembl biomart database. Normally > the latest Ensembl release is downloadable from > http://www.ebi.ac.uk/BioMart/install.html > but I am not able to access this right now. If you then change in biomaRt > the host to your local host you should be able to use it. You need to do > this in the package R code but I'll update the package later this week so > you can do this from within R. > > best, > Steffen > >> the biomaRt package is very nice, >> i also downloaded it and tested it, but as i am behind a (****) http >> proxy i am not able to connect to the according servers at ensembl. i >> assume the package connects directly through a SQL connection to the >> database... would it be possible to use a web service? >> >> cheers, jo >> >> Quoting Steffen Durinck <sdurinck@ebi.ac.uk>: >> >>> Hi, >>> >>> You could also try using the biomaRt package (see developmental >>> packages). biomaRt uses direct queries to the latest release of >>> Ensembl to get annotation information. Ensembl does an independent >>> mapping of the affy probe sequences to the genome sequences. This is >>> what I got for your affy id: >>> >>> > library(biomaRt) >>> > mart<-martConnect() >>> - Connected to ensembl_mart_29, ensembl_mart_29, ensembl_mart_29 and >>> ensembl_mart_29 - >>> > getGene(id="208078_s_at",array="hgu133plus2",mart=mart) >>> An object of class "martTable" >>> Slot "id": >>> [1] "208078_s_at" >>> >>> Slot "table": >>> $symbol >>> [1] "SNF1LK" >>> >>> $description >>> [1] "Probable serine/threonine-protein kinase SNF1LK (EC 2.7.1.37). >>> [Source:Uniprot/SWISSPROT;Acc:P57059]" >>> >>> $band >>> [1] "q22.3" >>> >>> $chromosome >>> [1] "21" >>> >>> $start >>> [1] 43658828 >>> >>> $end >>> [1] 43671436 >>> >>> $martID >>> [1] "ENSG00000142178" >>> >>> >>> Cheers, >>> Steffen >>> >>> Dipl.-Ing. Johannes Rainer wrote: >>> >>>> hi, >>>> >>>> i am wondering how the annotation of the Affymetrix chips are >>>> done... i know how the AnnBuilder package works and i also used it, >>>> but as the starting point for any annotation is the GenBank id for >>>> which the probe set codes i would like to know from where this >>>> mapping information comes. are those packages built using the probe >>>> set to genbank mapping that can be downloaded from affymetrix, or >>>> the (sometimes) newer annotation that is available through >>>> Affymetrix NetAffx or some other source? >>>> >>>> I am really interested in this, because we got a little annotation >>>> problem with the annotation for the hgu133plus2 chip (version >>>> 1.7.0). one of our interesting genes in our study was the probe set >>>> 208078_s_at which is mapped using the hgu133plus2 package (and the >>>> annotation from Affymetrix) to NM_030751 (TCF8). we made real time >>>> PCR to confirm its regulation but this TCF8 was not regulated at >>>> all. so i went back to NetAffx tool and saw that Affymetrix gave a >>>> new annotation for this probe set, all 11 probes perfectly match >>>> with the gene NM_173354 (SNF1LK) (i wonder how Affymetrix could >>>> annotate this gene to the TCF8 gene). >>>> >>>> if the annotation packages are built using the downloadable probe >>>> set to genbank mapping files from Affymetrix (which may never >>>> updated and corrected for errors by Affymetrix) i believe that there >>>> may be more annotation errors in this package like the one we found. >>>> >>>> cheers, jo >>>> >>>> _______________________________________________ >>>> Bioconductor mailing list >>>> Bioconductor@stat.math.ethz.ch >>>> https://stat.ethz.ch/mailman/listinfo/bioconductor >>>> >>> >>> >> >> >> >> > >
ADD REPLY
0
Entering edit mode
Aedin Culhane ▴ 310
@aedin-culhane-500
Last seen 10.3 years ago
Hi Johannes Maybe have a look at the EnsEMBL, EnsMart annotation. It permits visualization of the probe matches to the sequence which can be handy. I think your probe matches to: http://www.ensembl.org/Homo_sapiens/geneview?gene=ENSG00000142178 You'll prob need to follow the Affy links out from there, Hope this is useful Aedin > >Message: 9 >Date: Mon, 04 Apr 2005 10:28:00 +0200 >From: "Dipl.-Ing. Johannes Rainer" <johannes.rainer@tugraz.at> >Subject: [BioC] annaffy annotation problem (hgu133plus2) >To: "bioconductor@stat.math.ethz.ch" <bioconductor@stat.math.ethz.ch> >Message-ID: <20050404102800.4zxbhgwaqoccko8o@webmail.tugraz.at> >Content-Type: text/plain; charset=ISO-8859-1; format="flowed" > >hi, > >i am wondering how the annotation of the Affymetrix chips are done... i >know how the AnnBuilder package works and i also used it, but as the >starting point for any annotation is the GenBank id for which the probe >set codes i would like to know from where this mapping information >comes. are those packages built using the probe set to genbank mapping >that can be downloaded from affymetrix, or the (sometimes) newer >annotation that is available through Affymetrix NetAffx or some other >source? > >I am really interested in this, because we got a little annotation >problem with the annotation for the hgu133plus2 chip (version 1.7.0). >one of our interesting genes in our study was the probe set 208078_s_at >which is mapped using the hgu133plus2 package (and the annotation from >Affymetrix) to NM_030751 (TCF8). we made real time PCR to confirm its >regulation but this TCF8 was not regulated at all. so i went back to >NetAffx tool and saw that Affymetrix gave a new annotation for this >probe set, all 11 probes perfectly match with the gene NM_173354 >(SNF1LK) (i wonder how Affymetrix could annotate this gene to the TCF8 >gene). > >if the annotation packages are built using the downloadable probe set >to genbank mapping files from Affymetrix (which may never updated and >corrected for errors by Affymetrix) i believe that there may be more >annotation errors in this package like the one we found. > >cheers, jo > > > >------------------------------ > >_______________________________________________ >Bioconductor mailing list >Bioconductor@stat.math.ethz.ch >https://stat.ethz.ch/mailman/listinfo/bioconductor > > >End of Bioconductor Digest, Vol 26, Issue 4 >******************************************* >
ADD COMMENT
0
Entering edit mode
Glynn, Earl ▴ 170
@glynn-earl-952
Last seen 10.3 years ago
> Date: Mon, 04 Apr 2005 10:28:00 +0200 > From: "Dipl.-Ing. Johannes Rainer" <johannes.rainer@tugraz.at> > Subject: [BioC] annaffy annotation problem (hgu133plus2) > i am wondering how the annotation of the Affymetrix chips are done... i > know how the AnnBuilder package works and i also used it, but as the > starting point for any annotation is the GenBank id for which the probe > set codes i would like to know from where this mapping information > comes. are those packages built using the probe set to genbank mapping > that can be downloaded from affymetrix, or the (sometimes) newer > annotation that is available through Affymetrix NetAffx or some other > source? According to this page http://www.affymetrix.com/support/technical/byproduct.affx?product=hg- u1 33-plus Affymetrix updated its HG-U133_Plus_2 Annotations files a few days ago. The CSV file version of the annotations was updated on 1 April 2005. I'm not sure why Affy updated the BLASTP Annotation file on 1 April 2005 but the BLASTX file supposedly hasn't been updated since 23 June 2004. The CSV annotations file gives the known mappings to various other data sources for all the probesets on a GeneChip. Annotation information on some of the probesets changes during every update from Affy. Sometimes probes that were mapped to known genes no longer have that mapping, and with new information (presumably) some unannotated probesets are given new information. I would assume that the NetAffx information would be the same as these annotations files? Does anyone know if NetAffx information is continuously updated and these annotations files are only updated periodically, or should the information be the same from either Affy source? I have worked mostly with various mouse GeneChips and Affy seems to update them about every quarter, but I cannot get anyone at Affy to state what their update cycle really is. I have not used annaffy much yet, but I'm curious how soon after the updates appear on the Affy web site does this information show up in the annaffy package? For the last year or so I've been loading these Affy CSV annotation files for certain mouse chips into an Access database and trying to track what changes Affy is making to their files: http://research.stowers- institute.org/efg/ScientificSoftware/Application s/Affy/Annotations/index.htm The contents of these files seems to have stabilized after changing considerably between Affy updates a year ago. However, in the most recent update (last week), the "LocusLink" field was renamed to "Entrez Gene" in MOE430A but was still "Locuslink" in Mouse430_2. It is frustrating that Affy doesn't explain changes it makes to these files, and one must reverse engineer the contents of the files and compare old with new for every update. Affy doesn't give much explanation of the various annotation fields on their page, but the information there is useful: Probe Set Data in Tabular Format https://www.affymetrix.com/support/technical/manual/taf_manual.affx Exactly how Affy connects their probesets to the other data sources via their Annotations file is still a bit of a mystery to me. I'm curious where Affy's "Annotation Date" is in the data provided by annaffy? efg -- Earl F. Glynn Scientific Programmer Stowers Institute for Medical Research [[alternative HTML version deleted]]
ADD COMMENT

Login before adding your answer.

Traffic: 523 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6