From the 450k manifest data that I retrieve using getAnnotation() in minfi, how do I interpret variables UCSC_RefGene_Name and UCSC_RefGene_Group when there are several genes or groups listed (separated by a semicolon) for the same probe (that is, in the same row)?
Example:
For probe X, I have:
UCSC_RefGene_Name: Gene1;Gene2;Gene1
UCSC_RefGene_Group: TSS200;Body;1stExon
My guess for this probe would be that it maps to three distinct transcripts: 1. In TSS200 of first Gene1 transcript; 2. In Body of Gene2 transcript; 3. in 1stExon of second Gene1 transcript. Is this correct?
This annotation is provided by Illumina and is unchanged. I don't use it
myself, and have no comments on it.
Best,
Kasper
On Fri, Oct 2, 2015 at 4:31 PM, maden.sean [bioc] <noreply@bioconductor.org>
wrote:
> Activity on a post you are following on support.bioconductor.org
>
> User maden.sean <https: support.bioconductor.org="" u="" 8348=""/> wrote Question:
> Illumina 450k Manifest: Probes Mapping to Multiple RefGene Groups and Names
> <https: support.bioconductor.org="" p="" 72915=""/>:
>
> From the 450k manifest data that I retrieve using getAnnotation() in
> minfi, how do I interpret variables UCSC_RefGene_Name and
> UCSC_RefGene_Group when there are several genes or groups listed (separated
> by a semicolon) for the same probe (that is, in the same row)?
>
> Example:
>
> For probe X, I have:
>
> UCSC_RefGene_Name: Gene1;Gene2;Gene1
>
> UCSC_RefGene_Group: TSS200;Body;1stExon
>
> My guess for this probe would be that it maps to three distinct
> transcripts: 1. In TSS200 of first Gene1 transcript; 2. In Body of Gene2
> transcript; 3. in 1stExon of second Gene1 transcript. Is this correct?
>
> Best,
>
> Sean
>
>
>
>
>
>
>
> ------------------------------
>
> Post tags: illuminahumanmethylation450kanno.ilmn12.hg19, minfi,
> illuminahumanmethylation450k.db, methylation
>
> You may reply via email or visit Illumina 450k Manifest: Probes Mapping to Multiple RefGene Groups and Names
>
The manifest has a description of each entry available here. It states that different spliceforms are listed, and the Groups are listed in that same order. I would say that your interpretation is correct