Unknown gene transcripts after annotatr
1
0
Entering edit mode
Zach • 0
@75d674ce
Last seen 12 hours ago
United States

I used annotatr to annotate my methylation data and got four transcripts that I can't find any information for, no ensembl gene id translation or gene symbol translation: ENST00000602953.1 ENST00000552238.1 ENST00000451960.1 Any idea where these are coming from?

## create Granges object
regions <- GRanges(
  seqnames = sig_windows_filtered$chrom,
  ranges = IRanges(sig_windows_filtered$start, sig_windows_filtered$end),
  mcols = list(
    diff = sig_windows_filtered$diff, 
    type = sig_windows_filtered$type
  )
)

# Select annotations for intersection with regions
annots = c('hg38_cpgs', 'hg38_basicgenes', 'hg38_enhancers_fantom')

# Build the annotations (a single GRanges object)
annotations = build_annotations(genome = 'hg38', annotations = annots)

# Intersect the regions read in with the annotations
dm_annotated = annotate_regions(
  regions = regions,
  annotations = annotations,
  ignore.strand = TRUE,
  quiet = FALSE)

# Coerce to a data.frame
df_dm_annotated = data.frame(dm_annotated)
annotatr • 26 views
ADD COMMENT
0
Entering edit mode
@james-w-macdonald-5106
Last seen 12 minutes ago
United States

Both NCBI and Ensembl are constantly pruning IDs. I would assume those have all been removed. For example.

Login before adding your answer.

Traffic: 926 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6